Apache Kafka – A Tool for Streaming Data into the Cluster
Data visualization on a displayPhoto: Luke Chesser / Unsplash · Royalty-free Big Data
Mohd Naeem

Apache Kafka – A Tool for Streaming Data into the Cluster

Data note ·   What is Streaming?  So what if you have to capture live data or logs from a web servers, you have data…

Read Note
Apache Flume – Hadoop Specific Streaming Tool
Executive KPI dashboard on monitorsPhoto: Carlos Muza / Unsplash · Royalty-free Big Data
Mohd Naeem

Apache Flume – Hadoop Specific Streaming Tool

Data note · What is Apache Flume: As we know that Apache Kafka is a generic streaming tool which can handle not only Hadoop specific…

Read Note
Python – A Refresher
Operations team reviewing cloud metricsPhoto: Luke Chesser / Unsplash · Royalty-free Python
Mohd Naeem

Python – A Refresher

Ops note · Python – Part 1 of 5 What is Python and Why: open source programming language interpreted at run-time(unlike compiled Java, C#, like…

Read Note
Linux Commands – A Refresher
Laptop with component code openPhoto: Christina @ wocintechchat.com / Unsplash · Royalty-free Linux
Mohd Naeem

Linux Commands – A Refresher

Ops note · Linux Commands - Part 1 of 4 What is Linux - an operating system open-source software consists of a core, called as…

Read Note
How to Install Hortonworks Sandbox Using Docker
Stand-up meeting in a tech officePhoto: Lucas / Unsplash · Royalty-free Big Data
Mohd Naeem

How to Install Hortonworks Sandbox Using Docker

Ops note · As we know that "Hortonworks Sandbox" is a customized Hadoop VM, which you can install using any of the virtualization tools like…

Read Note