Other Hadoop Technologies
Data note · The list is long, but a few are worth mentioning: Impala: Cloudera's alternative to Hortonworks' Hive; faster than Hive…
Data note · Apache NiFi: a data streaming and transformation tool. It has a nice web-based UI where we can configure the…
Data note · Why Flink: more scalable than Storm, up to thousands of nodes (massive scale); more fault tolerant than Storm; maintains "state snapshots"…
Data note · Apache Storm vs. Apache Spark Streaming: Apache Storm is real time, down to a sub-second level, and is event based. Apache Spark Streaming…
Data note · Why process big data in real time? Big data is really huge, so if we still use batch processing (e.g. running…
Data note · What is streaming? What if you have to capture live data or logs from web servers? You have data…
Data note · What is Apache Flume? As we know, Apache Kafka is a generic streaming tool that can handle not only Hadoop-specific…
Data note · What is Apache Zeppelin? A notebook interface to the core as well as custom big data technologies; an analysis and visualization tool…
Data note · There are quite a few important under-the-hood players in a Hadoop system - those that manage the cluster - they…
Data note · What is Presto? It has a SQL interface for queries and connects to multiple databases, including Cassandra (which Drill can't). A big plus - OLTP…