How to Install Hortonworks Sandbox Using Docker
Ops note · The "Hortonworks Sandbox" is a customized Hadoop VM that you can install using any of the virtualization tools like…
Data note · Apache Phoenix is another query engine similar to Apache Drill, but unlike Drill, which can connect to any database, it can only…
Data note · Apache Drill can sit on top of any data source, be it relational, non-relational, S3, JSON, etc. It presents an…
Data note · The previous session on MongoDB covered big data integration with MongoDB and how to use Spark and Python to access data…
Data note · Why MongoDB? Let's evaluate MongoDB against the CAP theorem to answer 'Why MongoDB'. Partition tolerance is a MUST in big data scenarios as…
Data note · Why Cassandra: Before we discuss Cassandra, we also have to discuss the CAP theorem. As per CAP (Consistency, Availability…
Data note · Interacting with HDFS using HBase and Python was very powerful, but it was also very engaging, as we had to do a…
Data note · What is HBase: HBase is a NoSQL/non-relational answer to your big data queries, where relational databases can't be as scalable as non-relational…
Data note · The distributed Hadoop file system can retrieve data not only from flat files but also from structured as well as unstructured sources.…
Editorial reprint · As Spark 2 supports Datasets, which are an extension of RDDs, we can use these Datasets to model into a Machine Learning…