Hadoop, one of the most well-known and widely used open source distributed framework used for large scale data processing. It is based on five main building blocks which are MapReduce Framework, YARN infrastructure, Storage, HDFS Federation, and Cluster. This is …