When traditional storage systems and essential programming found it challenging to handle huge volume of data. Google found GFS (Google File system), MapReduce and Big table, that handled hardware, storage and parallel programming challenges.

The computational challenges and solutions in GFS, lead to HDFS and MapReduce as a framework at Yahoo, which was then donated to Apache Software Foundation. When MapReduce, HDFS with Hadoop Common and Hadoop YARN became Hadoop, it had widespread adoption outside Yahoo by large open source community. …

Vinodh Rhaj

Analytics Professional practicing data solutions of all shapes and sizes

Get the Medium app