In the early 2000s, two important people were associated with a project called NUTCH, which aimed to build an open source, scalable and robust Internet search engine. One was Doug Cutting and the other was Mike Cafarella, who was a graduate student at the University of Washington.
While they were working on the project, Google published its white paper on its distributed file system, the Google File System (GFS), in 2003, and its MapReduce paper came out in 2004. Back then Google's MapReduce was implemented in C++, and Google was using both systems internally for its search engine. NUTCH had a lot in common with what Google described in these papers.
In 2006 Doug Cutting joined Yahoo and, heavily inspired by Google's research papers, created an open source framework called Hadoop. In fact, Hadoop was named after his son's toy elephant. Hadoop traces its roots back to NUTCH, because Doug Cutting was one of the foremost people working on NUTCH and the ideas they tried to incorporate into NUTCH were very similar to those in the Google research papers.
So they kept the original ideas of NUTCH and made a few changes based on Google's research papers on its distributed file system and on MapReduce. The main difference was that Hadoop MapReduce was implemented in Java. Hadoop then went on to become a full-fledged open source Apache project, and a stable version of Hadoop was first used at Yahoo in 2008. So it is a pretty old technology, but right now it is the buzzword in the industry.