Introduction to Hadoop

Apache Hadoop is a free distributed software framework developed in Java , C++ and bash scripts that supports data intensive distributed applications. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google‘s MapReduce and Google File System (GFS) papers.

Hadoop is a top level Apache project, being built and used by a community of contributors from all over the world Yahoo! has been the largest contributor to the project and uses Hadoop extensively in its web search and advertising businesses.