Apache Hadoop is a framework for running applications on large clusters built of commodity hardware. The Hadoop framework transparently provides applications with both reliability and data motion. Hadoop implements a computational paradigm named MapReduce, in which the application is divided into many small fragments of work, each of which may be executed or re-executed on any node in the cluster. In addition, it provides a distributed file system (HDFS) that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both MapReduce and the Hadoop Distributed File System are designed so that node failures are handled automatically by the framework.
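
The MapReduce paradigm described above can be illustrated with a minimal single-process sketch in plain Python. This is not Hadoop's API: the names map_fragment, reduce_counts, run_with_retry and word_count are hypothetical, and the retry loop only stands in for the framework re-executing a fragment of work after a node failure.

```python
from collections import defaultdict


def map_fragment(fragment):
    """Map step: emit a (word, 1) pair for each word in one input fragment."""
    return [(word.lower(), 1) for word in fragment.split()]


def reduce_counts(word, counts):
    """Reduce step: sum all counts emitted for one word."""
    return word, sum(counts)


def run_with_retry(task, arg, max_attempts=3):
    """Re-run a task that raises RuntimeError (assumed stand-in for a node failure)."""
    for attempt in range(1, max_attempts + 1):
        try:
            return task(arg)
        except RuntimeError:
            if attempt == max_attempts:
                raise  # give up after the last allowed attempt


def word_count(fragments):
    """Count words across fragments using map, group-by-key, and reduce."""
    grouped = defaultdict(list)
    # Map phase: every fragment is an independent unit of work.
    for fragment in fragments:
        for word, count in run_with_retry(map_fragment, fragment):
            grouped[word].append(count)  # group emitted values by key
    # Reduce phase: one reduce call per distinct key.
    return dict(reduce_counts(word, counts) for word, counts in grouped.items())
```

Because each fragment is mapped independently and the map step has no side effects, a failed fragment can simply be run again, which is the property that lets a real cluster reschedule work on any node.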



More content is available on the deprecated old Hadoop wiki. New content will be added here on Confluence. We are also migrating existing content from the old wiki to Confluence.


( has been moved to HADOOP2.


Hadoop Roadmap 


See the Apache Hadoop Releases page for prior releases.