View Source

This is the home of the Hadoop space.

Apache Hadoop is a framework for running applications on large cluster built of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named Map/Reduce, where the application is divided into many small fragments of work, each of which may be executed or re-executed on any node in the cluster. In addition, it provides a distributed file system (HDFS) that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both MapReduce and the Hadoop Distributed File System are designed so that node failures are automatically handled by the framework.

Please note that most of the information is available from the old hadoop wiki. As the old wiki is deprecated inside the Apache Software Foundation, new pages are created here. Sooner or later the valid and up-to-date information will be migrated to this wiki.

Roadmap

Hadoop Roadmap

Releases

See the Apache Hadoop Releases page for prior releases.