Hadoop
From Wikipedia, the free encyclopedia
| Lucene Hadoop | |
| Developer: | Apache Software Foundation |
|---|---|
| Latest release: | 0.12.1 / March 17, 2007 |
| OS: | Cross-platform |
| Use: | Search Engine |
| License: | Apache 2.0 Licence |
| Website: | lucene.apache.org/hadoop/ |
Hadoop is a Free Java software framework that supports distributed applications running on large clusters of commodity computers that process huge amounts of data. It is an Apache Lucene sub-project and was originally developed to support distribution for Nutch.[1] Hadoop consists of a distributed filesystem reminiscent of GoogleFS named the "Hadoop Distributed File System" (HDFS) and a MapReduce implementation.[2]
Hadoop was named after its creator's child's stuffed elephant.
- ^ "Hadoop is a Lucene sub-project that contains the distributed computing platform that was formerly a part of Nutch. This includes the Hadoop Distributed Filesystem (HDFS) and an implementation of map/reduce." About Hadoop
- ^ "Hadoop is a framework for running applications on large clusters of commodity hardware. The Hadoop framework transparently provides applications both reliability and data motion. Hadoop implements a computational paradigm named map/reduce, where the application is divided into many small fragments of work, each of which may be executed or reexecuted on any node in the cluster. In addition, it provides a distributed file system that stores data on the compute nodes, providing very high aggregate bandwidth across the cluster. Both map/reduce and the distributed file system are designed so that node failures are automatically handled by the framework." About Hadoop
- Hadoop website
- Mention of Nutch and Hadoop in an article about Google
- IBM MapReduce Tools for Eclipse
|
|
|
|---|---|
| Top level Projects | Apache HTTP Server • ActiveMQ • Ant • APR • Beehive • Cayenne • Cocoon • Directory • Excalibur • Forrest • Geronimo • Gump • iBATIS • Jackrabbit • James • Lenya • Maven • Mina • MyFaces • OFBiz • mod_perl • SpamAssassin • Struts • Tcl • Tomcat • Axis • Axis2 • WSIF • XMLBeans • Tapestry • HiveMind • WebWork 2 • Harmony • Velocity • Santuario • Shale |
| Apache Jakarta Project | BCEL • BSF • Cactus • Commons • ECS • HttpComponents • JCS • JMeter • ORO • POI • Regexp • Slide • Taglibs • Turbine |
| Apache DB | Derby • Torque • DdlUtils • OJB • JDO |
| Apache Portals | Jetspeed 1 • Jetspeed 2 • Graffito • Pluto • WSRP4J |
| Apache Lucene | Lucene Java • Nutch • Hadoop • Lucene4c • Lucy |
| Apache XML | AxKit • Xalan • Xerces |
| XML Graphics | Batik • FOP |
| Apache Logging | Log4j • Log4Cxx • Log4Perl • Log4PLSQL |
| Apache Incubator | XAP • River • OpenEJB • OpenJPA • ServiceMix • Wicket • Graffito • Tuscany • Log4Net • Roller • Felix • Abdera • CeltiXfire • FtpServer • Heraldry • Ivy • JuiCE • Kabuki • Lokahi • Lucene.Net • mod_ftp • NMaven • Ode • stdcxx • Woden • WSRP4J • Yoko • Log4PHP • WADI • Qpid • stdcxx • TripleSoup • UIMA • wadi |
| License: Apache License • Website: apache.org | |