Hadoop 2.0.0-mr1-cdh4.7.1 API

Core
org.apache.hadoop.filecache  
org.apache.hadoop.mapred A software framework for easily writing applications that process vast amounts of data (multi-terabyte data sets) in parallel on large clusters (thousands of nodes) of commodity hardware in a reliable, fault-tolerant manner.
org.apache.hadoop.mapred.jobcontrol Utilities for managing dependent jobs.
org.apache.hadoop.mapred.join Given a set of sorted datasets keyed with the same class and yielding equal partitions, it is possible to effect a join of those datasets prior to the map.
org.apache.hadoop.mapred.lib Library of generally useful mappers, reducers, and partitioners.
org.apache.hadoop.mapred.lib.aggregate Classes for performing various counting and aggregations.
org.apache.hadoop.mapred.lib.db org.apache.hadoop.mapred.lib.db Package
org.apache.hadoop.mapred.pipes Hadoop Pipes allows C++ code to use Hadoop DFS and map/reduce.
org.apache.hadoop.mapred.tools  
org.apache.hadoop.mapreduce  
org.apache.hadoop.mapreduce.counters This package contains the implementations of different types of map-reduce counters.
org.apache.hadoop.mapreduce.lib.db org.apache.hadoop.mapreduce.lib.db Package
org.apache.hadoop.mapreduce.lib.fieldsel  
org.apache.hadoop.mapreduce.lib.input  
org.apache.hadoop.mapreduce.lib.jobcontrol Utilities for managing dependent jobs.
org.apache.hadoop.mapreduce.lib.map  
org.apache.hadoop.mapreduce.lib.output  
org.apache.hadoop.mapreduce.lib.partition  
org.apache.hadoop.mapreduce.lib.reduce  
org.apache.hadoop.mapreduce.security  
org.apache.hadoop.mapreduce.security.token  
org.apache.hadoop.mapreduce.security.token.delegation  
org.apache.hadoop.mapreduce.server.jobtracker  
org.apache.hadoop.mapreduce.server.tasktracker  
org.apache.hadoop.mapreduce.server.tasktracker.userlogs  
org.apache.hadoop.mapreduce.split  
org.apache.hadoop.mapreduce.task  
org.apache.hadoop.mapreduce.util  
org.apache.hadoop.mr1security  
org.apache.hadoop.mr1security.authorize  
org.apache.hadoop.mr1tools  
org.apache.hadoop.typedbytes Typed bytes are sequences of bytes in which the first byte is a type code.
org.apache.hadoop.util  
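
The org.apache.hadoop.typedbytes entry above describes typed bytes as byte sequences whose first byte is a type code. The following is a minimal, self-contained sketch of that idea in plain Java. It does not use the Hadoop classes, and the class name, the method names, the two type codes, and the payload layout (a 4-byte big-endian int, or a 4-byte length followed by UTF-8 bytes) are all assumptions made for illustration. Check the package documentation for the actual codes and encodings.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;

public class TypedBytesSketch {
    // Hypothetical type codes, chosen for this sketch only.
    static final int TYPE_INT = 3;
    static final int TYPE_STRING = 7;

    // Encode an int as: [type code][4-byte big-endian value].
    static byte[] encodeInt(int value) throws IOException {
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(buf);
        out.writeByte(TYPE_INT);
        out.writeInt(value);
        out.flush();
        return buf.toByteArray();
    }

    // Encode a string as: [type code][4-byte length][UTF-8 bytes].
    static byte[] encodeString(String value) throws IOException {
        byte[] utf8 = value.getBytes(StandardCharsets.UTF_8);
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        DataOutputStream out = new DataOutputStream(buf);
        out.writeByte(TYPE_STRING);
        out.writeInt(utf8.length);
        out.write(utf8);
        out.flush();
        return buf.toByteArray();
    }

    // Read the leading type code, then decode the payload accordingly.
    static Object decode(byte[] bytes) throws IOException {
        DataInputStream in = new DataInputStream(new ByteArrayInputStream(bytes));
        int code = in.readUnsignedByte();
        switch (code) {
            case TYPE_INT:
                return in.readInt();
            case TYPE_STRING:
                byte[] utf8 = new byte[in.readInt()];
                in.readFully(utf8);
                return new String(utf8, StandardCharsets.UTF_8);
            default:
                throw new IOException("Unknown type code: " + code);
        }
    }

    public static void main(String[] args) throws IOException {
        System.out.println(decode(encodeInt(42)));
        System.out.println(decode(encodeString("hadoop")));
    }
}
```

Because the type travels with the value, a reader can decode a stream of mixed values without an external schema, which is what makes such a format convenient for passing records between processes.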

 

Examples
org.apache.hadoop.examples Hadoop example code.
org.apache.hadoop.examples.dancing This package is a distributed implementation of Knuth's dancing links algorithm that can run under Hadoop.
org.apache.hadoop.examples.terasort This package consists of 3 map/reduce applications for Hadoop to compete in the annual terabyte sort competition.

 

contrib: Streaming
org.apache.hadoop.streaming Hadoop Streaming is a utility which allows users to create and run Map-Reduce jobs with any executables.
org.apache.hadoop.streaming.io  

 

contrib: DataJoin
org.apache.hadoop.contrib.utils.join  

 

contrib: Index
org.apache.hadoop.contrib.index.example  
org.apache.hadoop.contrib.index.lucene  
org.apache.hadoop.contrib.index.main  
org.apache.hadoop.contrib.index.mapred  

 

contrib: FailMon
org.apache.hadoop.contrib.failmon  

 



Copyright © 2009 The Apache Software Foundation