A Community for Hadoop Users
Tags: hadoop
Permalink Reply by Jason Venner on January 26, 2010 at 8:03pm
Permalink Reply by Adeel on January 27, 2010 at 6:38am
A reducer cannot start until all of the data that will be its input is fully ordered.
The ordering cannot complete until every map task has finished, because any map may emit data destined for any reducer (reduce task).
The reduce task itself is often launched at job start, but the first call to the user's reduce method happens only after all of the map tasks have completed.
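To make the point concrete, here is a minimal sketch in plain Python, not Hadoop code. The names `partition`, `run_map_task` and `reduce_input` are hypothetical, and the toy hash only stands in for a real partitioner. It shows that the value list for a key stays incomplete until the last map has delivered its output:

```python
import heapq
from collections import defaultdict


def partition(key, num_reducers):
    # Hypothetical stand-in for a hash partitioner: deterministic, so the
    # same key is always routed to the same reducer.
    return sum(key.encode("utf-8")) % num_reducers


def run_map_task(records, num_reducers):
    """Partition one map task's (key, value) output and sort each partition."""
    buckets = defaultdict(list)
    for key, value in records:
        buckets[partition(key, num_reducers)].append((key, value))
    return {r: sorted(pairs) for r, pairs in buckets.items()}


def reduce_input(finished_map_outputs, reducer_index):
    """Merge this reducer's sorted segment from each finished map, grouped by key."""
    segments = [out.get(reducer_index, []) for out in finished_map_outputs]
    groups = defaultdict(list)
    for key, value in heapq.merge(*segments):
        groups[key].append(value)
    return dict(groups)


NUM_REDUCERS = 2
map1 = run_map_task([("apple", 1), ("pear", 2)], NUM_REDUCERS)
map2 = run_map_task([("apple", 5)], NUM_REDUCERS)
r = partition("apple", NUM_REDUCERS)  # the reducer that owns "apple"

partial = reduce_input([map1], r)         # only the first map has finished
complete = reduce_input([map1, map2], r)  # every map has finished
print(partial["apple"], complete["apple"])  # prints: [1] [1, 5]
```

If the reduce method had been called on `partial`, it would have seen only one of the two values for "apple", which is why the framework holds the first call back until the last map is done.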
Permalink Reply by Jason Venner on January 27, 2010 at 7:17pm
Permalink Reply by Prasen on February 21, 2010 at 10:33pm
For all intents and purposes, your reduce doesn't start until the reduce % hits roughly 66%;
the phases that run before that (copying and sorting the map output) are preparing the data for your reduce tasks.
The job output presents this information in a confusing way.
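As a rough aid for reading the job output, the sketch below maps a reported reduce percentage to the phase a reduce task is probably in. It assumes the classic MapReduce convention in which copy, sort and reduce each account for one third of reported progress; that split is an assumption about the framework version, not something stated in this thread, and `reduce_phase` is a hypothetical helper.

```python
def reduce_phase(percent):
    """Guess which phase a reduce task is in from its reported progress (0-100).

    Assumes progress is split evenly into thirds: copy, sort, reduce.
    """
    if not 0 <= percent <= 100:
        raise ValueError("percent must be between 0 and 100")
    if percent < 100 / 3:
        return "copy"    # fetching map output segments
    if percent < 200 / 3:
        return "sort"    # merging the fetched segments into key order
    return "reduce"      # the user's reduce method is being called


for p in (10, 50, 67):
    print(p, reduce_phase(p))
```

Under that assumption, a job showing "reduce 50%" has not yet run any of the user's reduce code.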
Permalink Reply by dachuan huang on March 6, 2010 at 9:57pm
© 2012 Created by Jason Venner.