Through analyzing the MapReduce scheduling mechanism, this project illustrates the reasons of system slot resource wasting which results in reduce tasks waiting around, and it proposes the development of a method detailing the start times of reduce tasks dynamically according to each job context, including the task completion time and the size of map output. There is no doubt that the use of this method will decrease the reduce completion time and system average response time in Hadoop platforms.
more>>