Posted on 23-02-2008
Filed Under (documentation) by Linux Poweruser Programmer

Seattle Conference on Scalability: MapReduce Used on Large Data Sets
Google engEDU
30 min – Jun 23, 2007

Google Tech Talks
June 23, 2007

2007 Google Seattle Conference on Scalability:
Using MapReduce on Large Geographic Datasets
Speaker: Barry Brumitt, Google, Inc.

MapReduce is a model and library designed to
simplify distributed processing of huge datasets on large clusters of
computers. This is achieved by providing a general mechanism
which largely relieves the programmer from having to handle
challenging distributed computing problems such as data
distribution, process coordination, fault tolerance, and scaling. While
working on Google maps, I’ve used MapReduce extensively to
process and transform datasets which describe the earth’s
geography. In this talk, I’ll introduce MapReduce, demonstrating its
broad applicability through example problems ranging from basic
data transformation to complex graph processing, all the in the
context of geographic data.
video
http://video.google.com/videoplay?docid=741403180270990805


Sphere: Related Content

Tags: , , , , , , , ,

Related posts

(0) Comments    Read More   
Post a Comment
Name:
Email:
Website:
Comments: