1
Posted on 10:39 by Johan Oskarsson and filed under , ,

On the 6th of October there is a presentation at Skillsmatter in London about "Hadoop and Pig" by Ricardo Varela. This is not an event organized by HUGUK but it certainly looks interesting. More details.
Apache Hadoop is an open source Java software framework for running data-intensive applications on large clusters of commodity hardware, which is heavily invested in and used by Yahoo. Pig is a platform on top of Hadoop that includes a high-level language for expressing data analysis programs in a simple manner. Hadoop and Pig are used in Yahoo for different tasks, from helping create the various indexes for web search to multi-language entity recognition, handling levels of petabytes of info and tens of thousands of jobs per week.

In this talk we will offer an introduction to the MapReduce model, Hadoop and Pig and how you can leverage them to process big data with a small cost.
1
Response to ... Hadoop and Pig
Ricardo Varela said... 20 October 2009 14:26

thanks to everybody who dropped by and to Johan for the posting! :)

Slides and video are available at http://skillsmatter.com/podcast/os-mobile-server/introduction-to-data-processing-with-hadoop-and-pig and, as we commented, we want to schedule some followup workshop at some point so will keep everybody up to date (via twitter and ydn blog)

Whenever we have a next huguk meetup I'll try to make a reminder too!

Post a Comment