Posted on 10:18 by Johan Oskarsson and filed under ,

We've added the second and final presentation for the monday meetup. Tim Hawkings from Contextured will be telling us about Apache Nutch, the project Hadoop was a part of in the early days.

Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc.

Please sign up now if you intend to come. Hope to see you all there!

Thanks to Dimi for helping me set up this presentation.