Experience on working with nutch

This blog is about collection informations on working with nutch during our master project.

Monday, April 18, 2011

Interesting sites related to Nutch

Crawl Script

How to implement re-crawling

Nutch Homepage

How Nutch maps to "Map and Reduce"

What is MapReduce?

http://www.slideshare.net/abial/nutch-webscale-search-engine-toolkit

Talks on Search, Lucene and Performance

Slides on Nutch

Paper on Nutch performance and use cases

Tutorial on Nutch

How to setup an Hadoop cluster?

Post to plugin for a self-made Language Detection

Plugin.xml showing all extension points

Scaling of Nutch and Lucene

Talk on Hadoop and fellow Apache projects

Nice interview with Doug Cutting

Work in Progress...
Posted by Unknown at 10:16 AM
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest

No comments:

Post a Comment

Newer Post Home
Subscribe to: Post Comments (Atom)

  • Lucid Imagination
  • Apache Hadoop
  • Apache Lucene
  • Apache Nutch
  • Apache Mahout

Blog Archive

  • ►  2012 (1)
    • ►  September (1)
  • ▼  2011 (4)
    • ►  May (1)
    • ▼  April (3)
      • Problem with Luke and Nutch1.2
      • Gettin Nutch running with windows
      • Interesting sites related to Nutch
Awesome Inc. theme. Powered by Blogger.