[ links ]
- Information Retrieval in Wikipedia.
- The Lucene project
- C.D. Manning, P. Raghavan, H. Schütze: Introduction to Information Retrieval, online version. There's an associated set of slides.
- C.J. van Rijsbergen, Information retrieval Online book. Old (1979), but most basic concepts are still relevant
- Implementations of stemmers for several languages
- Several good explanations of PageRank and link analysis:
- The Anatomy of a Large-Scale Hypertextual Web Search Engine, Sergey Brin and Lawrence Page
- Web search for a planet: The Google cluster architecture, by Luiz André Barroso, Jeffrey Dean, and Urs Hölzle
- The Google file system, by Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung
- Mining Massive Datasets online course at Coursera