Module Description (excerpted from the bulletin): This module discusses the basic concepts and methods of information retrieval including capturing, representing, storing, organizing, and retrieving unstructured or loosely structured information. The most well-known aspect of information retrieval is document retrieval: the process of indexing and retrieving text documents. However, the field of information retrieval includes almost any type of unstructured or semi-structured data, including newswire stories, transcribed speech, email, blogs, images, or video. Therefore, information retrieval is a critical aspect of Web search engines. This module also serves as the foundation for subsequent modules on the understanding, processing and retrieval of particular web media.
N.B. We will be teaching and using the Python programming language throughout this class. We will using Python 2.6.6 instead of the updated Python 3.x, as the NLTK library that we will also be using is currently incompatible with 3.x.
Course Characteristics:
Note: There will only be five tutorials; each tutorial is on a subject related to a homework assignment, and the tutorials are held every other week.
Note to NUS-external visitors: Welcome! If you're a fellow I.R. course instructor looking for lecture material, you can see the syllabus menu item on the left for a preview. Please contact me if you'd like to use any of my material. Thanks!
This document, index.html, has been accessed 213 times since 25-Jun-24 11:57:13 +08. This is the 2nd time it has been accessed today.
A total of 125 different hosts have accessed this document in the last 152 days; your host, 18.225.175.230, has accessed it 1 times.
If you're interested, complete statistics for this document are also available, including breakdowns by top-level domain, host name, and date.
Min-Yen Kan <kanmy@comp.nus.edu.sg> Wed Dec 8 13:16:25 SGT 2010 | Version: 1.0 | Last modified: Wed Feb 2 12:39:09 2011