CSE788.i02: Information Retrieval (Spring 2008)

Instructor: Hui Fang

Time & Place: 10:00-11:18am T/R, DL 480


Quick Link: lecture notes

Announcements

Basic Information

Course Goals

With the increasing amount of textual information, such as web pages, it is important to develop effective systems that help users manage and exploit this information. Web search engines, such as Google, are good examples of such systems. In this course, you will learn the underlying technologies of information retrieval systems and gain hands-on project experience. Topics include information retrieval models, information filtering, text classification, and text clustering.

Prerequisites

Students should come with good programming skills. If you are not sure whether you have the right background, please contact the instructor.

Grading

Syllabus (tentative)

Date Topic Slides Readings Presenter(s)
Tue 3/25 Course Overview pdf As We May Think Hui Fang
Thur 3/27 General Advice on Presentation N/A N/A Hui Fang
Tue 4/1 Intro to Text Retrieval pdf Modern Information Retrieval: A Brief Overview Hui Fang
Thur 4/3 1. Vector Space Model

2. Probabilistic Model

pdf

pdf

1. Pivoted Document Length Normalization

2. Probabilistic Relevance Models Based on Document and Query Generation

WenBin Zhang

Venu Satuluri

Tue 4/8 1. Language Modeling Approach

2. Axiomatic Approach

pdf

pdf

1. A Study of Smoothing methods for language models applied to ad hoc information retrieval

2. An Exploration of Axiomatic Approach to Information Retrieval

Darla Shockley

Hong Yuan

Thur 4/10 Feedback pdf

pdf

Chapter 9: Relevance Feedback

Model-based feedback in the language modeling approach to information retrieval

TanTan Liu

Zhe Xu

Tue 4/15 Project Proposal (1) N/A N/A All
Thur 4/17

Room: DL280

Project Proposal (2) N/A N/A All
Tue 4/22 Query Expansion and Suggestions pdf Semantic Term Matching in Axiomatic Approach to IR

Generating query substitutions

Anna Wolf

Fan Wang

Thur 4/24 Evaluation and User Study pdf IR evaluation methods for retrieving highly relevant documents

Re-examining the Potential Effectiveness of Interactive Query Expansion

Zhe Xu

Joe Bolinger

Tue 4/29 Link Structure pdf PageRank

Topic-Sensitive PageRank

Matt Goyder

Tim Weale

Thur 5/1 Implicit Feedback pdf Accurately Interpreting Clickthrough Data as Implicit Feedback

Context-Sensitive Information Retrieval Using Implicit Feedback

Kelly Yackovich

Aditya Torvi

Tue 5/6 Text Categorization

Deep Web

pdf A re-examination of text categorization methods

Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web

Aditya Torvi

Wei Jiang

Thur 5/8 Text Clustering pdf Probabilistic Latent Semantic Indexing

A cross-collection mixture model for comparative text mining

Anna Wolf

Darla Shockley

Tue 5/13 Information Extraction

Entity Rank

pdf Automatic segmentation of text into structured records

EntityRank: Searching Entities Directly and Holistically

Kelly Yackovich

Fan Wang

Thur 5/15 Information Filtering pdf Empirical Analysis of Predictive Algorithms for Collaborative Filtering

Novelty and Redundancy Detection in Adaptive Filtering

Joe Bolinger

Erdem Yalcin

Tue 5/20 Text Summarization pdf 1. Query-relevant summarization using FAQs

2. News to Go: Hierarchical Text Summarization for Mobile Devices

Venu Satuluri

Wei Jiang

Thur 5/22 Information Visualization and User Interface pdf Visualization of search results: a comparative evaluation of text, 2D and 3D interfaces

Grouper: A Dynamic clustering interface to web search results

Matt Goyder

Yuan Hong

Tue 5/27 Project Presentation (I) All
Thur 5/29 Project Presentation (II) All