Topics to be discussed:
Searching, retrieval, indexing: algorithms and applications
Metrics and evaluation: issues and case studies
- Gerald Salton, "A New Comparison Between Conventional Indexing (MEDLARS)
and Automatic Text Processing (SMART)," Cornell University Computer
Science Technical Report TR71-115, December 1971.
Finding an appropriate, on-line reading for traditional information
retrieval is difficult. IR is a mature field, so overview material exists
only in textbooks and the current research literature deals mainly with
esoterica. The above technical report is an early result from Gerald
Salton, a pioneer in modern IR, and features the SMART on-line retrieval
system, a testbed for many IR experiments. This report highlights
many key issues in IR: precision and recall; boolean searching vs vector
matching; relevance feedback; manual indexing vs automatic processing.
- Ch 2 in PDL:BBB.
This chapter covers some of the material from lecture #2 and some of the
material from this week's lecture (lecture #3).