This lecture focuses on the Open Archives Initiative Protocol for
Metadata Harvesting (OAI-PMH). We will cover its origins, current
status and future directions.
- Herbert Van de Sompel, Thomas Krichel, Michael L. Nelson, Patrick
Hochstenbach, Victor M. Lyapunov, Kurt Maly, Mohammad Zubair, Mohamed
Kholief, Xiaoming Liu, and Heath O'Connell, The UPS Prototype: An
Experimental End-User Service across E-Print Archives, D-Lib Magazine,
6(2), February 2000.
UPS was the demo system that lead to the creation of the Santa Fe
Convention. The SFC is the historical predecessor to what is now known as OAI.
Michael L. Nelson, Better Interoperability Through the Open Archives Initiative, New Review of Information Networking 7, 2002, pp. 133-145.
This paper gives an overview of OAI-PMH 1.1 and contrasts it with
distributed searching. There are subtle but significant changes in
OAI-PMH 2.0; they will be covered in class. Note that this paper, and
Cliff's below, were written before the name of the protocol had
stablilized to "OAI-PMH".
- Clifford Lynch. 2001. Metadata Harvesting and the Open Archives Initiative.
ARL Monthly Report 217, August 2001.
This article discusses some implications of the proliferation of
- McCown, F., Liu, X., Nelson, M. L., and Zubair, M. "Search engine coverage of the OAI-PMH corpus." IEEE Internet Computing 10(2), 2006, pp. 66-73.
- Kat Hagedorn, Joshua Santelli, "Google Still Not Indexing Hidden Web URLs", 14(7/8), 2008.
The above two papers (published well after OAI-PMH has been widely
adopted) discuss how search engines are not using OAI-PMH to discover
resources in the deep web.