- DL Case Studies: Netlib, NHSE, WATERS, LTRS, NTRS, LISAR/LAVA, NIX, UCSTRI,
Physics e-print, NCSTRL
- This week we begin looking at the architecture of some significant WWW DLs.
I have tried to pick readings that are representative of classes of DLs.
In particular, these papers show DLs of increasing sophistication, where
that sophistication comes at the cost of greater synchronization and
participation requirements for the remote sites.
- M. VanHeyningen, "The Unified Computer Science Technical Report Index:
Lessons in Indexing Diverse Resources," Proceedings of the 2nd
International World Wide Web Conference, October 19-21, 1994, pp. 535-543.
This paper documents the development of the UCSTRI system. UCSTRI is
notable as an early effort that provided a surprisingly good search
interface to a collection of anonymous FTP servers. The key point is
that the anonymous FTP sites did nothing to participate in UCSTRI; their
contents were cataloged and heuristics were applied to guess the formats, etc.
This is a DL equivalent of current meta-searchers like AltaVista, Infoseek
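
To make the "heuristics applied to guess the formats" idea concrete, here is
a minimal sketch of guessing a document's format from its file name alone,
as an indexer might when the FTP site provides no metadata. The suffix table
and the function name guess_format are hypothetical illustrations; the rules
UCSTRI actually used are described in the paper, not here.

```python
# Hypothetical suffix-to-format table (an assumption for illustration only).
FORMAT_BY_SUFFIX = {
    ".ps": "PostScript",
    ".dvi": "DVI",
    ".tex": "TeX",
    ".txt": "plain text",
}

# Compression suffixes commonly seen on FTP archives (case-sensitive match).
COMPRESSION_SUFFIXES = (".Z", ".gz", ".z")


def guess_format(filename):
    """Return (format, compressed) guessed from a file name alone."""
    name = filename
    compressed = False
    # Peel off one compression suffix, if present, before looking at the type.
    for suffix in COMPRESSION_SUFFIXES:
        if name.endswith(suffix):
            name = name[: -len(suffix)]
            compressed = True
            break
    lowered = name.lower()
    for suffix, fmt in FORMAT_BY_SUFFIX.items():
        if lowered.endswith(suffix):
            return fmt, compressed
    # No rule matched: report the uncertainty instead of guessing further.
    return "unknown", compressed
```

For example, guess_format("tr-94-01.ps.Z") yields ("PostScript", True), while
a file such as "README" falls through to ("unknown", False). A real indexer
would need many more rules, since remote sites follow no shared convention.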
- M. Nelson and M.-H. Maa, "Optimizing the NASA Technical Report Server,"
Internet Research, 6(1), 1996, pp. 64-70.
This paper gives an overview of some architectural improvements in NTRS,
most notably integrating parallel searching. NTRS is a gateway to 15+
different DLs, and requires the remote sites to meet a minimal requirement
for participation (in contrast to UCSTRI). Most of the individual nodes
in NTRS are of the hybrid http/ftp server variety discussed in the
- J. Davis and C. Lagoze, "The Networked Computer Science Technical Report
Library," Cornell CS TR96-1595, July, 1996.
This paper gives an overview of NCSTRL and the protocol that it is built
upon, Dienst. NCSTRL follows the independent, distributed publisher model
of the DLs discussed in the above papers, but differs in that it
requires the installation of a sophisticated suite of software for
publication management, indexing, and serving. NCSTRL has 100+
participants, and Dienst is probably the most sophisticated and rich WWW
DL system in widespread use.
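
The parallel searching mentioned for NTRS above can be illustrated with a
small sketch: a gateway sends the same query to every node at once and merges
the hits, rather than querying the nodes one after another. This is only an
illustration under stated assumptions, not the NTRS implementation. The node
names, the in-memory title lists, and the functions search_node and
parallel_search are all hypothetical stand-ins for real remote queries.

```python
from concurrent.futures import ThreadPoolExecutor


def search_node(node_name, documents, query):
    """Stand-in for querying one remote DL; it only scans an in-memory list."""
    q = query.lower()
    return [(node_name, title) for title in documents if q in title.lower()]


def parallel_search(nodes, query, max_workers=8):
    """Query every node concurrently and merge the hits in node order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Submit all searches first so they run concurrently.
        futures = [
            pool.submit(search_node, name, docs, query)
            for name, docs in nodes.items()
        ]
        results = []
        # Collect in submission order so the merged output is deterministic.
        for future in futures:
            results.extend(future.result())
    return results


# Hypothetical nodes and titles, used only to exercise the sketch.
NODES = {
    "node-a": ["Wing Flutter Analysis", "Turbulence Models"],
    "node-b": ["Flutter Suppression Methods"],
    "node-c": ["Orbital Debris Survey"],
}

hits = parallel_search(NODES, "flutter")
```

With real remote nodes, the total wait is roughly that of the slowest node
rather than the sum over all nodes, which is the practical benefit of
searching in parallel. A production gateway would also need per-node timeouts
so that one unresponsive site does not stall the whole query.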