Topics to be covered:
- Publishing models, DL input, scanning, re-keying, OCR
- SGML and other formats
- Metadata: purpose, formats, use
- Ginsparg, P., "First Steps Toward Electronic Research Communication,"
Computers in Physics, 8, 1994, pp. 390-396.
Not surprisingly, much of the literature concerning the intersection
between the Internet and scholarly publication has been by those who are
(openly or secretly) fascinated by the scholarly publication process itself.
Most of the discussions are simply proposals on how to re-implement existing
processes (review, payment, etc.) using the new technology. Ginsparg is
one of the first to argue that Internet technologies can radically alter
the research communication model, and not simply provide "electronic
journals." This is possible by having the research authors
directly input their results into a DL, bypassing the traditional
journal routing/review system.
- Lesk, M. "Books Into Bytes," Scientific American, March, 1997.
http://www.lesk.com/mlesk/sciam97/sciam97.html (original version)
http://www.sciam.com/0397issue/0397lesk.html (published version)
Most of the material in this article appears in the textbook. Using many
non-STI examples, Lesk gives brief examples and economic figures for
scanning, OCRing, rekeying, and media conversion. There are also mentions
of metadata use (Text Encoding Initiative), markup languages (SGML), and
format permanence. The focus of this article is converting information
into digital formats.
- Heery, R. "Review of Metadata Formats," Program 30(4), Octobert 1996.
There is a lot of literature about metadata ("data about data"); Heery gives a
readable overview with a small number of relevant examples. In traditional
libraries, the distinction between metadata and data was well established:
the metadata provided a "pointer" to the physical location of the object.
In a DL, this distinction has blurred, and DL builders must understand
the subtle distinctions when populating their DLs. "MARC" and "Dublin
Core" are among the relevant sample formats discussed.