Linking: Hubs and Authorities, Citation Linking, Citation Analysis
This week we will look at how documents link to each other, and how we
can use such links to traverse a corpus.
- J. Kleinberg, S. Lawrence, The Structure of the Web. Science, 294(2001), 18
49.
http://www.cs.cornell.edu/home/kleinber/sci01.pdf
This is a readable introduction to "Hubs" and "Authorities" in web pages.
The members of the Clever Project at IBM have come up with algorithms for
deducing which web pages are "authorities" (as determined by the number
inbound links) and "hubs" (as determined by the number of out-bound links).
This has important implications for searching for information, and is
closely related to the concept of citation analysis.
- Steve Lawrence, Kurt Bollacker, C. Lee Giles, "Digital Libraries and
Autonomous Citation Indexing," IEEE Computer, June 1999.
http://www.neci.nec.com/~lawrence/papers/aci-computer99/aci-computer99.pdf
This paper discusses the CiteSeer / Research Index demonstration DL for
"Autonomous Citation Indexing". This involves esstentially reading in
many papers, and creating the linkage between these papers as indicated
by their references. In order to gain wide coverage, the papers are
processed as raw PostScript, PDF, etc. and the citations extracted
automatically and the linkages built around them. This method of automatic
citation extraction represents a big stride ahead of traditional
citation indexing methods which have traditionally been performed by hand.
- Helen Atkins, "The ISI Web of Science - Links and Electronic Journals",
D-Lib Magazine, 5(9), September 1999.
http://www.dlib.org/dlib/september99/atkins/09atkins.html
This article discusses the on-line citation indexing product of ISI. ISI
was founded by Eugene Garfield, the father of citation analysis.
- Nancy R. Kaplan and Michael L. Nelson, Determining the Publication
Impact of a Digital Library, Journal of the American Society of
Information Science, 51(4), 2000, pp. 324-339.
http://www.cs.odu.edu/~mln/pubs/jasis-dl.pdf
This paper discusses the results of trying to measure the impact of a
digital library on the aerospace community. Ultimately, it did not uncover
the results we hoped to find, but it did raise a number of issues about
measuring usage of a DL.