CS 751/851 - Introduction to Digital Libraries Final Exam Michael L. Nelson Due: Scheduled Day for Final Exam The final is worth 24% of your grade, so each question will count 3 points. Contact me if you need clarification on some of the questions. Use the readings, class notes, other materials, and me, but not each other. Please turn in both hard and soft copies. I estimate that each question will take 40-60 minutes a piece, and probably no less than 1/2 page each. References to support your claims (where appropriate) are helpful. You are not limited to the readings we have covered in class. 1. What do you see as the major limitations of "lazy preservation"? 2. Discuss in detail the benefits and limitations of employing the technique described in chapter 8 of McCown's dissertation. 3. Summarize Lynch's "Open Questions" in his 2001 article about OAI-PMH. Briefly describe how the OAI-PMH community has (currently) addressed each of these open questions. 4. Discuss in detail the problems with meta searching as identified by Gravano et al. Does (and if so, how) SRU and OpenSearch address these problems? 5. What are the advantages and drawbacks of having the web server involved in the preservation process as per Smith's dissertation? 6. Compare and contrast the four crawling algoritms presented in the paper "Crawling the Web". 7. Could Resource Maps be used to replace or augment the current implementation of the CRATE model? Discuss in detail how this could be done (or why it could not). 8. Describe in detail the arguments for and against using time as a dimension for content negotiation. What is your personal position? Extra Credit -- Each Question is worth three points: 9. Describe your personal collection of digital content (class work, photos, music, email, etc.) and in detail assess your existing strategies against the threats in the paper "Requirements for Digital Preservation Systems: A Bottom-Up Approach". In areas where your strategies are lacking, proposal a realistic scenario for how they could be improved. 10. Work through a *detailed*, specific scenario of the polling described in section 4 of the Maniatis et al. paper. 11. As per the paper "Search engine coverage of the OAI-PMH corpus", find a resource that is described by an OAI-PMH record but not index by the three major search engines (Google, Yahoo, Bing). Give plenty of details to support your claim.