Date of Award
Summer 2005
Document Type
Thesis
Degree Name
Master of Science (MS)
Department
Computer Science
Committee Director
Michael L. Nelson
Committee Member
Johan Bollen
Committee Member
Ravi Mukkamula
Abstract
We present Opal, a framework for interactively locating missing web pages (http status code 404). Opal is an example of "in vivo" preservation: harnessing the collective behavior of web archives, commercial search engines, and research projects for the purpose of preservation. Opal servers learn from their experiences and are able to share their knowledge with other Opal servers using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). Using cached copies that can be found on the web, Opal creates lexical signatures which are then used to search for similar versions of the web page. Using the OAI-PMH to facilitate inter-Opal learning extends the utilization of OAI-PMH in a novel manner. We present the architecture of the Opal framework, discuss a reference implementation of the framework, and present a quantitative analysis of the framework that indicates that Opal could be effectively deployed.
Rights
In Copyright. URI: http://rightsstatements.org/vocab/InC/1.0/ This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s).
DOI
10.25777/359p-qw88
Recommended Citation
Harrison, Terry L..
"Opal: In Vivo Based Preservation Framework for Locating Lost Web Pages"
(2005). Master of Science (MS), Thesis, Computer Science, Old Dominion University, DOI: 10.25777/359p-qw88
https://digitalcommons.odu.edu/computerscience_etds/23