Document Type

Conference Paper

Publication Date

2018

Publication Title

Proceedings of the 2018 Web Archiving and Digital Libraries Workshop

Pages

12-13

Conference Name

Web Archiving and Digital Libraries Workshop (WADL 2018), June 6, 2018, Fort Worth, Texas

Abstract

[First paragraph] Preserving and replaying modern web pages in high-fidelity has become an increasingly difficult task due to the increased usage of JavaScript. Reliance on server-side rewriting alone results in live-leakage and or the inability to replay a page due to the preserved JavaScript performing an action not permissible from the archive. The current state-of-the-art high fidelity archival preservation and replay solutions rely on handcrafted client-side URL rewriting libraries specifically tailored for the archive, namely Webrecoder's and Pywb's wombat.js [12]. Web archives not utilizing client-side rewriting rely on server-side rewriting that misses URLs used in a manner not accounted for by the archive or involve client-side execution of JavaScript by the browser.

Rights

© 2018 Edward A. Fox, Martin Klein, and Zhiwu Xie.

Included with the kind written permission of the copyright holders and the author.

Now published under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Original Publication Citation

Berlin, J. A., Nelson, M. L., & Weigle, M. C. (2018, June 6, 2018). Swimming in a sea of Javascript or: How I learned to stop worrying and love high fidelity replay [Conference Paper]. Web Archiving and Digital Libraries Workshop, Fort Worth, Texas. http://hdl.handle.net/10919/97988

ORCID

0000-0003-3749-8116 (Nelson), 0000-0002-2787-7166 (Weigle)

Share

COinS