Files

Download

Download Full Text (5.4 MB)

Download Powerpoint presentation (8.9 MB)

Description

PDF of a powerpoint presentation from the Columbia University Web Archiving Collaboration: New Tools and Models Conference, in New York, New York, June 4-5, 2015. Also available on Slideshare.

Publication Date

6-4-2015

Keywords

Archive-It, Data sets, Mementos, Off-topic pages, Seed URls, TimeMaps, Web archives, Web crawlers

Disciplines

Archival Science | Computer Sciences

Tools Managing Seed URls (Detecting Off-Topic Pages)


Share

COinS