1 | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | R | S | T | W | Y | Z

Lafayette College Web Archive

The College Archives documents the history of Lafayette College by identifying, preserving, and making available records of permanent value produced by administrative offices, academic departments and programs, faculty, student, and alumni groups.  Beginning in July 2015, Lafayette College partnered with the Internet Archive’s Archive-It program to collect, preserve, and provide access to our institutional web presence through the Wayback Machine, a digital archive of the World Wide Web.  The Lafayette College Web Archive is curated by the College Archives and managed with assistance from Information Technology Services and the Communications Division.

 

Mission Statement

As an extension of its overall mission, the College Archives gathers web content made available via the institution’s public website.  This includes administrative and academic webpages as well as publications, policies, events, and news of the College community as presented online.

Acquisition Method

The College Archives uses the subscription service Archive-It and its open source software Heritrix to crawl the College’s web site (https://www.lafayette.edu) and harvest web pages.  The crawler captures web domains by taking a snapshot of the page and storing a copy in the Internet Archive, which can be accessed through Archive-It and the WayBack Machine.  Captured files are stored in the WARC (Web ARChive) file format, which is the Library of Congress preferred archival format for web sites harvested in bulk.

Crawl Scope & Limitations

Our current web archiving program crawls publically available web content that is part of the https://www.lafayette.edu domain.  File formats and types captured include HTML, JavaScript, PDFs, embedded images, videos, and audio.  Frequency of capture of https://www.lafayette.edu is currently three times a year, with captures of webpages that change often (i.e., calendar events) occurring more frequently.

Links to Lafayette College administered external sites (i.e., www.goleopards.com; www.facebook.com; https://twitter.com/; https://www.flickr.com/) and other organization sites (CNN, The Chronicle of Higher Education, NASA) are not captured.  Content not crawled also includes digital repository collections, databases, streaming media, and password-protected sites.

Access & Use

Researchers can access Lafayette College’s web archive by using the search box at the top of this page or through the College’s collection page at https://archive-it.org/collections/5983.  Content is searchable by keyword or specific URL.  Results may have advanced search options applied, including file format and capture date range. Within each result a Show All Captures link will provide a chronological list of captures for that specific URL within the Wayback Machine. Captured sites are available to view in Wayback within 24 hours after a crawl has finished.  Full-text searching is available 7 days after the crawl has been completed, tested, and saved.

Researchers may also click on the following links to access recent captures of the Lafayette College website:

July 16, 2015

March 9, 2016

May 17, 2016

October 24, 2016

Rights Management

The Lafayette College Web Archive is a collection of the institution’s publically available web content.  For inquiries relating to use, reproduction, and copyright, please contact archives@lafayette.edu.

Contact

Questions or comments?

Are you a member of the Lafayette College community and your public web content via the https://www.lafayette.edu domain is not currently captured by our web archiving program?

Please contact us at archives@lafayette.edu.