In 2009 (after the completion of the pilot project to collect women’s blogs) we began collecting web sites created by individuals and organizations whose papers/records are at the library.
While completing the finding aid, the processor should include a file unit description of the web site, even if it has not yet been harvested.
Ideally, by the time the processor has completed the finding aid, a first harvest will have been successful and the web content will have been added to the Schlesinger Library Sites web archiving collection (SL Sites). The Digital Librarian/Archivist will supply Paula Aloisio with a url for the collection’s archived content, and Paula will create a URN and add the hot link(s) to the finding aid. If the harvested site is not available when the finding aid is complete, the finding aid will still be posted and the URN/link will be added by Paula later. The processor should, however, include all the information as if the web content were available.
The work flow will take this shape:
- Oftentimes the donor agreement will list the donor's web site for archiving. If a web site is not listed for archiving in the deed of gift or purchase agreement and the site is not mentioned in the correspondence file, the processor searches for a web site for the person or organization. If the processor finds a web site he or she will contact Kathy Jacob about requesting permission to archive the site or will contact the donor directly for permission.
- If a web site is found and will be archived, the processor checks to see if the site is already being captured in Archive-It: https://archive-it.org/collections/8237.
- If the web site isn't in Archive-It, the processor sends the web site URL to Laura Peimer to begin harvesting of the site. At this point, the processor should be able to tell Laura whether the site needs a one-time capture or if it should be harvested annually or bi-annually. The schedule default is annual but if the organization doesn’t exist anymore or if the individual has died, we might want to schedule a one-time capture. Alternatively, if the site is very active, we can capture it bi-annually. The processor should also indicate if there's anything particularly important on the site that we should ensure the Archive-It crawler captures (e.g. video clips, certain images, etc.)
In 2018 a pilot project was successfully completed linking digitized video content from the archived web site of the Women's Encampment to matching Vt-# entries in the Women's Encampment collection’s finding aid: https://hollisarchives.lib.harvard.edu/repositories/8/resources/8359
By matching already digitized content from an archived web site to the finding aid -- where we have the corresponding analog content listed-- we are able to provide historical footage more readily and without the additional cost of reformatting on our end. When surveying, processors should consider identifying any possible 1-to-1 matches between digitized audiovisual content on the organization's or individual's archived web site with any physical original tapes in the collection.
- If a web site exists and is being archived, the processor will add the following to the finding aid (see Sonia Fuentes for example: http://nrs.harvard.edu/urn-3:RAD.SCHL:sch01256):
- In the Extent tag include "# archived web site(s)"
- EXAMPLE: 12.93 linear feet ((31 file boxes) plus 2 folio+ folders, 10 photograph folders, 3 audiotapes, 4 videotapes, 1 archived web site)
- In the Scope and Content: “XX’s web site is being captured periodically as part of Schlesinger Library’s web archiving program."
EXAMPLE: Also included is Griffin's web site, which is being captured periodically as part of Schlesinger Library's web archiving program.
See Papers of Susan Griffin finding aid.
- In the Extent tag include "# archived web site(s)"
- In file unit descriptions, use “E” as the container followed by a file unit number. New practice (as of January 2019) is to provide more information in the <unittitle>. This will include specifying that it is an "archived web site" and adding the actual URL of the site.
EXAMPLE: E.1. Susan Brownmiller’s archived web site: http://www.susanbrownmiller.com, 2010-ongoing. [hot link added by Paula]
- If there are multiple web sites that are grouped under one E#, the URLs should be listed within a scope and content tag for the folder.
EXAMPLE: E.1. Deanna Booher's archived web sites as Queen Kong and Queen Adrena, 2018-ongoing. [hot link added by Paula]
Scope and Content note: GLAMAZON QUEEN KONG (http://goddessqueenkong.blogspot.com/); Queen Adrena (http://queenadrena.com/); Queen Adrena (http://queenadrena.net/); Matilda the Hun from GLOW; Queen Kong (http://queenkong.com/).
- In file unit descriptions, use “E” as the container followed by a file unit number. New practice (as of January 2019) is to provide more information in the <unittitle>. This will include specifying that it is an "archived web site" and adding the actual URL of the site.
- In the added entries include: 655 _7 Web archives $$2 aat
- *Please note: we are no longer adding “electronic records” in the quantity or in the added entries when there are archived web sites.
- *Please note: we are no longer adding “electronic records” in the quantity or in the added entries when there are archived web sites.
- In the added entries include: 655 _7 Web archives $$2 aat
- The collection, series, and subseries dates should exclude the web site date(s), which should only appear in the item description (e.g., E.1. Web site, 2010-ongoing)
Note about processing addenda: We include links to archived web sites in the finding aid and HOLLIS record for both the original collection and any addenda.
If you have a web site in your finding aid, please mention it to Paula when you give her the XML document for review. She will need to create a "NET holdings" for the bib record. Processors shouldn't worry about this, but should let her know so she can plan to make the necessary record.