Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Single volume file name example: 010010723_0001.jp2

  • Multi-volume file name example: 008105127_v0007_0001.jp2

  • Multi-volume and multi-issue file name example: 008105127_v0007_n003_0001.jp2
  • Manuscript collection file name example: morgan_601_705_volIV_0001.jp2

Note: Occasionally, a project may require a different file name pattern due to project partners' specific need.   For example, The Black Teacher Archive project need to name file as [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#] (for example, bta_30786193_MA_1966_038_008.jp2).

...

MarcXML file names consist of two components: [HOLLIS_ID].xml2

[HOLLIS_ID]: Hollis system identifier (e.g. 011835322)

.xml: File format (XML) extension

Packaging tag file names (see next section – Use of the “bagit” file-packaging and –

...

interchange protocol).

Use of the “Bagit” file-packaging and -interchange protocol

The data files provided will be arranged and inventoried in accordance with the “Bagit” specification promoted by the Preservation Directorate of the Library of Congress.

To learn more about Bagit and to investigate the freely available tools available for checking the integrity of the Bagit-packaged data, we suggest you consult the following online resources:

A Library of Congress produced video designed to introduce the “Bagit” specification: http://www.youtube.com/watch?v=l3p3ao_JSfo

Opensource Bagit software tools: https://github.com/LibraryOfCongress

Wikipedia entry: https://en.wikipedia.org/wiki/BagIt

Organization of files and file system on portable media (i.e, portable hard drive)3

<root directory>
| bag-info.txt  
| bagit.txt
| manifest-md5.txt
| tagmanifest-md5.txt
|

...

|--

...

 data

...


    |
    

...

|--

...

 [BATCH ID] (see note 4)
        |-- 

...

[UNIQUE_ID]-

...

mets.xml(single volume METS file example:_007984492-

...

mets.xml)
        | 
        |-- 

...

[UNIQUE_ID]_[VOLUME_ID]-METS.xml(multivolume METS file example: 

...

000652831_v0002-

...

mets.xml)
        |
        |-- 

...

[UNIQUE_ID]-METS.xml (manuscript collection METS file example: 

...

morgan_601_705_volIV-

...

mets.xml)
        |
        |-- 

...

batch.xml(see note 5) (technical metadata file)
        |
        |-- [HOLLIS_ID].xml(see note 6) (MarcXML file, e.g., 000652831.xml)
        |
        

...

|--

...

 [UNIQUE_ID(see note 7)]/(manuscript collection example, e.g. morgan_601_705_volIV)
             |
             |-- [UNIQUE_ID]_[####].jp2
             |-- morgan_601_705_volIV_0001.jp2
             |-- morgan_601_705_volIV_0002.jp2
             |-- morgan_601_705_volIV_0003.jp2
             |-- morgan_601_705_volIV_0004.jp2
                 ...
             |-- morgan_601_705_volIV_0099.jp2
        

...

|--

...

 [HOLLIS_ID]/(single volume monograph example, e.g. 007984492)
             |
             |-- [HOLLIS_ID]_[####].jp2
             |-- 007984492_0001.jp2
             |-- 0079984492_0002.jp2
                 ...
             |-- 007984492_0099.jp2
        

...

|--

...

 [UNIQUE_ID]_[VOLUME_ID]/(see note 8) (multi-volume example, e.g. 000652831_v0002)
             |
             |-- [VOLUME_ID]_[####].jp2
             |-- 000652831_v0002_0001.jp2
             |-- 000652831_v0002_0002.jp2
             |-- 000652831_v0002_0003.jp2
               ...
             |-- 000652831_v0002_0099.jp2


Info
titleNOTES
1. In cases where the record identifier includes space, the spaces will be replaced by underscores.

...


2. MARCXML records are only available for items that have been cataloged in Harvard’s bibliographic database, HOLLIS.

...


3. If more than one disk is needed, a batch may span more than one disk; the corresponding metadata files for the batch will appear on each disk.
4. Batch level identifiers are assigned to groups of titles prepared and submitted together for scanning. These named “batches” will be maintained from scanning all the way through deposit to Harvard's Digital Repository Service and transfer of data to project partners beyond the Harvard libraries.
Inclusion of technical metadata is optional.
5. Inclusion of MARCXML files are optional.
6. The title's unique identifier is used as the directory name.
7. Individual volumes or fascicles will be labeled using a two- or three-digit sequence number (e.g., v001, v002, v099, v123).