...

For monograph and manuscript materials, we group files into items and items into a batch (see note 3).  For example, the following batch contains three items.  Two items contain only JPEG2000 image files, and one item contains JPEG2000, OCR plain text, and OCR ALTO XML files.

    [BATCH ID] (example: Batch02 or Box03)
        |-- [UNIQUE_ID]-mets.xml (single volume METS file example: 007984492-mets.xml)
        |-- [UNIQUE_ID]_[VOLUME_ID]-mets.xml (multivolume METS file example: 000652831_v0002-mets.xml)
        |-- [UNIQUE_ID]-mets.xml (manuscript collection METS file example: morgan_601_705_volIV-mets.xml)
        |-- [HOLLIS_ID].xml (MARCXML or MODS xml file, e.g., 000652831.xml)
        |
        |-- [UNIQUE_ID]/ (manuscript collection example, e.g. morgan_601_705_volIV)
        |       |-- [UNIQUE_ID]_[####].jp2
        |       |-- morgan_601_705_volIV_0001.jp2
        |       |-- morgan_601_705_volIV_0002.jp2
        |       |-- morgan_601_705_volIV_0003.jp2
        |       |-- morgan_601_705_volIV_0004.jp2
        |       ...
        |       |-- morgan_601_705_volIV_0099.jp2
        |
        |-- [HOLLIS_ID]/ (single volume monograph example, e.g. 007984492)
        |       |-- [HOLLIS_ID]_[####].jp2
        |       |-- [HOLLIS_ID]_[####].txt
        |       |-- [HOLLIS_ID]_[####].xml
        |       |-- 007984492_0001.jp2
        |       |-- 007984492_0001.txt
        |       |-- 007984492_0001.xml
        |       |-- 007984492_0002.jp2
        |       ...
        |       |-- 007984492_0099.jp2
        |       |-- 007984492_0099.txt
        |       |-- 007984492_0099.xml
        |
        |-- [UNIQUE_ID]_[VOLUME_ID]/ (multi-volume example, e.g. 000652831_v0002)
                |-- [UNIQUE_ID]_[VOLUME_ID]_[####].jp2
                |-- 000652831_v0002_0001.jp2
                |-- 000652831_v0002_0002.jp2
                |-- 000652831_v0002_0003.jp2
                ...
                |-- 000652831_v0002_0099.jp2
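
The naming convention in the tree above can be checked mechanically. The following is a minimal sketch, not part of the Imaging Services workflow: it assumes that every page-level file is named with its item directory name, an underscore, a four-digit sequence number, and one of the extensions shown in the examples. The names `PAGE_FILE`, `parse_page_file`, and `misplaced_files` are hypothetical.

```python
import re

# Assumed pattern, inferred from the example tree: item directory name,
# underscore, four-digit sequence number, then .jp2, .txt, or .xml.
PAGE_FILE = re.compile(r"^(?P<item>.+)_(?P<seq>\d{4})\.(?P<ext>jp2|txt|xml)$")


def parse_page_file(name):
    """Return (item_id, sequence, extension), or None if the name does not match."""
    m = PAGE_FILE.match(name)
    if m is None:
        return None
    return m.group("item"), int(m.group("seq")), m.group("ext")


def misplaced_files(item_dir, file_names):
    """List files whose name does not start with the name of their item directory."""
    bad = []
    for name in file_names:
        parsed = parse_page_file(name)
        if parsed is None or parsed[0] != item_dir:
            bad.append(name)
    return bad
```

For instance, `parse_page_file("000652831_v0002_0001.jp2")` splits the name into the item ID `000652831_v0002`, the sequence number, and the extension, while a batch-level METS file name such as `007984492-mets.xml` is not treated as a page file.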

Here is another example, from the Black Teacher Archive Project, showing a batch with project-specific file name patterns.

    [BATCH ID] (example: GWU_02)
        |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]-mets.xml (issue level METS file, example: bta_45355957_VA_1957_038_005_mets.xml)
        |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]-mets.xml (issue level METS file, example: bta_45355957_VA_1957_038_006_mets.xml)
        |-- [Project_code]_[OCLC#].xml (MARCXML or MODS xml file for the series, example: bta_45355957.xml)
        |
        |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]/ (issue directory, example: bta_45355957_VA_1957_038_005)
        |       |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]_[####].jp2
        |       |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]_[####].txt
        |       |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]_[####].xml
        |       |-- bta_45355957_VA_1957_038_005_0001.jp2
        |       |-- bta_45355957_VA_1957_038_005_0001.txt
        |       |-- bta_45355957_VA_1957_038_005_0001.xml
        |       |-- bta_45355957_VA_1957_038_005_0002.jp2
        |       |-- bta_45355957_VA_1957_038_005_0002.txt
        |       |-- bta_45355957_VA_1957_038_005_0002.xml
        |       ...
        |       |-- bta_45355957_VA_1957_038_005_0432.jp2
        |       |-- bta_45355957_VA_1957_038_005_0432.txt
        |       |-- bta_45355957_VA_1957_038_005_0432.xml
        |
        |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]/ (issue directory, example: bta_45355957_VA_1957_038_006)
                |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]_[####].jp2
                |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]_[####].txt
                |-- [Project_code]_[OCLC#]_[State_code]_[Year]_[Volume#]_[Issue#]_[####].xml
                |-- bta_45355957_VA_1957_038_006_0001.jp2
                |-- bta_45355957_VA_1957_038_006_0001.txt
                |-- bta_45355957_VA_1957_038_006_0001.xml
                |-- bta_45355957_VA_1957_038_006_0002.jp2
                |-- bta_45355957_VA_1957_038_006_0002.txt
                |-- bta_45355957_VA_1957_038_006_0002.xml
                ...
                |-- bta_45355957_VA_1957_038_006_0030.jp2
                |-- bta_45355957_VA_1957_038_006_0030.txt
                |-- bta_45355957_VA_1957_038_006_0030.xml
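
A project-specific pattern like the one above can be split back into its fields when a recipient needs to sort or index the delivered files. The sketch below is illustrative only. The field widths (two-letter state code, four-digit year, three-digit volume and issue numbers, four-digit sequence number) are assumptions read off the example file names, not rules stated by the project, and `BTA_NAME` and `parse_bta_name` are hypothetical names.

```python
import re

# Assumed field shapes, inferred from names such as
# bta_45355957_VA_1957_038_005_0001.jp2; adjust if the project differs.
BTA_NAME = re.compile(
    r"^(?P<project>[A-Za-z]+)_(?P<oclc>\d+)_(?P<state>[A-Z]{2})_"
    r"(?P<year>\d{4})_(?P<volume>\d{3})_(?P<issue>\d{3})_(?P<seq>\d{4})"
    r"\.(?P<ext>jp2|txt|xml)$"
)


def parse_bta_name(name):
    """Return the file name fields as a dict of strings, or None if no match."""
    m = BTA_NAME.match(name)
    return m.groupdict() if m else None
```

The fields are returned as strings so that leading zeros in the volume, issue, and sequence numbers are preserved.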


Photograph and other art objects

For photographs and other art objects, we group files into batches.  For example, the following batch contains a set of JPEG2000 files.

    [BATCH ID] (example: Batch02 or Album03)
        |-- ss_123458.jp2
        |-- ss_458790.jp2
        ...
        |-- ss_987692.jp2

 

NOTE: Each delivery may contain several batches.

...

  • Hard disk, flash drive
    • The repositories can borrow the media from Imaging Services or pay for them
    • Recommended for large sets of data
  • Google shared drive
    • The data recipient's Google account needs to be provided to Imaging Services
  • MS shared directory
    • The data recipient's email address needs to be provided to Imaging Services
    • Suitable for small sets of data
  • Secure file transfer (https://filetransfer.harvard.edu)
    • The data recipient's email address needs to be provided to Imaging Services
    • Data recipients outside Harvard University need to set up a guest account
    • Suitable for small sets of data that need encryption during file transfer


  1. In cases where the record identifier includes spaces, the spaces will be replaced by underscores.

  2. MARCXML records are only available for items that have been cataloged in Harvard’s bibliographic database, HOLLIS.

  3. Batch level identifiers are assigned to groups of titles prepared and submitted together for scanning. These named “batches” will be maintained from scanning all the way through deposit to Harvard's Digital Repository Service and transfer of data to project partners beyond the Harvard libraries. Inclusion of technical metadata is optional.
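
The substitution described in note 1 can be sketched in a line of code. This is an illustration only: the function name `normalize_identifier` and the sample identifier are hypothetical, and the sketch assumes that each space maps to exactly one underscore with no other normalization.

```python
def normalize_identifier(record_id):
    """Replace each space in a record identifier with an underscore."""
    # Assumption: only spaces are replaced; other characters are left untouched.
    return record_id.replace(" ", "_")


# Hypothetical identifier containing spaces.
file_stem = normalize_identifier("morgan 601 705 volIV")
```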