Background
Packages from ProQuest
ETDs from ProQuest can have a variety of files and file types. Each file with a currently supported content model will need a separate DRS Object and be assigned a 'Role' in DRS. Supplementary files that do not have a supported content model, like CAD files, will all be put in one Opaque Object.
For example, a ProQuest ETD could have the following files:
- Thesis in PDF format
- mets.xml file
- License files in PDF (or other) formats
- Supplementary files, such as thesis appendices, data sets, videos, etc., in a variety of formats
DRS Roles and Relationships
In DRS, objects and files are assigned roles that help categorize material and facilitate relationships between objects. (NEED link to DRS Guide section)
File to DRS Role mapping
File from ETD | DRS Object Role | Relationship from THESIS object |
---|---|---|
Thesis PDF | THESIS | - |
Supplementary files | THESIS_SUPPLEMENT | HAS_SUPPLEMENT |
License files | LICENSE | (Rights Block) |
mets.xml | DOCUMENTATION | HAS_DOCUMENTATION |
DRS Projects, Batches, Objects, and Content Models in BatchBuilder
BatchBuilder is the main tool used to organize content for deposit into DRS.
Preparing a batch
- Get the file information for all files from the mets.xml fileSec
  - Filename
  - Mime-type
  - Use category - CONTENT or LICENSE
  - ADMID - amd_primary, amd_supplemental, amd_license
- Create Object OSNs based on ProQuest ID, school code and Role
- Create File OSNs based on ProQuest ID and Role
- Create mapping.txt file to associate files with appropriate Object OSN and File OSN
- Locate the MARCXML file for the ETD and generate a MODS file
- Get DASH URN from MARCXML if present
- Get Alma MMSID by querying with the ProQuest ETD ID
Create an object for each file
- Content Model based on file Mime-type
- Role based on Use category and ADMID
  - Primary PDF thesis gets ROLE=THESIS
  - Other files in CONTENT group get ROLE=THESIS_SUPPLEMENT
  - Files in LICENSE group get ROLE=LICENSE
  - mets.xml file gets ROLE=DOCUMENTATION
- File with ROLE=THESIS gets MODS descriptive metadata
Add relationships and Harvard Metadata links to THESIS Object
- Has_supplement
- Has_license
- Has_documentation
- HOLLIS link based on MMSID
- DASH link based on DASH ID
Deposit all but THESIS Object
Wait for load report
Add Object URNs for related objects to THESIS object batch
Deposit THESIS Object
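The deposit order above (related objects first, THESIS object last) can be sketched as follows. This is only an illustration: the function names, the dictionary keys `osn` and `role`, and the `urn_by_osn` lookup standing in for the load report are all hypothetical, and the real load report format is not described here. The relationship names come from the File to DRS Role mapping table; LICENSE objects are collected separately because that table lists them under the Rights Block rather than a named relationship.

```python
# Relationship names taken from the "File to DRS Role mapping" table.
RELATIONSHIP_BY_ROLE = {
    "THESIS_SUPPLEMENT": "HAS_SUPPLEMENT",
    "DOCUMENTATION": "HAS_DOCUMENTATION",
}


def split_batches(objects):
    """Return (objects to deposit first, THESIS objects to deposit last)."""
    first = [o for o in objects if o["role"] != "THESIS"]
    last = [o for o in objects if o["role"] == "THESIS"]
    return first, last


def thesis_links(first_batch, urn_by_osn):
    """Build the links to add to the THESIS object after the first deposit.

    urn_by_osn is a hypothetical {object OSN: object URN} lookup that would
    be filled in from the load report of the first deposit.
    """
    relationships = []  # (relationship name, related object URN)
    license_urns = []   # URNs intended for the rights block
    for obj in first_batch:
        urn = urn_by_osn[obj["osn"]]
        if obj["role"] == "LICENSE":
            license_urns.append(urn)
        else:
            relationships.append((RELATIONSHIP_BY_ROLE[obj["role"]], urn))
    return relationships, license_urns
```

In this sketch, `split_batches` would run before the first deposit, and `thesis_links` only once the load report has supplied a URN for every related object.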
Files in ETD Submission directory
    <fileSec>
      <fileGrp ID="etdadmin-mets-fgrp-1" USE="CONTENT">
        <file GROUPID="etdadmin-mets-file-group" ID="etdadmin-mets-file-2132021" MIMETYPE="application/pdf" ADMID="amd_primary" SEQ="1">
          <FLocat LOCTYPE="URL" xlink:href="thesis_pdfa_allisonhyatt.pdf"/>
        </file>
        <file GROUPID="etdadmin-mets-file-group" ID="etdadmin-mets-file-2132069" MIMETYPE="application/pdf" ADMID="amd_supplemental_1" SEQ="1">
          <FLocat LOCTYPE="URL" xlink:href="appendices_pdfa_allisonhyatt.pdf"/>
        </file>
      </fileGrp>
      <fileGrp ID="etdadmin-mets-fgrp-2" USE="LICENSE">
        <file GROUPID="etdadmin-mets-file-group" ID="etdadmin-mets-file-2046147" MIMETYPE="application/pdf" ADMID="amd_license_2046147">
          <FLocat LOCTYPE="URL" xlink:href="setup_2E592954-F85C-11EA-ABB1-E61AE629DA94.pdf"/>
        </file>
      </fileGrp>
    </fileSec>
Variables from fileSec
Variable | Source | Example |
---|---|---|
Filename | FLocat xlink:href | thesis_pdfa_allisonhyatt.pdf |
Mime-type | file MIMETYPE | application/pdf |
USE type | fileGrp USE | CONTENT |
ADMID | file ADMID | amd_primary |
Examples of values assigned to a filename
Filename | Mime-type | ADMID | USE Type |
---|---|---|---|
thesis_pdfa_allisonhyatt.pdf | application/pdf | amd_primary | CONTENT |
appendices_pdfa_allisonhyatt.pdf | application/pdf | amd_supplemental_1 | CONTENT |
setup_2E592954-F85C-11EA-ABB1-E61AE629DA94.pdf | application/pdf | amd_license_2046147 | LICENSE |
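As a minimal sketch, the rows in this table could be extracted from a mets.xml file with Python's standard `xml.etree.ElementTree`. The function name `read_file_sec`, the record keys, and the `<mets>` wrapper with its namespace declarations are assumptions made for illustration, since the fileSec excerpt in this document omits the enclosing element. Tags are matched by local name so the code does not depend on how the real file declares the METS namespace.

```python
import xml.etree.ElementTree as ET

# Standard XLink namespace; the attribute appears as xlink:href in mets.xml.
XLINK_HREF = "{http://www.w3.org/1999/xlink}href"

# Assumption: the <mets> wrapper and namespace declarations below are added
# for illustration; the fileSec content is copied from the sample ETD.
SAMPLE = """<mets xmlns="http://www.loc.gov/METS/"
      xmlns:xlink="http://www.w3.org/1999/xlink">
  <fileSec>
    <fileGrp ID="etdadmin-mets-fgrp-1" USE="CONTENT">
      <file ID="etdadmin-mets-file-2132021" MIMETYPE="application/pdf"
            ADMID="amd_primary" SEQ="1">
        <FLocat LOCTYPE="URL" xlink:href="thesis_pdfa_allisonhyatt.pdf"/>
      </file>
      <file ID="etdadmin-mets-file-2132069" MIMETYPE="application/pdf"
            ADMID="amd_supplemental_1" SEQ="1">
        <FLocat LOCTYPE="URL" xlink:href="appendices_pdfa_allisonhyatt.pdf"/>
      </file>
    </fileGrp>
    <fileGrp ID="etdadmin-mets-fgrp-2" USE="LICENSE">
      <file ID="etdadmin-mets-file-2046147" MIMETYPE="application/pdf"
            ADMID="amd_license_2046147">
        <FLocat LOCTYPE="URL"
                xlink:href="setup_2E592954-F85C-11EA-ABB1-E61AE629DA94.pdf"/>
      </file>
    </fileGrp>
  </fileSec>
</mets>"""


def local_name(tag):
    """Strip the '{namespace}' prefix ElementTree puts on qualified tags."""
    return tag.rsplit("}", 1)[-1]


def read_file_sec(xml_text):
    """Return one record per <file>, with the four variables from fileSec."""
    root = ET.fromstring(xml_text)
    records = []
    for grp in root.iter():
        if local_name(grp.tag) != "fileGrp":
            continue
        for f in grp:
            if local_name(f.tag) != "file":
                continue
            flocat = next((c for c in f if local_name(c.tag) == "FLocat"), None)
            records.append({
                "filename": flocat.get(XLINK_HREF) if flocat is not None else None,
                "mime_type": f.get("MIMETYPE"),
                "use": grp.get("USE"),      # USE lives on the parent fileGrp
                "admid": f.get("ADMID"),
            })
    return records


records = read_file_sec(SAMPLE)
```

Each record carries the filename, MIME type, USE type and ADMID for one file, in the order the files appear in the fileSec.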
ROLE assignments
USE Type | ADMID | ROLE |
---|---|---|
CONTENT | amd_primary | THESIS |
CONTENT | amd_supplemental_\d+ | THESIS_SUPPLEMENT |
LICENSE | amd_license_\d+ | LICENSE |
N/A | mets.xml | DOCUMENTATION |
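The ROLE assignments can be sketched as a small lookup function. The name `assign_role` and its signature are hypothetical, and the sketch assumes the numeric suffixes on supplemental and license ADMID values are one or more digits, as in the sample file. Combinations not listed in the table raise an error rather than receiving a guessed role.

```python
import re

# Assumed ADMID patterns, based on the sample values amd_supplemental_1
# and amd_license_2046147.
SUPPLEMENT_ADMID = re.compile(r"amd_supplemental_\d+")
LICENSE_ADMID = re.compile(r"amd_license_\d+")


def assign_role(filename, use=None, admid=None):
    """Return the DRS role for a file, following the ROLE assignments table."""
    if filename == "mets.xml":
        # The mets.xml file itself is not listed in fileSec.
        return "DOCUMENTATION"
    admid = admid or ""
    if use == "CONTENT" and admid == "amd_primary":
        return "THESIS"
    if use == "CONTENT" and SUPPLEMENT_ADMID.fullmatch(admid):
        return "THESIS_SUPPLEMENT"
    if use == "LICENSE" and LICENSE_ADMID.fullmatch(admid):
        return "LICENSE"
    raise ValueError(
        f"No role rule for {filename!r} (USE={use!r}, ADMID={admid!r})"
    )
```

Raising on unknown combinations is a design choice for the sketch: it surfaces unexpected ProQuest packages early instead of depositing a file under the wrong role.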