Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 15 Next »

Background

ETDs from ProQuest can have a variety of files and file types.  Each file will need a separate DRS Object and be assinged a 'Role' in DRS.

For example, a ProQuest ETD could have the following files

  • Thesis in PDF format
  • Mets.xml file
  • License files in PDF (or other) formats
  • Supplementary files, such as thesis appendices, data sets, videos, etc,. in a variety of formats

Preparing a batch

  • Get the file information for all files from the mets.xml FileSec
    • Filename
    • Mime-type
    • Use category - CONTENT or LICENSE
    • AMDID - amd_primary, amd_supplemental, amd_license 
  • Create Object OSNs based on ProQuest ID, school code and Role
  • Create File OSNs based on ProQuest ID and Role
  • Create mapping.txt file to associate files with appropriate Object OSN and File OSN
  • Locate the MARCXML file for the ETD and generate a MODS file
  • Get DASH URN from MARCXML if present
  • Get Alma MMSID by using query with ProQuestETD ID


Create an object for each file

  • Content Model based on file Mime-type
  • Role based on Use category and AMDID
    • Primary PDF thesis gets ROLE=THESIS
    • Other files in CONTENT group get ROLE=THESIS_SUPPLEMENT
    • Files in ‘LICENSE’ group get ROLE=LICENSE
    • Mets.xml file gets ROLE=DOCUMENTATION
  • File with ROLE=THESIS gets MODS descriptive metadata


Add relationships and Harvard Metadata inks to THESIS Object

  • Has_supplement
  • Has_licence
  • Has _documentation
  • HOLLIS link based on MMSID
  • DASH link based on DASH ID


Deposit all but THESIS Object

Wait for load report

Add Object URNs for related objects to THESIS object batch

Deposit THESIS Object


Files in ETD Submission directory


FileSec of mets.xml
  <fileSec>
    <fileGrp ID="etdadmin-mets-fgrp-1" USE="CONTENT">
      <file GROUPID="etdadmin-mets-file-group" ID="etdadmin-mets-file-2132021" MIMETYPE="application/pdf" ADMID="amd_primary" SEQ="1">
        <FLocat LOCTYPE="URL" xlink:href="thesis_pdfa_allisonhyatt.pdf"/>
      </file>
      <file GROUPID="etdadmin-mets-file-group" ID="etdadmin-mets-file-2132069" MIMETYPE="application/pdf" ADMID="amd_supplemental_1" SEQ="1">
        <FLocat LOCTYPE="URL" xlink:href="appendices_pdfa_allisonhyatt.pdf"/>
      </file>
    </fileGrp>
    <fileGrp ID="etdadmin-mets-fgrp-2" USE="LICENSE">
      <file GROUPID="etdadmin-mets-file-group" ID="etdadmin-mets-file-2046147" MIMETYPE="application/pdf" ADMID="amd_license_2046147">
        <FLocat LOCTYPE="URL" xlink:href="setup_2E592954-F85C-11EA-ABB1-E61AE629DA94.pdf"/>
      </file>
    </fileGrp>
  </fileSec>


Variables from fileSec

Variable

Source

Example

FilenameFLocat xlink:hrefthesis_pdfa_allisonhyatt.pdf
Mime-typefile MIMETYPEapplication/pdf
USE typefileGrp USECONTENT
ADMIDfile ADMIDamd_primary


Examples of values assigned to a filename

FilenameMime-typeAMDIDUSE Type
thesis_pdfa_allisonhyatt.pdfapplication/pdfamd_primaryCONTENT
appendices_pdfa_allisonhyatt.pdfapplication/pdfamd_supplemental_1CONTENT
setup_2E592954-F85C-11EA-ABB1-E61AE629DA94.pdfapplication/pdfamd_license_2046147LICENSE





ROLE assignments

USE TypeAMDIDROLE

CONTENT

amd_primaryTHESIS
CONTENTamd_supplemental_[/d]THESIS_SUPPLEMENT
LICENSEamd_license_[/d+]LICENSE
N/Amets.xmlDOCUMENTATION
  • No labels