Google Scanning Project -- Lamont DOC
- Emily Kelly
- Claire DeMarco
Overview
Workflow Summary
- Items are pulled from Lamont Documents (DOC) and shipped to Google Ann Arbor for scanning. HL Access Services is managing pulling and shipping of materials.
- LTS uploads metadata files to Google and places items in transit. Note:Â Google requires receipt of metadata prior to receiving physical materials for scanning.
- After scanning, these items will be discarded - these are part of the Google sheet fed workflow.
Detailed Workflow
Notes:
- Google requires item metadata in advance of the items' arrival for scanning.Â
Who is responsible? (color coded)
- Harvard Depository staff
- HL Access Services / Shipping partners
- LTS staff
- Google Ann Arbor staff
Procedures
- Access Services pulls items from DOC in batches, creating an Alma set and uploading item manifest to Google.
- LTS processes Alma set as follows:
- Combine multiple sets into a single batch.
- Extract metadata per Google spec and send to Google:
- In Alma, open the publishing profile "Google@item level" for editing: Alma > Resources > Publishing > Publishing profiles, then select the ellipsis button next to "Google@item level" profile and select "Edit".
- Update the "Google@item level" profile parameters as follows:
- Set name: Select the itemized set you just created.
- File name prefix: Update with current batch information, following the same convention, e.g. "harvard_bookcart_200-242".
- Save the profile.
- Run "Google@item level" profile by selecting "Run" from the profile's ellipsis button. Output files are deposited as *.tar.gz format on almadrop in /dropbox/alma/alma/google.
- SFTP to almadrop as the alma user to download the metadata output files to the local machine, then upload to the shared Google Drive. Google automatically ingests files, no notification needed.
Work with Allison P. to contact bbunnell@google.com for access to the Google Drive. Google contact for metadata is Kurt Groetsch (kgroetsch@google.com).
- Place items into transit awaiting post-processing reshelving:
- Log onto almadrop as ltsadmin user and run remotetransit.py script on barcode list. (See /wiki/spaces/LibraryTechServices/pages/59098446.) Items will appear to have been put into transit from the owning library's main circ desk (e.g. WID_CIRC). Any existing requests will be queued.
- Access Services physically ships batched materials to Google.
- Google retrieves metadata from shared Google Drive and scans batched materials.
Google Scanning Batch ReportsÂ
 | File | Modified |
---|---|---|
Microsoft Excel Spreadsheet DOC_PAL005_Physical items detailed report with core holdings, bib data - Flexible input.xlsx |
Feb 28, 2020 by Emily Kelly | |
Microsoft Excel Spreadsheet DOC_PAL004_Physical items detailed report with core holdings, bib data - Flexible input.xlsx |
Feb 28, 2020 by Emily Kelly | |
Microsoft Excel Spreadsheet DOC_PAL003_Physical items detailed report with core holdings, bib data - Flexible input.xlsx |
Feb 28, 2020 by Emily Kelly | |
Microsoft Excel Spreadsheet DOC_PAL002_Physical items detailed report with core holdings, bib data - Flexible input.xlsx |
Feb 28, 2020 by Emily Kelly | |
Microsoft Excel Spreadsheet DOC_PAL001_Physical items detailed report with core holdings, bib data - Flexible input.xlsx |
Feb 28, 2020 by Emily Kelly |