POD Aggregator (IvyPlus) data set and mapping (aka Data Lake)

Harvard contributes metadata to POD: https://pod.stanford.edu

Incremental files are sent daily. 

Included as of Nov 2022:

  • All active bibs with active MARC holdings (from the Alma system), as MARCXML.
    • This includes NET holdings, which contain links to locally digitized resources available to the public, as well as a small number of non-Harvard public resources
    • This includes FIG holdings, which contain links to material digitized through the Google Book Search project. Some links may point to HathiTrust. 
  • NOT included - bibs without MARC holdings, i.e.: 
    • No licensed e-journals, or activated e-journal collections such as Directory of Open Access Journals, or free e-resources managed as portfolios in Alma
    • No licensed e-books
    • No activated database records such as encyclopedias

General notes on data:

  • The data is published from Alma with the Linked Data enrichment option, which adds the $$0 (subfield zero) with linked data URIs when a heading is linked to an authority record. For detailed information, see https://developers.exlibrisgroup.com/alma/integrations/linked_data.
  • Private data is excluded by an Alma normalization rule
  • Each bib record is embedded with fields from linked holdings records
  • Non-Latin parallel fields are stored in linked 880 fields. MARC spec: https://www.loc.gov/marc/bibliographic/bd880.html
  • All fields from the same holding will have the same $8 value. 
  • All items from the same holding will have the $8 value of the linked holding. 
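
The $8 linking described in the notes above can be illustrated with a minimal Python sketch. It assumes the records use the standard MARCXML namespace and that every embedded holdings field and item field carries a $8. The sample record, its identifiers (H1, H2) and codes (LIB1, STACKS) are invented for illustration, and group_by_holding is a hypothetical helper name, not part of any POD or Alma tooling.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Standard MARCXML namespace (assumed to be used in the published files).
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Invented sample record: identifiers and codes are placeholders, not real data.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="245" ind1="0" ind2="0">
    <subfield code="a">Example title</subfield>
  </datafield>
  <datafield tag="852" ind1="0" ind2=" ">
    <subfield code="b">LIB1</subfield>
    <subfield code="c">STACKS</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="866" ind1=" " ind2=" ">
    <subfield code="a">v.1-v.3</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="876" ind1=" " ind2=" ">
    <subfield code="p">BARCODE1</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="852" ind1="0" ind2=" ">
    <subfield code="b">LIB2</subfield>
    <subfield code="c">REF</subfield>
    <subfield code="8">H2</subfield>
  </datafield>
  <datafield tag="876" ind1=" " ind2=" ">
    <subfield code="p">BARCODE2</subfield>
    <subfield code="8">H2</subfield>
  </datafield>
</record>"""


def group_by_holding(record):
    """Map each $8 value to the tags of the fields that carry it, in record order."""
    groups = defaultdict(list)
    for field in record.findall("marc:datafield", MARC_NS):
        link = field.find("marc:subfield[@code='8']", MARC_NS)
        # Fields without a $8 (for example the bib's own 245) are skipped.
        if link is not None and link.text:
            groups[link.text].append(field.get("tag"))
    return dict(groups)


record = ET.fromstring(SAMPLE)
print(group_by_holding(record))
# {'H1': ['852', '866', '876'], 'H2': ['852', '876']}
```

Grouping on $8 rather than on tag alone keeps fields from different holdings apart when a bib has more than one holding.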

Bibs are enriched with the following public holdings fields. For definitions of local fields, see "Harvard defined MARC fields".

  • 852
  • 007
  • 506, 538, 583
  • 541, 561, 562, 563
  • 843, 845
  • 856
  • 866, 867, 868
  • 966, 967, 968
  • 977

Bibs are enriched with item data. Each item is represented by an 876 field. Subfields are defined as follows:

Subfield         Content                         Note
0 (zero)         internal item identifier (PID)  assigned by Alma
b                permanent library code          corresponds to 852 $b
c                permanent location code         corresponds to 852 $c
l (lowercase L)  current library code            if the item is in a temporary library/location, this value will be the temporary library, otherwise it will be the permanent library
m                current location code           see above, for location
7                call number type                classification scheme
n                call number                     entire call number from 852 compressed into single subfield
d                Enum A
e                Enum B
f                Chron I
g                Chron J
3                description                     usually only present for serials / multi-volumes
q                item status                     binary value: in place (1) or not in place (0)
j                process type                    e.g. missing etc. Code definitions: Item Statuses, Process Types, and Required Fields
p                barcode
t                copy ID                         from item record, not holding 852
y                material type                   code definitions: Item Material Types
h                item policy                     loan period / restrictions, encoded as 2 digit value. Definitions: Item Policies
x                provenance                      used for ReCAP. See ReCAP Shared Collection - Alma coding for items and holdings
z                stat note 2                     used for retention commitments (ReCAP / Hathi)
8                holding ID                      identifier of holding to which item is linked
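
As a rough illustration of how the 876 subfields above might be read, the following Python sketch turns each 876 field into a dictionary. It assumes the standard MARCXML namespace. The dictionary key names, the parse_items helper and the sample values (ITEM1, LIB1, BARCODE1 and so on) are inventions of this sketch, not names or data from Alma or POD. The derived in_temporary_location flag is one possible reading of the note on $l and $m: an item counts as temporarily relocated when its current library or location differs from the permanent one.

```python
import xml.etree.ElementTree as ET

# Standard MARCXML namespace (assumed to be used in the published files).
MARC_NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Subfield code -> readable key. The codes follow the table above;
# the key names are this sketch's own choice.
ITEM_SUBFIELDS = {
    "0": "item_pid", "b": "permanent_library", "c": "permanent_location",
    "l": "current_library", "m": "current_location", "7": "call_number_type",
    "n": "call_number", "d": "enum_a", "e": "enum_b", "f": "chron_i",
    "g": "chron_j", "3": "description", "q": "item_status",
    "j": "process_type", "p": "barcode", "t": "copy_id",
    "y": "material_type", "h": "item_policy", "x": "provenance",
    "z": "stat_note_2", "8": "holding_id",
}

# Invented sample record: all values are placeholders, not real data.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="876" ind1=" " ind2=" ">
    <subfield code="0">ITEM1</subfield>
    <subfield code="b">LIB1</subfield>
    <subfield code="c">STACKS</subfield>
    <subfield code="l">LIB2</subfield>
    <subfield code="m">RESERVE</subfield>
    <subfield code="q">1</subfield>
    <subfield code="p">BARCODE1</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
</record>"""


def _differs(a, b):
    """True only when both values are present and unequal."""
    return a is not None and b is not None and a != b


def parse_items(record):
    """Return one dict per 876 field, keyed by the names in ITEM_SUBFIELDS."""
    items = []
    for field in record.findall("marc:datafield[@tag='876']", MARC_NS):
        item = {}
        for sf in field.findall("marc:subfield", MARC_NS):
            key = ITEM_SUBFIELDS.get(sf.get("code"))
            if key:
                # If a subfield repeats, the last occurrence wins.
                item[key] = (sf.text or "").strip()
        # $q is a binary flag: "1" means in place, "0" means not in place.
        item["in_place"] = item.get("item_status") == "1"
        item["in_temporary_location"] = (
            _differs(item.get("current_library"), item.get("permanent_library"))
            or _differs(item.get("current_location"), item.get("permanent_location"))
        )
        items.append(item)
    return items


for item in parse_items(ET.fromstring(SAMPLE)):
    print(item["barcode"], item["holding_id"], item["in_place"],
          item["in_temporary_location"])
```

The holding_id value can then be matched against the $8 of the embedded 852 to recover the holding an item belongs to.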