POD Aggregator (IvyPlus) data set and mapping (aka Data Lake)
Harvard contributes metadata to POD: https://pod.stanford.edu
Incremental files are sent daily.
Included as of Nov 2022:
All active bibliographic records (bibs) with active MARC holdings, exported from Alma as MARCXML.
This includes NET holdings, which contain links to locally digitized resources available to the public, as well as a small number of non-Harvard public resources.
This includes FIG holdings, which contain links to material digitized through the Google Book Search project. Some links may point to HathiTrust.
NOT included: bibs without MARC holdings. In practice this means:
No licensed e-journals, activated e-journal collections (such as the Directory of Open Access Journals), or free e-resources managed as portfolios in Alma
No licensed e-books
No activated database records such as encyclopedias
General notes on data:
The data is published from Alma with the Linked Data enrichment option, which adds the $$0 (subfield zero) with linked data URIs when a heading is linked to an authority record. For detailed information, see https://developers.exlibrisgroup.com/alma/integrations/linked_data.
Private data is excluded per this Alma norm rule
Each bib record is embedded with fields from linked holdings records
Non-Latin parallel fields are stored in linked 880 fields. MARC spec: https://www.loc.gov/marc/bibliographic/bd880.html
All fields from the same holding will have the same $8 value.
All items from the same holding will have the $8 value of the linked holding.
Bibs are enriched with the following public holdings fields. For definitions of local fields, see "Harvard defined MARC fields".
852
007
506, 538, 583
541, 561, 562, 563
843, 845
856
866, 867, 868
966, 967, 968
977
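
As a rough illustration of how the shared $8 value can be used, the Python sketch below groups the embedded holdings fields and the 876 item fields of a single MARCXML record by holding ID. The sample record, its identifiers and codes, and the function name `group_by_holding` are invented for illustration. Control fields such as 007 carry no subfields, so this sketch only looks at data fields, and it treats any field without a $8 as bib-level.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# Holdings-derived data field tags from the list above; 876 carries item data.
HOLDING_TAGS = {
    "852", "506", "538", "583", "541", "561", "562", "563",
    "843", "845", "856", "866", "867", "868", "966", "967", "968", "977",
}
ITEM_TAG = "876"

# Invented sample record: IDs and codes are placeholders, not real POD data.
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <datafield tag="245" ind1="1" ind2="0">
    <subfield code="a">Example title</subfield>
  </datafield>
  <datafield tag="852" ind1="0" ind2=" ">
    <subfield code="b">LIB1</subfield>
    <subfield code="c">STACKS</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="866" ind1=" " ind2=" ">
    <subfield code="a">v.1-v.2</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="852" ind1="0" ind2=" ">
    <subfield code="b">LIB2</subfield>
    <subfield code="c">STACKS</subfield>
    <subfield code="8">H2</subfield>
  </datafield>
  <datafield tag="876" ind1=" " ind2=" ">
    <subfield code="0">I1</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="876" ind1=" " ind2=" ">
    <subfield code="0">I2</subfield>
    <subfield code="8">H1</subfield>
  </datafield>
  <datafield tag="876" ind1=" " ind2=" ">
    <subfield code="0">I3</subfield>
    <subfield code="8">H2</subfield>
  </datafield>
</record>"""


def group_by_holding(record):
    """Map each $8 holding ID to its holdings fields and its 876 item fields."""
    groups = defaultdict(lambda: {"fields": [], "items": []})
    for field in record.findall("marc:datafield", NS):
        link = field.find("marc:subfield[@code='8']", NS)
        if link is None or not link.text:
            continue  # no $8: treated here as a bib-level field
        tag = field.get("tag")
        if tag == ITEM_TAG:
            groups[link.text]["items"].append(field)
        elif tag in HOLDING_TAGS:
            groups[link.text]["fields"].append(field)
    return dict(groups)


if __name__ == "__main__":
    for holding_id, group in group_by_holding(ET.fromstring(SAMPLE)).items():
        tags = [f.get("tag") for f in group["fields"]]
        print(holding_id, tags, len(group["items"]), "item(s)")
```

Grouping on $8 rather than on field order avoids assuming that a holding's fields and items appear next to each other in the exported record.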
Bibs are enriched with item data. Each item is represented by an 876 field. Subfields are defined as follows:
| Subfield | Content | Note |
|---|---|---|
| 0 (zero) | internal item identifier (PID) | assigned by Alma |
| b | permanent library code | corresponds to 852 $b |
| c | permanent location code | corresponds to 852 $c |
| l (lowercase L) | current library code | if the item is in a temporary library/location, this value will be the temporary library, otherwise it will be the permanent library |
| m | current location code | see above, for location |
| 7 | call number type | classification scheme |
| n | call number | entire call number from 852 compressed into single subfield |
| d | Enum A | |
| e | Enum B | |
| f | Chron I | |
| g | Chron J | |
| 3 | description | usually only present for serials / multi-volumes |
| q | item status | binary value: in place (1) or not in place (0) |
| j | process type | e.g. missing etc. Code definitions: Item Statuses, Process Types, and Required Fields |
| p | barcode | |
| t | copy ID | from item record, not holding 852 |
| y | material type | code definitions: Item Material Types |
| h | item policy | loan period / restrictions, encoded as 2 digit value. Definitions: Item Policies |
| x | provenance | used for ReCAP. See ReCAP Shared Collection - Alma coding for items and holdings |
| z | stat note 2 | used for retention commitments (ReCAP / Hathi) |
| 8 | holding ID | identifier of holding to which item is linked |
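
The subfield table above can be turned into a small decoder. The Python sketch below reads one 876 field from MARCXML into a dictionary with descriptive keys. The key names, the function name `parse_item`, and the sample values are invented for illustration. The `in_temporary_location` flag is an inference from the notes on $l and $m (current differs from permanent), not a documented field.

```python
import xml.etree.ElementTree as ET

NS = {"marc": "http://www.loc.gov/MARC21/slim"}

# 876 subfield code -> descriptive key. Codes follow the table above;
# the key names are made up for this sketch.
ITEM_SUBFIELDS = {
    "0": "item_pid", "b": "permanent_library", "c": "permanent_location",
    "l": "current_library", "m": "current_location",
    "7": "call_number_type", "n": "call_number",
    "d": "enum_a", "e": "enum_b", "f": "chron_i", "g": "chron_j",
    "3": "description", "q": "in_place", "j": "process_type",
    "p": "barcode", "t": "copy_id", "y": "material_type",
    "h": "item_policy", "x": "provenance", "z": "stat_note_2",
    "8": "holding_id",
}

# Invented sample item: every value is a placeholder.
SAMPLE_ITEM = """<datafield xmlns="http://www.loc.gov/MARC21/slim"
                           tag="876" ind1=" " ind2=" ">
  <subfield code="0">I1</subfield>
  <subfield code="b">LIB1</subfield>
  <subfield code="c">STACKS</subfield>
  <subfield code="l">LIB1</subfield>
  <subfield code="m">RESERVE</subfield>
  <subfield code="q">1</subfield>
  <subfield code="p">000000000001</subfield>
  <subfield code="8">H1</subfield>
</datafield>"""


def parse_item(field):
    """Decode one 876 datafield element into a dict of named values."""
    item = {}
    for subfield in field.findall("marc:subfield", NS):
        key = ITEM_SUBFIELDS.get(subfield.get("code"))
        if key is not None:
            # If a subfield repeats, the last occurrence wins in this sketch.
            item[key] = (subfield.text or "").strip()

    # $q is a binary flag: "1" means in place, "0" means not in place.
    if "in_place" in item:
        item["in_place"] = item["in_place"] == "1"

    # Assumption: an item is in a temporary location when its current
    # library/location pair differs from its permanent pair.
    current = (item.get("current_library"), item.get("current_location"))
    permanent = (item.get("permanent_library"), item.get("permanent_location"))
    item["in_temporary_location"] = None not in current and current != permanent
    return item


if __name__ == "__main__":
    print(parse_item(ET.fromstring(SAMPLE_ITEM)))
```

Coded values such as process type, material type, and item policy are left as raw strings here, since their definitions live in the separate code lists referenced in the table.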