POD Aggregator (IvyPlus) data set and mapping (aka Data Lake)

POD Aggregator (IvyPlus) data set and mapping (aka Data Lake)

Harvard contributes metadata to POD: https://pod.stanford.edu

Incremental files are sent daily. 

Included as of Nov 2022:

  • All active bibs with active MARC holdings (from Alma system), as MARCxml. 

    • This includes NET holdings, which contain links to locally digitized resources available to the public, as well as a small amount of non-Harvard public resources

    • This includes FIG holdings, which contain links to material digitized through the Google Book Search project. Some links may point to HathiTrust. 

  • NOT included - bibs without MARC holdings, i.e.: 

    • No licensed e-journals, or activated e-journal collections such as Directory of Open Access Journals, or free e-resources managed as portfolios in Alma

    • No licensed e-books

    • No activated database records such as encyclopedias

General notes on data:

  • The data is published from Alma with the Linked Data enrichment option, which adds the $$0 (subfield zero) with linked data URIs when a heading is linked to an authority record. For detailed information, see https://developers.exlibrisgroup.com/alma/integrations/linked_data.

  • Private data is excluded per this Alma norm rule

  • Each bib record is embedded with fields from linked holdings records

  • Non-Latin parallel fields will be stored in linked 880 fields. MARC spec: https://www.loc.gov/marc/bibliographic/bd880.html

  • All fields from the same holding will have the same $8 value. 

  • All items from the same holding will have the $8 value of the linked holding. 

Bibs are enriched with the following public holdings fields. For definitions of local fields visit Harvard defined MARC fields

  • 852

  • 007

  • 506, 538, 583

  • 541, 561, 562, 563

  • 843, 845

  • 856

  • 866, 867, 868

  • 966, 967, 968

  • 977

Bibs are enriched with item data. Each item is represented by a 876 field. Subfields are defined as follows: 

Subfield

Content

Note

Subfield

Content

Note

0 (zero)

internal item identifier (PID)

assigned by Alma

b

permanent library code

corresponds to 852 $b

c

permanent location code 

corresponds to 852 $c

l (lowercase L)

current library code

if the item is in a temporary library/location, this value will be the temporary library, otherwise it will be the permanent library

m

current location code

see above, for location

7

call number type

classification scheme

n

call number

entire call number from 852 compressed into single subfield

d

Enum A

 

e

Enum B

 

f

Chron I

 

g

Chron J

 

3

description

usually only present for serials / multi-volumes

q

item status

binary value: in place (1) or not in place (0)

j

process type

e.g. missing etc. Code definitions Item Statuses, Process Types, and Required Fields

p

barcode

 

t

copy ID

from item record, not holding 852

y

material type

code definitions: Item Material Types

h

item policy

loan period / restrictions, encoded as 2 digit value. Definitions: Item Policies

x

provenance

used for ReCAP. See ReCAP Shared Collection - Alma coding for items and holdings

z

stat note 2

used for retention commitments (ReCAP / Hathi)

8

holding ID

identifier of holding to which item is linked


We don't have a way to export this macro.