/
Google Scanning Report
Google Scanning Report
Data provided by Google that records the date the item was received, whether or not it was considered scannable, physical conditions preventing scanning, scanning metadata such as OCR quality, and other information involving full text availability. See also Hathi Report documentation.
Field name | Definition | Data type |
---|---|---|
Check-in Date | Date on which book was checked into a Google scanning center. | Date |
Digitization Method | Method by which the volume was digitized. In data: SHEETFED, NON_DESTRUCTIVE, UNKNOWN_SCAN_TITLE, DIGIFEED, null. | Text |
Material Conditions | Controlled list of terms for conditions noted about the book at check-in. Condition may or may not have prevented scanning. Most items are 'None'. Repeatable. ○ Foldouts in book ○ Oversized Book ○ Pages Stuck Together ○ Tight Gutter ○ Defaced ○ Overchop ○ Other ○ Opted out ○ Uncut pages ○ Publication date ○ Poor condition ○ No Metadata ○ Duplicate Book ○ Unsupported Format ○ Partner Request ○ Opted out - Special | Text |
OCR Analysis Score | Indicates text quality based on garbage text detector. 0 is bad, 100 is good. | Number |
OCR GTD Score | Indicates text quality based on OCR engine confidence. 0 is bad, 100 is good. | Number |
Opted Out | Indicates if a volume was opted out of scanning, either by the copyright holder or by Google policy, after it had been scanned. In these cases, volumes become unavailable for conversion and download from GRIN. In data: true, false. | Text |
Overall Error Percent | Percentage of a book determined by Google's error-detection algorithms to contain errors of all types. | Number |
Processed Date | Most recent date the volume was processed. Processing consists of cleaning, cropping, and digitally "flattening" pages, including optical character recognition. Volumes are processed shortly after they are scanned, and reprocessed infrequently after that. | Date |
Scanned Barcode | Key identifier supplied by the library and used to track both the physical and digital copies of the volume, as well as to retrieve the book on books.google.com (part of the URL structure) | Text |
Scanned Date | Most recent date the volume was scanned. | Date |
Scanning Status | Most recently reported status of the volume. In data: IN_PROCESS, CONVERTED, PREVIOUSLY_DOWNLOADED, CHECKED_IN, NOT_AVAILABLE_FOR_DOWNLOAD, NEW, null. | Text |
Viewability | Values in data: -, VIEW_FULL, VIEW_SNIPPET, VIEW_METADATA, VIEW_NONE. Dash is in field for items where State is Not Available for Download (where scannable is true) or Checked In where scannable is false. Note that Hathi data, not Google is the source of full text flags in HART. | Text |
Viewability Updated Date | Date on which viewability status of a volume changed in Google Book Search | Date |
, multiple selections available,