Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

[ TODO stub ]

Table of Contents

This project charter will document the project to convert JP2s to HTJ2K once the tools are developed and the pipeline is operational. There may be other data transformation pipelines that we will be added as they are needed for integration with new DRSis an umbrella page for project charters specific to data quality assessment and improvement in advance of, and continuing past, a DRS Futures release. Full charters are developed for more specific projects in sub-pages.

I. Problem/Value Statement

Problem Statement

We want to improve the quality of existing records in DRS in order to take full advantage of the software architecture and workflow improvements that DRS Futures promises to bring about.

Areas of particular interest are:

  1. Substandard media files, especially images, many of which are not fit for public delivery and are taxing our delivery infrastructure, causing disruption and slowness of services.
  2. Digital records lacking accurate and exhaustive metadata, which may affect relevance of search results in content management, digital preservation, and discovery services.
  3.  Relevance and appropriateness of the current DRS ontology and content models, which should be measured against today's Harvard stakeholders' needs.

Business Value

By creating delivery files better suited for high-traffic, highly visible public platforms, we would drastically improve the overall quality of our public-facing services.

II. Vision and Approach

Even though delivery services are out of scope for the DRS Futures project, the changes necessary to improve delivery files begins at the DRS level, by creating ad-hoc deliverable copies of archival and production masters, which are currently serving the dual purpose of archival and delivery. The solution to this is to establish post-deposit processes that automatically generate an ad-hoc delivery file for each archival file deposited, thus separating the two purposes.

Metadata completeness and accuracy may be assessed by establishing a set of criteria that can be computationally evaluated, to rank and classify existing digital records using automated tools. Remediation actions may be decided once an exhaustive report is available from this assessment step.

The current DRS content model and ontology

Our vision aligns with the following Harvard Library multi-year goals and objectives (MYGOs):

  • MYGO #8: Focus technical services on effective workflows and metadata that matter the most
    • By offering a centralized service for efficient, high-quality, and scalable processing of highly visible (public) files that can be used across campus we would encourage discontinuing one-off solutions that individual departments have been developing for lack of a better alternative, incurring in additional maintenance costs and inconsistent, often substandard output quality.
  • MYGO #10: Focus on space as a service, considering the most cost-effective approaches to user interests, collections security and preservation, and staff needs in HL and HCL facilities
    • [ To Do ]
  • MYGO #14: Minimize the environmental impact of collections, services, and spaces
    • As described in MYGO #10, the goal of this project is to save computing resources, thus reducing the environmental footprint of our services.

Our vision aligns with the following HUIT objectives and key results (OKRs): 

  • Develop a plan for automation in each service area for critical, frequently used or heavily manual workflows
    • This project seeks to optimize one of the most critical and frequently used workflows in the content production chain.

III. In Scope/Out of Scope

In Scope

  • [ To Do ]


Out of Scope

  • [ To Do ]