abstract
- This report documents EOSC EDEN ("Enhancing Digital preservation strategies at European and National level") Milestone 1.1, "Identification of Core Preservation Processes (CPPs) for WP2", produced by EOSC EDEN WP1 T1.2. A Core Preservation Process (CPP) is a specific action that every Trustworthy Digital Archive should undertake adequately, either directly or through its associated parties or services, in order to fulfil its digital preservation missions as evidenced in its preservation policy.
  The following assertions define the scope of CPPs:
  CPPs focus on the operational activities required by digital preservation (understood as covering short- to long-term preservation). They cover neither strategic or managerial digital preservation activities nor the whole list of activities of a generic information management system, including secure IT infrastructures.
  Digital preservation requires deep knowledge of the properties, structure and possible uses of digital objects, which can be domain- or discipline-specific. The scope of CPPs is nevertheless limited to generic processes that generally take place when performing digital preservation, regardless of specific content, domain or discipline.
  The list of CPPs and their descriptions are established by a group of digital preservation practitioners. In addition, consensus within the digital preservation community about this list will be evidenced by references to prominent maturity models, self-assessment frameworks and certification frameworks used by that community.
  Each CPP is described as a sequence of steps that can be implemented either by humans or by automation.
  This publication contains the following parts:

  M1.1 Report on Identification of Core Preservation Processes: Design, Guidance and Summary of Findings

  Annex Part 1 - Template and Glossary:
    CPP Template
    Glossary

  Annex Part 2 - CPP Descriptions:
    CPP-001 Checksum Generation and Recording
    CPP-002 Checksum Validation
    CPP-003 Integrity Checking
    CPP-004 Data Corruption Management
    CPP-005 Identifier Management
    CPP-006 AIP Batch Export
    CPP-007 Virus Scanning
    CPP-008 File Format Identification
    CPP-009 Metadata Extraction
    CPP-010 File Format Validation
    CPP-011 Replication
    CPP-012 Risk Mitigation
    CPP-013 Object Management Reporting
    CPP-014 File Migration
    CPP-015 Emulation and Rendering Tools
    CPP-016 Metadata Ingest and Management
    CPP-017 Disposal
    CPP-018 Community Watch
    CPP-019 Data Quality Assessment
    CPP-020 Rights Management
    CPP-021 AIP Versioning
    CPP-022 Significant Properties Definition
    CPP-023 Risk Definition and Extraction
    CPP-024 Enabling Discovery
    CPP-025 Enabling Access
    CPP-026 File Normalisation
    CPP-027 File Repair
    CPP-028 Creation of Derivatives
    CPP-029 Ingest
    CPP-030 Refreshment

  In addition to the dataset here, a visualisation tool for CPP relationships is available at https://cpp.fd-dev.csc.fi (code available at https://github.com/EOSC-EDEN/wp1-cpp-visualization).