Workflow-EHRI
📝 last update: 2023-09-03
💡 Presentation of the Project
The European Holocaust Research Infrastructure (EHRI) is a transnational organization with partners across Europe, Israel, and the United States of America. It promotes collaboration on Holocaust research and easy access to scattered sources. To this end, it created the EHRI Online Editions: collections of archival documents on the Holocaust, each gathered around a specific theme such as testimonies on the persecution of Jews or diplomatic reports.
🧱 Structure of the Repository
- ODD (.xml, .html, .pdf)
- RelaxNG Schema
- Overview Table
- Python scripts
- Presentation slides from the EHRI meeting (4 July 2023)
📋 Content
This repository holds the workflow for the semi-automated processing of EHRI's digital editions. The repository is composed of three folders, one for each issue.
✅ Issue no. 1 — Writing an exhaustive ODD
The "ODD" folder contains all the files related to the creation of the exhaustive ODD for the EHRI digital editions.
- The overview table summarizes the elements and attributes used by the editors for the four existing digital editions (BeGrenzte Flucht, Early Holocaust Testimony, Diplomatic Reports, and Nisko). This is the result of the preparatory work for writing the actual ODD.
- The Python scripts (extraction and research) and their results, which served as the basis for the overview table.
- The updated version of the ODD (IN PROGRESS), available in different formats (XML, HTML, and PDF).
- The RelaxNG schema that should be applied to the EHRI files to ensure conformance to the ODD.
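The kind of extraction that feeds an overview table of elements and attributes can be sketched as follows. This is a minimal illustration using only the Python standard library, not the repository's actual scripts: the function names `local_name` and `inventory` and the sample document are hypothetical, and namespace prefixes are dropped from the names that are counted.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    """Strip the '{namespace}' prefix ElementTree puts on qualified names."""
    return tag.rsplit("}", 1)[-1]


def inventory(xml_text):
    """Count element names and (element, attribute) pairs in one TEI document."""
    elements = Counter()
    attributes = Counter()
    root = ET.fromstring(xml_text)
    for node in root.iter():
        name = local_name(node.tag)
        elements[name] += 1
        for attr in node.attrib:
            attributes[(name, local_name(attr))] += 1
    return elements, attributes


# Hypothetical sample, invented for illustration only.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <text><body>
    <p>Report sent from <placeName ref="#place1">Prague</placeName> by
       <persName ref="#pers1">a diplomat</persName>.</p>
    <p n="2">Second paragraph.</p>
  </body></text>
</TEI>"""

elements, attributes = inventory(SAMPLE)
for name, count in sorted(elements.items()):
    print(name, count)
```

Running the same function over every file of an edition and summing the counters would give one column of such a table per edition.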
🔜 Issue no. 2 — Creating a semi-automated processing workflow
The "Encoding" folder will contain:
- The encoding templates for metadata and index entries.
- General guidelines for the semantic encoding of the files, and a tutorial for adding a named entity in one of the indices.
- The Python script for an automated encoding of the files' <teiHeader> (metadata), with an exhaustive step-by-step guide to run the script.
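Since the encoding script is still to come, the following is only a rough sketch of what generating a <teiHeader> from metadata could look like with the Python standard library. The function `build_tei_header`, the helper `tei`, the metadata keys, and the sample values are all assumptions made for illustration; the actual EHRI template will likely require many more fields.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
# Serialize TEI elements with a default namespace instead of an "ns0:" prefix.
ET.register_namespace("", TEI_NS)


def tei(tag):
    """Return the namespace-qualified name ElementTree expects."""
    return f"{{{TEI_NS}}}{tag}"


def build_tei_header(metadata):
    """Build a minimal teiHeader (titleStmt, publicationStmt, sourceDesc)."""
    header = ET.Element(tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))

    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = metadata["title"]

    publication_stmt = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(publication_stmt, tei("publisher")).text = metadata["publisher"]

    source_desc = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(source_desc, tei("p")).text = metadata["source"]
    return header


# Placeholder metadata, invented for this sketch.
header = build_tei_header({
    "title": "Sample document title",
    "publisher": "Example publisher",
    "source": "Description of the archival source",
})
print(ET.tostring(header, encoding="unicode"))
```

Keeping the metadata in a plain dictionary means the same function could be fed from a spreadsheet export or a CSV row per document, which fits a semi-automated workflow where editors still review the result.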
🔜 Issue no. 3 — Setting up a dedicated TEI Publisher application
The "TEI-Publisher" folder will contain:
- Specifications for the application
- Visual mock-ups for the application
- TEI Publisher prototype application
📑 Communications
- "Writing an ODD for the EHRI Online Editions -- Preparatory Work." Digital Intellectuals, 5 June 2023, https://digitalintellectuals.hypotheses.org/4765.
- "Writing an ODD for the EHRI Online Editions -- Specifications and Documentation." Digital Intellectuals, 28 July 2023, https://digitalintellectuals.hypotheses.org/5034.