Workflow-EHRI

📝 last update: 2023-09-03

💡 Presentation of the Project

The European Holocaust Research Infrastructure (EHRI) is a transnational organization with partners across Europe, Israel, and the United States of America. It promotes collaboration on Holocaust research and easier access to dispersed sources. To this end, it created the EHRI Online Editions: collections of archival documents on the Holocaust, each gathered around a specific theme, such as testimonies on the persecution of Jews or diplomatic reports.

🧱 Structure of the Repository

📋 Content

This repository holds the workflow for the semi-automated processing of EHRI's digital editions. The repository is composed of three folders, one for each issue.

✅ Issue no. 1 — Writing an exhaustive ODD

The "ODD" folder contains all the files related to the creation of the exhaustive ODD for the EHRI digital editions.

🔜 Issue no. 2 — Creating a semi-automated processing workflow

The "Encoding" folder will contain:

  • The encoding templates for metadata and index entries.
  • General guidelines for the semantic encoding of the files, and a tutorial for adding a named entity to one of the indices.
  • The Python script for the automated encoding of the files' <teiHeader> (metadata), with an exhaustive step-by-step guide for running the script.
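
To give an idea of what automated <teiHeader> encoding can look like, here is a minimal sketch using only the Python standard library. It is not the repository's script: the function name `build_tei_header`, the helper `tei`, the metadata keys ("title", "publisher", "source") and the sample values are all assumptions made for illustration, and the header it produces is far simpler than what the EHRI ODD is likely to require.

```python
import xml.etree.ElementTree as ET

# Official TEI P5 namespace; registering it with an empty prefix makes
# ElementTree serialize it as the default namespace.
TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)


def tei(tag: str) -> str:
    """Return the namespace-qualified name of a TEI element."""
    return f"{{{TEI_NS}}}{tag}"


def build_tei_header(metadata: dict) -> ET.Element:
    """Build a minimal <teiHeader> from a flat metadata dictionary.

    The keys "title", "publisher" and "source" are hypothetical field
    names chosen for this sketch, not the project's actual template.
    """
    header = ET.Element(tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))

    # <titleStmt> carries the title of the edited document.
    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = metadata["title"]

    # <publicationStmt> records who publishes the digital edition.
    publication_stmt = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(publication_stmt, tei("publisher")).text = metadata["publisher"]

    # <sourceDesc> describes the archival source in free prose.
    source_desc = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(source_desc, tei("p")).text = metadata["source"]

    return header


# Placeholder values for illustration only.
example = {
    "title": "Sample document",
    "publisher": "EHRI",
    "source": "Placeholder description of the archival source.",
}
header = build_tei_header(example)
print(ET.tostring(header, encoding="unicode"))
```

A real script would presumably read the metadata from an external file and validate the result against the ODD; both steps are left out of this sketch.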

🔜 Issue no. 3 — Setting up a dedicated TEI Publisher application

The "TEI-Publisher" folder will contain:

📑 Communications