|
|
# Imports
|
|
|
|
|
|
In the 'Images' tab of a Document you can find an 'Import' button which allows you to feed data from different sources to eScriptorium.
|
|
|
Note that you can **NOT** import both the images and the corresponding transcription at the same time, you need to do it in two steps.
|
|
|
Note that you can **NOT** yet import both the images and the corresponding transcription at the same time, you need to do it in two steps.
|
|
|
|
|
|
## IIIF
|
|
|
Input a valid iiif manifesto uri to import all its images in full resolution along with metadatas.
|
|
|
|
|
|
## Alto XML
|
|
|
Upload a valid [ALTO XML](https://en.wikipedia.org/wiki/ALTO_(XML)) file for segmentation and transcriptions.
|
|
|
The file is strictly validated against ALTO v4 (http://www.loc.gov/standards/alto/ns-v4#), if it's not valid an error message will (perhaps) help you fix the issue.
|
|
|
Each `<Page>` in the file will correspond to a selected card in the same order.
|
|
|
The file is strictly validated against [a future version of ALTO v4](https://gitlab.inria.fr/scripta/escriptorium/blob/develop/app/escriptorium/static/alto-4-1-baselines.xsd), if it's not valid an error message will (hopefully) help you fix the issue.
|
|
|
|
|
|
The 'name' field is the name of the transcription in which the text content will be stored (you can select it above the transcription panel). It is possible to import content from different files in the same transcription this way.
|
|
|
By defaults the segmentation for the selected images, both regions and lines, will be deleted. You can disable this behavior by unchecking 'Override existing segmentation.', in which case the system will try to match the lines and regions by their `ID` attribute. The old content for matching lines is then stored in its history and new lines/regions are created when no matching existing element are found.
|
|
|
|
|
|
Here is the most basic example, 2 pages with 2 lines each.
|
|
|
Other tags (description, glyphs) are ignored for now.
|
|
|
A basic example:
|
|
|
```xml
|
|
|
<?xml version="1.0" encoding="UTF-8"?>
|
|
|
<alto xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
|
|
|
xmlns="http://www.loc.gov/standards/alto/ns-v4#"
|
|
|
xsi:schemaLocation="http://www.loc.gov/standards/alto/ns-v4# http://www.loc.gov/standards/alto/v4/alto-4-0.xsd">
|
|
|
<Description>
|
|
|
<MeasurementUnit>pixel</MeasurementUnit>
|
|
|
<sourceImageInformation>
|
|
|
<fileName>my_image.png</fileName>
|
|
|
</sourceImageInformation>
|
|
|
</Description>
|
|
|
<Layout>
|
|
|
<Page WIDTH="850" HEIGHT="1083" PHYSICAL_IMG_NR="0" ID="page_0">
|
|
|
<PrintSpace HPOS="0" VPOS="0" WIDTH="850" HEIGHT="1083">
|
|
|
<TextBlock HPOS="0" VPOS="0" ID="textblock_0" WIDTH="850" HEIGHT="1083">
|
|
|
<TextLine ID="line_0" HPOS="160" VPOS="771" WIDTH="0" HEIGHT="28">
|
|
|
<String ID="segment_0" CONTENT="This is an example" HPOS="160" VPOS="771" WIDTH="0" HEIGHT="28" WC="0.9995"></String>
|
|
|
<TextLine ID="line_0" HPOS="160" VPOS="771" WIDTH="25" HEIGHT="28">
|
|
|
<Shape>
|
|
|
<Polygon POINTS="160,771,185,799"/>
|
|
|
</Shape>
|
|
|
<String ID="segment_0" CONTENT="This is an example" HPOS="160" VPOS="771" WIDTH="25" HEIGHT="28" WC="0.9995"></String>
|
|
|
</TextLine>
|
|
|
</TextBlock>
|
|
|
</PrintSpace>
|
|
|
</Page>
|
|
|
<Page WIDTH="850" HEIGHT="1083" PHYSICAL_IMG_NR="0" ID="page_1">
|
|
|
<PrintSpace HPOS="0" VPOS="0" WIDTH="850" HEIGHT="1083">
|
|
|
<TextBlock HPOS="0" VPOS="0" ID="textblock_1" WIDTH="850" HEIGHT="1083">
|
|
|
<TextLine ID="line_1" HPOS="160" VPOS="771" WIDTH="0" HEIGHT="28">
|
|
|
<String ID="segment_1" CONTENT="This is also an example" HPOS="160" VPOS="771" WIDTH="0" HEIGHT="28" WC="0.9995"></String>
|
|
|
</TextLine>
|
|
|
</TextBlock>
|
|
|
</PrintSpace>
|
|
|
</Page>
|
|
|
</Layout>
|
|
|
</alto>`
|
|
|
```
|
... | ... | |