... | ... | @@ -72,7 +72,7 @@ if you faced an error message like this |
|
|
Update the attributes xmlns and schemaLocation of `<PcGts>` to supported version as descirbed above.
|
|
|
By defaults the segmentation for the selected images, both regions and lines, will be deleted. You can disable this behavior by unchecking 'Override existing segmentation.', in which case the system will try to match the lines and regions by their `ID` attribute. The old content for matching lines is then stored in its history and new lines/regions are created when no matching existing element are found.
|
|
|
TextRegion tag have a liste of coordinates as type `x1,y1 x2,y2...xn,yn` it describe a polygon. Baseline tag is optional in PageXml.
|
|
|
the content ca be stored in `Textline` or each `Word` is separated for example
|
|
|
the content ca be stored in `Textline` for example
|
|
|
|
|
|
|
|
|
<TextLine id="r2l1" custom="readingOrder {index:0;}">
|
... | ... | @@ -83,7 +83,7 @@ the content ca be stored in `Textline` or each `Word` is separated for example |
|
|
</TextEquiv>
|
|
|
</TextLine>
|
|
|
|
|
|
|
|
|
or each `Word` is separated with its `Coords` and content inside a `Textline`
|
|
|
<TextLine id="l1">
|
|
|
<Coords points="1550,422 1555,422"/>
|
|
|
<Word id="w122" language="Hebrew" primaryScript="Hebr - Hebrew"
|
... | ... | |