@@ -83,22 +83,23 @@ the content can be stored in `Textline` or each `Word` is separated for example
 </TextLine>

-<TextLine id="l1">
-<Coords points="1550,422 1555,422"/>
-<Word id="w122" language="Hebrew" primaryScript="Hebr - Hebrew"
-readingDirection="right-to-left">
-<Coords points="926,424 926,426"/>
-<TextEquiv>
-<Unicode>ע"י</Unicode>
-</TextEquiv></Word>
-<Word id="w45" language="Hebrew" primaryScript="Hebr - Hebrew"
-readingDirection="right-to-left">
-<Coords points="531,464 687,464 "/>
-<TextEquiv>
-<Unicode>הוט</Unicode>
-</TextEquiv>
-</Word>
-<TextLine>
+<TextLine id="l1">
+<Coords points="1550,422 1555,422"/>
+<Word id="w122" language="Hebrew" primaryScript="Hebr - Hebrew"
+readingDirection="right-to-left">
+<Coords points="926,424 926,426"/>
+<TextEquiv>
+<Unicode>ע"י</Unicode>
+</TextEquiv>
+</Word>
+<Word id="w45" language="Hebrew" primaryScript="Hebr - Hebrew"
+readingDirection="right-to-left">
+<Coords points="531,464 687,464 "/>
+<TextEquiv>
+<Unicode>הוט</Unicode>
+</TextEquiv>
+</Word>
+<TextLine>

Example of a full PageXML file:

```xml