The primary purpose of the reading interface is to display the texts in a manner resembling traditional critical editions, enriched with modern lexicographic tools and providing a streamlined and user-friendly digital experience.
When you open a page for a particular text, the Greek version appears as the first one by default. To add additional versions, click on the "Add parallel version" menu top right. To remove a text from the display (except for Greek), click on the Close button in the respective text pane.
Wherever applicable, blue numbers enclosed in brackets - e.g. [16a5] - represent Bekker numbers, references to Kühn's editions of Galen's works. In the Greek versions, blue pipes like | indicate every line in the original edition not divisible by 5 or not ending in 1. Red pipes like | are used in Syriac and Arabic text indicate a page/folio number and line number of an edition or a manuscript (see individual text info pages for details); hover over one with the mouse cursor to display it. Orange circles like • denote an apparatus entry; click it to display the entry, click again to close it. All the information relating to the edition, including what these marks refer to, is displayed after clicking on the (i) button in the upper right corner.
By default, the individual text panes display the entire text. To navigate to a
particular portion of it, you can use (in addition to scrolling)
one of the two dropdown menus. The first menu allows you to move between chapters or equivalent units (e.g.
individual propositions in Euclid's work). The second menu
enables you to navigate the text based on standard divisions of the critical editions of the Greek text,
such as Bekker numbers. In both cases, the content of the window will
scroll to the sync unit (see below) containing the marker of the division.
Arabic numbers enclosed in parentheses like (4) (which are referred to as 'sync marks') indicate synchronization (sync) units; these are sentences or syntactic units of similar extent which are linked through all text versions. Hover over the sync mark to highlight the associated sync unit across all displayed texts. Whenever you scroll in any of the texts, you can click on the sync mark to synchronize the texts to that particular sync unit.
Each word can be looked up in a number of online dictionaries, e.g. Perseus Greek Word Study Tool, Sedra for Syriac, ElixirFM morphological analyzer for Arabic, and Glossarium Graeco-Arabicum for both Greek and Arabic. Whenever you hover over a word, the cursor changes to a question mark and the word is underlined. This indicates that you can click it to open a dictionary menu where you can search for the word (now highlighted) in any of these dictionaries.
Automated lemmatization and part-of-speech tagging (see below) has been provided for Greek (CLTK, based on the PROIEL treebank) and Arabic (Camel Tools). For Syriac, some texts have been annotated manually, others automatically using manual annotation as training data; both the manual and the automated annotation we provided by the Gregori project. In all cases, as is common, the accuracy is not 100%. For the purposes of achieving the best results, some dictionaries are provided with the wordform, others with the lemma.
While the reading interface is designed to engage with the texts in a more traditional manner, the parallel linguistic corpus is the main tool for lexicographic and linguistic analysis of the texts edited within the scope of the project.
The functionality is provided by the NoSketchEngine corpus management software. Due to the technical limitations, the sentence (sync unit) alignment requires the creation of 6 parallel corpora and as the number of versions is not the same across works, this in turn requires inserting dummy works and dummy sync units. The table below provides an overview.
HUNAYNNET_gre | HUNAYNNET_syr1 | HUNAYNNET_syr2 | HUNAYNNET_syr3 | HUNAYNNET_ara1 | HUNAYNNET_ara2 |
---|---|---|---|---|---|
Categoriae | Categoriae (Syriac: Anonymous) | Categoriae (Syriac: George bishop of the Arabs) | Categoriae (Syriac: Jacob) | Categoriae (Arabic: Isḥāq ibn Ḥunayn) | Dummy |
De interpretatione | De interpretatione (Syriac: Anonymous) | De interpretatione (Syriac: George bishop of the Arabs) | Dummy | De interpretatione (Arabic: Ascribed to Isḥāq ibn Ḥunayn) | De interpretatione (Arabic: Anonymous) |
Analytica priora, book I, chapters 1-7 | Analytica priora, book I, chapters 1-7 (Syriac: Ascribed to George bishop of the Arabs) | Analytica priora, book I, chapters 1-7 (Syriac: Ascribed to Proba) | Dummy | Analytica priora, book I, chapters 1-7 (Arabic: Tadārā or Ṯaḏārā (= Theodore Abū Qurra?)) | Dummy |
Analytica priora, book I, chapters 8-46 | Analytica priora, book I, chapters 8-46 (Syriac: Ascribed to George bishop of the Arabs) | Dummy | Dummy | Analytica priora, book I, chapters 8-46 (Arabic: Tadārā or Ṯaḏārā (= Theodore Abū Qurra?)) | Dummy |
Analytica priora, book II | Analytica priora, book II (Syriac: Ascribed to George bishop of the Arabs) | Dummy | Dummy | Analytica priora, book II (Arabic) | Dummy |
De poetica | De poetica (Syriac) | Dummy | Dummy | De poetica (Arabic) | Dummy |
De virtutibus et vitiis | De virtutibus et vitiis (Syriac) | Dummy | Dummy | De virtutibus et vitiis (Arabic) | Dummy |
De mundo | De mundo (Syriac: Ascribed to Sergius of Rēšʿaynā) | Dummy | Dummy | De mundo (Arabic: ʿĪsā ibn Ibrāhīm al-Nafīsī) | Dummy |
Isagoge | Isagoge (Syriac: Anonymous) | Isagoge (Syriac: Ascribed to Athanasius of Balad) | Dummy | Isagoge (Arabic: Abū ʿUṯmān al-Dimašqī) | Dummy |
De simplicium medicamentorum temperamentis et facultatibus liber VI (SMT 6) | De simplicium medicamentorum temperamentis et facultatibus liber VI (Syriac: Sergius of Rēšʿaynā) | Dummy | Dummy | De simplicium medicamentorum temperamentis et facultatibus liber VI (Arabic: Ḥunayn ibn Isḥāq) | De simplicium medicamentorum temperamentis et facultatibus liber VI (Arabic: Al-Biṭrīq) |
De simplicium medicamentorum temperamentis et facultatibus liber VII (SMT 7) | De simplicium medicamentorum temperamentis et facultatibus liber VII (Syriac) | Dummy | Dummy | De simplicium medicamentorum temperamentis et facultatibus liber VII (Arabic) | Dummy |
De simplicium medicamentorum temperamentis et facultatibus liber VIII (SMT 8) | De simplicium medicamentorum temperamentis et facultatibus liber VIII (Syriac) | Dummy | Dummy | De simplicium medicamentorum temperamentis et facultatibus liber VIII (Arabic) | Dummy |
Ars medica | Ars medica (Syriac: Ascribed to Sergius of Rēšʿaynā) | Dummy | Dummy | Ars medica (Arabic: Ḥunayn ibn Isḥāq) | Dummy |
Aphorismi | Aphorismi (Syriac) | Dummy | Dummy | Aphorismi (Arabic) | Dummy |
Prognosticon | Prognosticon (Syriac) | Dummy | Dummy | Prognosticon (Arabic) | Dummy |
Elements | Elements (Syriac) | Dummy | Dummy | Elements (Arabic) | Dummy |
All texts in the parallel corpus are annotated on the token (wordform) level. This means that each wordform has several levels of linguistic information assigned to enable linguistic analysis and to enable better search capabilities. The following table provides an overview of the levels of annotation.
annotation | example | explanation |
---|---|---|
wordform | ζῴῳ | word as it appears in the text; in Arabic and Syriac, this includes the proclitics and enclitics |
lemma | ζῷον | dictionary entry in its base form (depending on the language), without any clitics or morphology |
normalized wordform | ζωω | wordform with any but the most basic characters removed; for Greek, this means breathings and accents; for Arabic, this includes vowels and the hamza; for Syriac, all vowel marks and dots are removed |
part-of-speech tag | NOUN | the word class of the wordform expressed as an abbreviation |
For Greek and Arabic, the parts of speech are those contained in the standardized universal part of speech tagset. The full list can be found in the table below.
tag | part of speech | explanation |
---|---|---|
ADJ | adjective | words denoting properties which are inflected according to gender, number and case and agree with nouns in all those; this includes participles and ordinal numerals |
ADP | adposition | prepositions and postpositions |
ADV | adverb | manner words, i.e. words that answer questions like “how”, “when”, “where” and “by what means” |
AUX | auxiliary | auxiliary verbs, including copulas |
CCONJ | coordinating conjunction | |
DET | determiner | definite articles and quantifiers |
INTJ | interjection | |
NOUN | noun | |
NUM | numeral | |
PART | particle | verbal particles |
PRON | pronoun | |
PROPN | proper name | names of people, countries, cities etc. |
PUNCT | punctuation | |
SCONJ | subordinating conjunction | |
SYM | symbol | |
VERB | verb | |
X | anything else |
For Syriac, we employ the tagset developed by Kindt et. al 2018 (PDF) as modified below:
tag | part of speech | explanation |
---|---|---|
ADJ | adjective | words denoting properties which are inflected according to gender, number and state and agree with nouns in all those |
ADV | adverb | manner words, i.e. words that answer questions like “how”, “when”, “where” and “by what means”, that modify predicates, adjectives or other adverbs |
CARD | cardinal numeral | |
NAME | proper noun | names of people, countries, cities etc. |
NOUN | generic noun | |
ORD | ordinal numeral | |
PART | particle | all prepositions (including the ubiquitous d-); all conjunctions; negators; interjections; existentials and other grammatical words |
PRO_dem | demonstrative pronoun | |
PRO_ind | indefinite pronoun | |
PRO_int | interrogative pronoun | |
PRO_pers | personal pronoun | |
V | verb | verbs, including participles |