Help | HUNAYNNET

READING INTERFACE

The primary purpose of the reading interface is to display the texts in a manner resembling traditional critical editions, enriched with modern lexicographic tools and providing a streamlined and user-friendly digital experience.

Adding text versions

When you open a page for a particular text, the Greek version appears as the first one by default. To add additional versions, click on the "Add parallel version" menu top right. To remove a text from the display (except for Greek), click on the Close button in the respective text pane.

Edition information and critical apparatus

Wherever applicable, blue numbers enclosed in brackets - e.g. [16a5] - represent Bekker numbers, references to Kühn's editions of Galen's works. In the Greek versions, blue pipes like | indicate every line in the original edition not divisible by 5 or not ending in 1. Red pipes like | are used in Syriac and Arabic text indicate a page/folio number and line number of an edition or a manuscript (see individual text info pages for details); hover over one with the mouse cursor to display it. Orange circles like • denote an apparatus entry; click it to display the entry, click again to close it. All the information relating to the edition, including what these marks refer to, is displayed after clicking on the (i) button in the upper right corner.

Navigate the text

By default, the individual text panes display the entire text. To navigate to a particular portion of it, you can use (in addition to scrolling) one of the two dropdown menus. The first menu allows you to move between chapters or equivalent units (e.g. individual propositions in Euclid's work). The second menu enables you to navigate the text based on standard divisions of the critical editions of the Greek text, such as Bekker numbers. In both cases, the content of the window will scroll to the sync unit (see below) containing the marker of the division.

Annotation and lexicographic information

Arabic numbers enclosed in parentheses like (4) (which are referred to as 'sync marks') indicate synchronization (sync) units; these are sentences or syntactic units of similar extent which are linked through all text versions. Hover over the sync mark to highlight the associated sync unit across all displayed texts. Whenever you scroll in any of the texts, you can click on the sync mark to synchronize the texts to that particular sync unit.

Each word can be looked up in a number of online dictionaries, e.g. Perseus Greek Word Study Tool, Sedra for Syriac, ElixirFM morphological analyzer for Arabic, and Glossarium Graeco-Arabicum for both Greek and Arabic. Whenever you hover over a word, the cursor changes to a question mark and the word is underlined. This indicates that you can click it to open a dictionary menu where you can search for the word (now highlighted) in any of these dictionaries.

Automated lemmatization and part-of-speech tagging (see below) has been provided for Greek (CLTK, based on the PROIEL treebank) and Arabic (Camel Tools). For Syriac, some texts have been annotated manually, others automatically using manual annotation as training data; both the manual and the automated annotation we provided by the Gregori project. In all cases, as is common, the accuracy is not 100%. For the purposes of achieving the best results, some dictionaries are provided with the wordform, others with the lemma.

Parallel corpus

While the reading interface is designed to engage with the texts in a more traditional manner, the parallel linguistic corpus is the main tool for lexicographic and linguistic analysis of the texts edited within the scope of the project.

The functionality is provided by the NoSketchEngine corpus management software. Due to the technical limitations, the sentence (sync unit) alignment requires the creation of 6 parallel corpora and as the number of versions is not the same across works, this in turn requires inserting dummy works and dummy sync units. The table below provides an overview.

HUNAYNNET_gre	HUNAYNNET_syr1	HUNAYNNET_syr2	HUNAYNNET_syr3	HUNAYNNET_ara1	HUNAYNNET_ara2
Categoriae	Categoriae (Syriac: Anonymous)	Categoriae (Syriac: George bishop of the Arabs)	Categoriae (Syriac: Jacob)	Categoriae (Arabic: Isḥāq ibn Ḥunayn)	Dummy
De interpretatione	De interpretatione (Syriac: Anonymous)	De interpretatione (Syriac: George bishop of the Arabs)	Dummy	De interpretatione (Arabic: Ascribed to Isḥāq ibn Ḥunayn)	De interpretatione (Arabic: Anonymous)
Analytica priora, book I, chapters 1-7	Analytica priora, book I, chapters 1-7 (Syriac: Ascribed to George bishop of the Arabs)	Analytica priora, book I, chapters 1-7 (Syriac: Ascribed to Proba)	Dummy	Analytica priora, book I, chapters 1-7 (Arabic: Tadārā or Ṯaḏārā (= Theodore Abū Qurra?))	Dummy
Analytica priora, book I, chapters 8-46	Analytica priora, book I, chapters 8-46 (Syriac: Ascribed to George bishop of the Arabs)	Dummy	Dummy	Analytica priora, book I, chapters 8-46 (Arabic: Tadārā or Ṯaḏārā (= Theodore Abū Qurra?))	Dummy
Analytica priora, book II	Analytica priora, book II (Syriac: Ascribed to George bishop of the Arabs)	Dummy	Dummy	Analytica priora, book II (Arabic)	Dummy
De poetica	De poetica (Syriac)	Dummy	Dummy	De poetica (Arabic)	Dummy
De virtutibus et vitiis	De virtutibus et vitiis (Syriac)	Dummy	Dummy	De virtutibus et vitiis (Arabic)	Dummy
De mundo	De mundo (Syriac: Ascribed to Sergius of Rēšʿaynā)	Dummy	Dummy	De mundo (Arabic: ʿĪsā ibn Ibrāhīm al-Nafīsī)	Dummy
Isagoge	Isagoge (Syriac: Anonymous)	Isagoge (Syriac: Ascribed to Athanasius of Balad)	Dummy	Isagoge (Arabic: Abū ʿUṯmān al-Dimašqī)	Dummy
De simplicium medicamentorum temperamentis et facultatibus liber VI (SMT 6)	De simplicium medicamentorum temperamentis et facultatibus liber VI (Syriac: Sergius of Rēšʿaynā)	Dummy	Dummy	De simplicium medicamentorum temperamentis et facultatibus liber VI (Arabic: Ḥunayn ibn Isḥāq)	De simplicium medicamentorum temperamentis et facultatibus liber VI (Arabic: Al-Biṭrīq)
De simplicium medicamentorum temperamentis et facultatibus liber VII (SMT 7)	De simplicium medicamentorum temperamentis et facultatibus liber VII (Syriac)	Dummy	Dummy	De simplicium medicamentorum temperamentis et facultatibus liber VII (Arabic)	Dummy
De simplicium medicamentorum temperamentis et facultatibus liber VIII (SMT 8)	De simplicium medicamentorum temperamentis et facultatibus liber VIII (Syriac)	Dummy	Dummy	De simplicium medicamentorum temperamentis et facultatibus liber VIII (Arabic)	Dummy
Ars medica	Ars medica (Syriac: Ascribed to Sergius of Rēšʿaynā)	Dummy	Dummy	Ars medica (Arabic: Ḥunayn ibn Isḥāq)	Dummy
Aphorismi	Aphorismi (Syriac)	Dummy	Dummy	Aphorismi (Arabic)	Dummy
Prognosticon	Prognosticon (Syriac)	Dummy	Dummy	Prognosticon (Arabic)	Dummy
Elements	Elements (Syriac)	Dummy	Dummy	Elements (Arabic)	Dummy

Linguistic annotation in the parallel corpus

All texts in the parallel corpus are annotated on the token (wordform) level. This means that each wordform has several levels of linguistic information assigned to enable linguistic analysis and to enable better search capabilities. The following table provides an overview of the levels of annotation.

annotation	example	explanation
wordform	ζῴῳ	word as it appears in the text; in Arabic and Syriac, this includes the proclitics and enclitics
lemma	ζῷον	dictionary entry in its base form (depending on the language), without any clitics or morphology
normalized wordform	ζωω	wordform with any but the most basic characters removed; for Greek, this means breathings and accents; for Arabic, this includes vowels and the hamza; for Syriac, all vowel marks and dots are removed
part-of-speech tag	NOUN	the word class of the wordform expressed as an abbreviation

For Greek and Arabic, the parts of speech are those contained in the standardized universal part of speech tagset. The full list can be found in the table below.

tag	part of speech	explanation
ADJ	adjective	words denoting properties which are inflected according to gender, number and case and agree with nouns in all those; this includes participles and ordinal numerals
ADP	adposition	prepositions and postpositions
ADV	adverb	manner words, i.e. words that answer questions like “how”, “when”, “where” and “by what means”
AUX	auxiliary	auxiliary verbs, including copulas
CCONJ	coordinating conjunction
DET	determiner	definite articles and quantifiers
INTJ	interjection
NOUN	noun
NUM	numeral
PART	particle	verbal particles
PRON	pronoun
PROPN	proper name	names of people, countries, cities etc.
PUNCT	punctuation
SCONJ	subordinating conjunction
SYM	symbol
VERB	verb
X	anything else

For Syriac, we employ the tagset developed by Kindt et. al 2018 (PDF) as modified below:

tag	part of speech	explanation
ADJ	adjective	words denoting properties which are inflected according to gender, number and state and agree with nouns in all those
ADV	adverb	manner words, i.e. words that answer questions like “how”, “when”, “where” and “by what means”, that modify predicates, adjectives or other adverbs
CARD	cardinal numeral
NAME	proper noun	names of people, countries, cities etc.
NOUN	generic noun
ORD	ordinal numeral
PART	particle	all prepositions (including the ubiquitous d-); all conjunctions; negators; interjections; existentials and other grammatical words
PRO_dem	demonstrative pronoun
PRO_ind	indefinite pronoun
PRO_int	interrogative pronoun
PRO_pers	personal pronoun
V	verb	verbs, including participles