Domari language

Domari is an endangered Indo-Aryan language, spoken by older Dom people scattered across the Middle East and North Africa. The language is reported to be spoken as far north as Azerbaijan and as far south as central Sudan, in Turkey, Iran, Iraq, Palestine, Israel, Jordan, Egypt, Sudan, Libya, Tunisia, Algeria, Morocco, Syria and Lebanon.[1] Based on the systematicity of sound changes, we know with a fair degree of certainty that the names Domari and Romani derive from the Indo-Aryan word ḍom.[4] The language itself actually derives from an Indo-Aryan language. The Arabs referred to them as nawar as they were a nomadic people that originally immigrated to the Middle East from India.[5]

Dōmʋārī, Dōmʋārī ǧib, Dômarî ĵib, דּוֺמָרִי ,دٛومَرِي
Native toIran, Egypt, Libya, Tunisia, Algeria, Morocco, Palestine, Syria, Turkey, Azerbaijan, Jordan, Sudan, and perhaps neighboring countries[1]
RegionMiddle East and North Africa, Caucasus, Central Asia
Native speakers
281,670 (2015)[2]
Latin, Arabic, Hebrew
Language codes
ISO 639-3rmt

Domari is also known as "Middle Eastern Romani", "Tsigene", "Luti", or "Mehtar". There is no standard written form. In the Arab world, it is occasionally written using the Arabic script and has many Arabic and Persian loanwords.[6] Descriptive work was done by Yaron Matras,[7] who published a comprehensive grammar of the language along with an historical and dialectological evaluation of secondary sources (Matras 2012).

Domari is an endangered language and is currently being shifted away from in younger generations, according to Yaron Matras. In certain areas such as Jerusalem, only about 20% of these Dom people, known as “Middle Eastern Gypsies”, speak the Domari language in everyday interactions. The language is mainly spoken by the elderly in the Jerusalem community. The younger generation are more influenced by Arabic, therefore most only know basic words and phrases. The modern-day community of Doms in Jerusalem was established by the nomadic people deciding to settle inside the Old City from 1940 until it came under Israeli administration in 1967 (Matras 1999).


The best-known variety of Domari is Palestinian Domari, also known as "Syrian Gypsy", the dialect of the Dom community of Jerusalem, which was described by R.A. S. Macalister in the 1910s. Palestinian Domari is an endangered language, with fewer than 200 speakers, the majority of the 1,200 members of the Jerusalem Domari community being native speakers of Palestinian Arabic.

Other dialects include:

Some dialects may be highly divergent and not mutually intelligible. Published sources often lump together dialects of Domari and the various unrelated in-group vocabularies of diverse peripatetic populations in the Middle East. Thus there is no evidence at all that the Lyuli, for example, speak a dialect of Domari, not is there any obvious connection between Domari and the vocabulary used by the Helebi of Egypt (see discussion in Matras 2012, chapter 1).

The small Seb Seliyer language of Iran is distinctive in its core vocabulary.


In the 1940s, the Dom began to abandon their nomadic culture and began settling and working in the local economy. This led to the next phenomenon; the assimilation of Dom children in the primary school system which marked the first generation to grow up in an academic environment alongside Arab children. Consequently, this 1940 generation do not fluently speak the Domari language. Arabic replaced their native Domari, and became the language of cross-generation communication. In Jerusalem, it is estimated there are about 600-900 members of the Dom population in Jerusalem. Less than 10% can effectively communicate in Jerusalem Domari.[9]

Comparison with Romani

Domari was once thought to be the "sister language" of Romani, the two languages having split after the departure from the Indian subcontinent, but more recent research suggests that the differences between them are significant enough to treat them as two separate languages within the Central zone (Hindustani) group of languages. The Dom and the Rom are most likely to be descendants of 2 different migration waves out of India, separated by several centuries.[10][11]

There remain similarities between the two, aside from their shared Central zone Indic origin, indicating a period of shared history as itinerant populations in the Middle East. These include: shared archaisms, which have been lost in the Central Indo-Aryan languages over the millennia since Dom/Rom emigration, a series of innovations connecting them with the Northwestern zone group, indicating their route of migration out of India, and a number of radical syntactical changes, due to superstrate influence of Middle Eastern languages, including Persian, Arabic and Byzantine Greek.


Since Domari is a minority Middle-Eastern language for a specific community of speakers, it did not have a standard orthography for many years; therefore many writers have used differing spelling systems (similarly to what happened with Ladino). Most Middle-Easterners used the Arabic script, while scholars made do with a modified Pan-Vlakh Latin-based alphabet.

Modified Pan-Vlakh orthography

In 2012, Yaron Matras used such a system in his recent publications on this subject where the Pan-Vlakh orthography served as a basis, with several modifications:

  • Romani j changed to y
  • Romani c use limited to the accented form č for /tʃ/, the /dʒ/ counterpart being denoted by
  • Doubled vowel letters for long vowels (aa ee ii oo uu)
  • Diphthongs denoted with vowel pairs (ai au ei eu oi and so on ... )
  • Additional letters in use for Semitic-derived words and names (ḍ ḥ ṣ ṭ ẓ ġ q ‘ ’ and so on ... )

Pan-Domari Alphabet

A new Semitic-flavored Latin-based pan-alphabet has recently been introduced by some scholars for the purpose of codifying written Domari.

The Pan-Domari Alphabet, which was invented in 2015, is a Semitic-flavored simplification of the previous Matras notation:

  • Y is used for /j/, and w for /w/—like in English
  • X is used for the sound /x/—the well-known guttural {kh} of Greek, Russian, and Middle Eastern languages
  • Q stands for /q/, the uvular plosive sound heard in the Semitic languages
  • Circumflexes are used to mark long vowels <â ê î ô û> and certain fricative/affricate consonants <ĉ ĝ ĵ ŝ ẑ> (={ch gh j sh zh})
  • Underdots under letters represent pharyngeal(-ized) consonants <ḍ ḥ ṣ ṭ ẓ> (IPA /d̪ˤ ħ sˤ t̪ˤ zˤ/)
  • Other letters include þ (thorn) and ð (edh) for the interdental fricatives /θ ð/, the characters <ʾ> (ʾalef/hamzaʾ—IPA /ʔ/) and <ʿ> (ʿayn—IPA /ʕ/), and the letter <ə> for the vowel sound shəwaʾ.
  • The diphthongs are now denoted by vowel + approximant digraphs <ay aw ey ew oy ... >.

The Pan-Domari Alphabet is shown in this table:

LetterAlternate§IPALetter NameIn IPAExample
ʾ/ʔ/ʾalefu/ʔɑ·lɛ·fʊ/ʾalefu-bêþah alphabet
A a/ɑ/a/ɑ/arat night
 âĀ ā/aː/â/aː/âxir last, final
B b/b/be/bɛ/bâsbort passport
C c**ce*Coke (Qôk)
Ĉ ĉCH ch/tʃ/ĉe/tʃɛ/ĉôna boy
D d/d̪/de/d̪ɛ/dînar dinar
Ḍ ḍ/d̪ˤ/ḍe/d̪ˤɛ/ḍanḍ tooth; ḍêf guest
Ð ðDH dh/ð/ðe/ðɛ/ðawḥâʾ dhow (small dinghy boat)
E e/ɛ/'e/ɛ/eras this
Ê êĒ ē/eː/ê/eː/ĵêb pocket
F f/f/~/ɸ/1fe/fɛ/finĵân cup
G g/ɡ/ge/ɡɛ/gêsu wheat
Ĝ ĝGH gh/ɣ/~/ʁ/2ĝe/ɣɛ/ĝassâle washing machine, washer
H h/h/he/hɛ/Hnûd Indus (river)
Ḥ ḥ/ħ/ḥe/ħɛ/ḥaqq right
I i/ɪ/i/ɪ/Isrâʾîl Israel
Î îĪ ī/iː/î/iː/nhîr blood
J j**je*Jeep (Ĵîp)
Ĵ ĵJ j/dʒ/ĵe/dʒɛ/ĵâr neighbor
K k/k/ke/k/kâz gas(oline), petrol
L l/l̪/le/l̪ɛ/Libnân Lebanon
M m/m/me/mɛ/mâsûra tube
N n/n̪/~/ŋ/~/ɲ/3ne/n̪ɛ/nohri tomato
ʿ/ʕ/ʿayenu/ʕɑ·jɛ·n̪ʊ/ʿIbrânî Hebrew
O o/ɔ/o/ɔ/oŝt lip
Ô ôŌ ō/oː/ô/oː/Dômarî Domari
P p/p/pe/pɛ/ple money, cash, moolah, funds
Q q/q/qe/qɛ/qayîŝ food
R r/r/~/ɾ/4re/rɛ/rxîṣ cheap
S s/s/se/sɛ/sûq market(place)
Ṣ ṣ/sˤ/ṣe/sˤɛ/ṣaḥafi journalist, reporter
Ŝ ŝSH sh/ʃ/ŝe/ʃɛ/ŝmâriya chicken, poultry
T t/t̪/te/t̪ɛ/turĵman translator, interpreter
Ṭ ṭ/t̪ˤ/ṭe/t̪ˤɛ/ṭarmabil (motor)car, motor vehicle, auto(mobile)
Þ þTH th/θ/þe/θɛ/Þawrâ Thor (the Norse god)
U u/ʊ/u/ʊ/ustâz teacher, instructor, proctor
Û ûŪ ū/uː/û/uː/ûyar town, city, vicinity; Til-Ûyar Jerusalem
V v/v/~/β/5ve/vɛ/vîsa visa, affidavit; qravat cravat
W w/w/we/wɛ/waqt time
X xKH kh/x/~/χ/6xe/xɛ/xarbûŝ tent; xârfân sheep, lambs
Y y/j/ye/jɛ/Yasûʿ Jesus
Z z/z/ze/zɛ/zard gold; zarf envelope
Ẓ ẓ/zˤ/ẓe/zˤɛ/ẓâbiṭ officer
Ẑ ẑZH zh/ʒ/ẑe/ʒɛ/ẑbin forehead
Ǝ ǝĖ ė/ə/~/ʌ/7ŝǝwaʾ/ʃə·wɑʔ/ŝǝwaʾ shǝwaʾ, schwa


§ Spelling alternates are shown for certain of these sounds (i.e.: when typing on an ASCII or typewriter keyboard, or when/where computers cannot show the proper accented Domari letters); these alternates are also used on the KURI's Learn Domari article series.

1 The letter fe may be sounded either as a labiodental /f/ or a bilabial [ɸ] fricative, depending on the context, or origin of a given word/name.

2 The letter ĝe usually represents a voiced velar fricative /ɣ/, but may be sounded as a velarolaryngeal [ʁ] in words/names derived from Arabic, Persian, and Urdu.

3 The letter ne usually represents a voiced dental nasal /n̪/; however, it manifests as a velar [ŋ] before the letters g ĝ k q x, but as a palatal [ɲ] before the letters ĉ ĵ y.

4 The letter re represents a flapped [ɾ] or a trilled [r] rhotative resonant continuant, depending on the position within a word/name, and whether it appears singly or doubly.

5 The letter ve shows up mainly in words and names derived from foreign loans, and may represent either a voiced labiodental /v/ or a voiced bilabial [β] fricative.

6 The letter xe (pronounced as KHEH) usually represents a voiceless velar fricative /x/, but usually is sounded as a velarolaryngeal /χ/ one in scores of loan words/loan names which are derived from Arabic, Persian, and Urdu.

7 The vowel letter called ŝǝwaʾ (its name derives from the cognate Hebrew vowel point for this very same sound) represents the mean-mid central spread neutral vowel as it exists in the English words about, taken, pencil, lemon, and circus. While its normal manifestation is indeed [ə], it may vary in the direction of either a higher-mid [ʌ] or a fronted lower-mid [ɜ] one, depending on the dialect spoken.

  • SPECIAL NOTE*: The plain (unaccented) letters c and j are only found in foreign loan words and loan names, as shown in the above table.



There are five main vowel sounds, however this inventory shows the variation and quantity of short vowels. Most are interchangeable with a vowel sound next to it, however all of the sounds produced above are identical to the local Palestinian Arabic (Matras 1999).

Front Central Back
High ɪ iː ʊ uː
Mid ɛ eː ə ~ [ʌ] ~ [ɜ] ɔ oː
Low ɑ aː


Most of these consonants are influenced by Palestinian Arabic such as gemination; however, consonants such as [p], [g], [tʃ] and [h] are not found in the local dialect. There is speculation among linguists that these sounds are considered a part of the pre-Arabic component. Alveopalatal affricates such as [tʃ] and [dʒ] are also consonants that differ in sound from Arabic.[12]

Bilabial Labiodental Dental Postalveolar Palatal Velar Uvular Pharyngeal Glottal
plain phar.
Stop voiceless p t̪ˤ k q ʔ
voiced b d̪ˤ g [ɢ]
Affricate voiceless t͡ʃ
voiced d͡ʒ
Fricative voiceless [ɸ] f s̪ˤ ʃ x [χ] ħ h
voiced [β] v z̪ˤ ʒ ɣ [ʁ] ʕ
Nasal m [ŋ]
Approximant [ʋ] j w
Flap ɾ


The biggest difference in expression of language between Arabic and Domari is where the stress is placed. Arabic has phoneme-level stress while Domari is a language of word-level stress. The Domari language emphasizes stress on the final syllable, as well as grammatical markers for gender and number. Most nouns, besides proper nouns, adopted from Arabic sound distinct because of the unique stresses in Domari (Matras 1999). Domari is thought to have borrowed a lot of words and grammatical structure from Arabic; however, this is not entirely true. Complex verbs and most core prepositions did not transfer into the realms of grammar of the Domari language. The syntactic typology remains independent of Arabic influence. It also important to note that the numerals used by the Doms were inherited from Kurdish. Even though Domari was influenced by local Arabic, the language also felt the impacts of Kurdish and certain dialects of Iranian in the grammar of the language.[13]


Here is a table of the numerals (1-10, 20, and 100) in Hindi, Romani, Domari, Lomavren, and Persian for comparison.

1ekekh, jekhyek, yikayak, yekyak, yek
2dodujluydu, do
6chayšovŝaŝŝeŝʃaʃ, ʃeʃ
7sātiftahawt, hofthafthaft
8āt̥hoxtohayŝt, haytekhaŝthašt
9nauinjanununuh, noh

Keyboard Layout

A Pan-Domari Alphabet ASDF keyboard layout is to be made available for MS-Windows users, which is based on the United States English layout, albeit with C, J, and some punctuation signs being reassigned to <ALTGR>/<SHIFT<>ALTGR>, to make room for needed Domari characters.

Key Pan-Domari


United States


C Ĉ ĉ C c The plain (unaccented) form only occurs in certain loans.
J Ĵ ĵ J J The unaccented C and J only appear in unassimilated foreign loans.
, ? , < , The less than sign is reassigned to the <ALTGR> levels.
! > .


  1. Matras (2012)
  2. Domari at Ethnologue (19th ed., 2016)
  3. Domari language at Ethnologue (19th ed., 2016)
  4. "ROMANI Project - Manchester". Retrieved 20 December 2017.
  5. Matras, Yaron (2000). "Two Domari Legends about the origins of the Doms" (PDF). Retrieved 20 December 2017.
  6. The Gypsies of Lebanon: A DRC Update, April 2000, by Dr. G. A. Williams
  7. Matras, Yaron (1996). "Domari" (PDF). In Brown, Keith (ed.). Encyclopedia of Language and Linguistics. Oxford: Elsevier.
  8. "Peripatetics of Iraq, Syria, Lebanon, Jordan, Israel, Egypt, Sudan, and Yemen - Dictionary definition of Peripatetics of Iraq, Syria, Lebanon, Jordan, Israel, Egypt, Sudan, and Yemen - FREE online dictionary". Retrieved 20 December 2017.
  9. Matras, Yaron. "Language contact, language endangerment, and the role of the 'salvation linguist'". Language Documentation and Description. 3.
  10. "What is Domari?",, retrieved 2008-07-23
  11. "ON ROMANI ORIGINS AND IDENTITY",, archived from the original on 2011-07-17, retrieved 2008-07-23
  12. Matras, Yaron (2012-10-01). A Grammar of Domari. Walter de Gruyter. ISBN 9783110291421.
  13. "The Domari Language of Aleppo (Syria)". Linguistic Discovery. 10 (2).

Further reading

  • Herin, B. (2012). "The Domari language of Aleppo (Syria)" Linguistic Discovery 10 (2), 1-52.
  • Herin, B. (2014). "The Northern Dialects of Domari," Zeitschrift der Deutschen Morgenländischen Gesellschaft 164 (2): 407–450.
  • Matras, Y. (1999). "The state of present-day Domari in Jerusalem." Mediterranean Language Review 11, 1–58.
  • Matras, Y. (2002). Romani: a linguistic introduction. Cambridge: Cambridge University Press.
  • Matras, Y. (2012). A grammar of Domari. Berlin: De Gruyter Mouton (Mouton Grammar Library).
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.