Macron below

Macron below, U+0331 ̱ COMBINING MACRON BELOW, is a combining diacritical mark that is used in various orthographies.[1]

̱
Macron below
Diacritics in Latin & Greek
accent
acute´
double acute˝
grave`
double grave ̏
circumflexˆ
caron, háčekˇ
breve˘
inverted breve  ̑  
cedilla¸
diaeresis, umlaut¨
dot·
palatal hook  ̡
retroflex hook  ̢
hook above, dấu hỏi ̉
horn ̛
iota subscript ͅ 
macronˉ
ogonek, nosinė˛
perispomene ͂ 
overring˚
underring˳
rough breathing
smooth breathing᾿
Marks sometimes used as diacritics
apostrophe
bar◌̸
colon:
comma,
full stop/period.
hyphen˗
prime
tilde~
Diacritical marks in other scripts
Arabic diacritics
Early Cyrillic diacritics
kamora ҄
pokrytie ҇
titlo ҃
Hebrew diacritics
Indic diacritics
anusvara
avagraha
chandrabindu
nuqta
virama
visarga
Gurmukhī diacritics
Khmer diacritics
Thai diacritics
IPA diacritics
Japanese kana diacritics
dakuten
handakuten
Syriac diacritics
Related
Dotted circle
Punctuation marks
Logic symbols
A̱a̱ḆḇC̱c̱

It is not to be confused with U+0320 ̠ COMBINING MINUS SIGN BELOW, U+0332 ̲ COMBINING LOW LINE and U+005F _ LOW LINE. The difference between "macron below" and "low line" is that the latter results in an unbroken underline when it is run together: compare a̱ḇc̱ and a̲b̲c̲ (only the latter should look like abc).[2]

Unicode

Macron below character

Unicode defines several characters for the macron below:

macron below
combiningspacing
characterUnicodeHTMLcharacterUnicodeHTML
◌̱
single
U+0331̱ˍ
letter
U+02CDˍ
◌͟◌
double
U+035F͟

There are many similar marks covered elsewhere:

  • Spacing underscores, including
    • U+005F _ LOW LINE (HTML _ · _, _)
    • U+2017 DOUBLE LOW LINE (HTML ‗)
  • Combining underlines, including
    • U+0332 ̲ COMBINING LOW LINE (HTML ̲)
    • U+0333 ̳ COMBINING DOUBLE LOW LINE (HTML ̳)
    • U+0347 ͇ COMBINING EQUALS SIGN BELOW (HTML ͇);
    • U+FE2B COMBINING MACRON LEFT HALF BELOW (HTML ︫)
    • U+FE2C COMBINING MACRON RIGHT HALF BELOW (HTML ︬)
    • U+FE2D COMBINING CONJOINING MACRON BELOW (HTML ︭)
  • International Phonetic Alphabet mark for retracted or backed articulation:[1]
    • U+0320 ̠ COMBINING MINUS SIGN BELOW (HTML ̠)
    • U+02D7 ˗ MODIFIER LETTER MINUS SIGN (HTML ˗)

Precomposed characters

Various precomposed letters with a macron below are defined in Unicode:

upper caselower casenotes
letterUnicodeHTMLletterUnicodeHTML
U+1E06ḆU+1E07ḇUsed in the transliteration of Biblical Hebrew into the Roman alphabet to show the fricative value of the letter beth (ב) representing [v], or perhaps [β].
U+1E0EḎU+1E0FḏUsed in the transliteration of Biblical Hebrew, Syriac and Arabic into the Roman alphabet to show the fricative value of the letter dalet (ד), [ð], and in the romanization of Pashto, it is used sometimes to represent retroflex D.
U+1E96ẖSometimes used for Arabic خ ẖāʼ, Hebrew Heth (letter), Egyptian 𓄡.

There is no precomposed upper case equivalent of so it uses a combining macron below instead: .

U+1E34ḴU+1E35ḵUsed in the transliteration of Biblical Hebrew into the Roman alphabet to show the fricative value of the letter kaph (ב) representing [χ].

Used in Tlingit and Haida (among other Pacific Northwest languages) for the voiceless uvular stop [q]. Close to Korean ㄲ kk; closest English "shocking"

Used optionally in the K-dialect of Māori in the South Island of New Zealand, where an original ng has merged with k. The ḵ indicates that it corresponds to ng in other dialects. There is no difference in pronunciation between ḵ and k.

U+1E3AḺU+1E3BḻOne possible transliteration of the Tamil letter . Ḻ is used in the Seri language to represent [l], like English l, while unmodified "l" represents [ɬ], like Welsh ll. It is also used in the proposed Unified Alphabet for Mapudungun.
U+1E48ṈU+1E49ṉUsed in Pitjantjatjara to represent [ɳ], and in Saanich to represent both plain and glottalized [ɴ]. In the romanization of Pashto, it is used sometimes to represent retroflex N.
U+1E5EṞU+1E5FṟUsed in Pitjantjatjara to represent [ɻ], and sometimes in the romanization of Pashto to represent the retroflex R.
U+1E6EṮU+1E6FṯUsed in the proposed Unified Alphabet for Mapudungun language representing [t̪]. In the romanization of Pashto, it is used sometimes to represent retroflex T. In the romanization of Arabic this letter is used to transcribe the letter Ṯāʾ.
U+1E94ẔU+1E95ẕUsed in the 1953 Hebrew Academy Romanization of Hebrew to represent tsade (צ).
U+20AB₫Vietnamese đồng.

Note that the Unicode character names of precomposed characters whose decompositions contain U+0331 ̱ COMBINING MACRON BELOW use "WITH LINE BELOW" rather than "WITH MACRON BELOW". Thus, U+1E07 LATIN SMALL LETTER B WITH LINE BELOW decomposes to U+0062 b LATIN SMALL LETTER B and U+0331 ̱ COMBINING MACRON BELOW.[3]

The Vietnamese đồng currency sign resembles a lower case d with a stroke and macron below: U+20AB DONG SIGN (HTML ₫) but is neither a letter nor decomposable.[4]

See also

References

  1. "Combining Diacritical Marks Code Chart, Range: 0300–036F" (PDF). The Unicode Standard. Retrieved 2016-11-21.
  2. "6.2 General Punctuation" (PDF). The Unicode Standard. Version 11.0.0. Mountain View, CA: The Unicode Consortium. 2018. p. 273. ISBN 978-1-936213-19-1. Retrieved 2018-12-12. Spacing Overscores and Underscores. U+203E OVERLINE is the above-the-line counterpart to U+005F low line. It is a spacing character, not to be confused with U+0305 COMBINING OVERLINE. As with all overscores and underscores, a sequence of these characters should connect in an unbroken line. The overscoring characters also must be distinguished from U+0304 COMBINING MACRON, which does not connect horizontally in this way.
  3. "Latin Extended Additional Code Chart, Range: 1E00–1EFF" (PDF). The Unicode Standard. Retrieved 2016-11-21.
  4. "Unicode character database". The Unicode Standard. Retrieved 2016-11-21.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.