# Myanmar character tables #
This document lists the per-character shaping information needed to
[shape Myanmar text](../opentype-shaping-myanmar.md).
**Contents**
- [Myanmar character table](#myanmar-character-table)
- [Myanmar Extended character tables](#myanmar-extended-character-tables)
- [Vedic Extensions character table](#vedic-extensions-character-table)
- [Miscellaneous character table](#miscellaneous-character-table)
## Myanmar character table ##
Myanmar glyphs should be classified as in the following
table. Codepoints in the Myanmar block with no assigned meaning are
designated as _unassigned_ in the _Unicode category_ column.
Assigned codepoints with a _null_ in the _Shaping class_
column evoke no special behavior from the shaping engine. Note that
this includes some valid codepoints, such as currency marks,
punctuation, and other symbols.
> Note: the `NUMBER` and `SYMBOL` _Shaping classes_ are important
> during syllable identification, but generally evoke no further
> special behavior during the rest of the shaping process.
The _Mark-placement subclass_ column indicates mark-placement
positioning for codepoints in the _Mark_ category. Assigned, non-mark
codepoints have a _null_ in this column and evoke no special
mark-placement behavior. Marks tagged with [Mn] in the _Unicode
category_ column are categorized as non-spacing; marks tagged with
[Mc] are categorized as spacing-combining.
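
As an illustrative sketch only, the [Mn]/[Mc] distinction can be read directly from the Unicode General Category. The example assumes Python's standard `unicodedata` module; the helper name `mark_spacing` is hypothetical and not part of any shaping engine.

```python
import unicodedata
from typing import Optional

def mark_spacing(ch: str) -> Optional[str]:
    """Map a character's General Category to the spacing label used above."""
    category = unicodedata.category(ch)
    if category == "Mn":
        return "non-spacing"
    if category == "Mc":
        return "spacing-combining"
    return None  # not a mark tagged [Mn] or [Mc]

print(mark_spacing("\u102D"))  # Sign I, tagged [Mn] in the table
print(mark_spacing("\u102B"))  # Sign Tall Aa, tagged [Mc] in the table
```

The result depends on the Unicode data bundled with the Python build, so it may lag behind the tables in this document for recently changed codepoints.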
Some codepoints in the following table use a _Shaping class_ that
differs from the codepoint's Unicode _General Category_. The _Shaping
class_ takes precedence during OpenType shaping, as it captures more
specific, script-aware behavior.
:::{table} Myanmar character table
| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+1000` | Letter | CONSONANT | _null_ | က Ka |
|`U+1001` | Letter | CONSONANT | _null_ | ခ Kha |
|`U+1002` | Letter | CONSONANT | _null_ | ဂ Ga |
|`U+1003` | Letter | CONSONANT | _null_ | ဃ Gha |
|`U+1004` | Letter | CONSONANT | _null_ | င Nga |
|`U+1005` | Letter | CONSONANT | _null_ | စ Ca |
|`U+1006` | Letter | CONSONANT | _null_ | ဆ Cha |
|`U+1007` | Letter | CONSONANT | _null_ | ဇ Ja |
|`U+1008` | Letter | CONSONANT | _null_ | ဈ Jha |
|`U+1009` | Letter | CONSONANT | _null_ | ဉ Nya |
|`U+100A` | Letter | CONSONANT | _null_ | ည Nnya |
|`U+100B` | Letter | CONSONANT | _null_ | ဋ Tta |
|`U+100C` | Letter | CONSONANT | _null_ | ဌ Ttha |
|`U+100D` | Letter | CONSONANT | _null_ | ဍ Dda |
|`U+100E` | Letter | CONSONANT | _null_ | ဎ Ddha |
|`U+100F` | Letter | CONSONANT | _null_ | ဏ Nna |
| | | | |
|`U+1010` | Letter | CONSONANT | _null_ | တ Ta |
|`U+1011` | Letter | CONSONANT | _null_ | ထ Tha |
|`U+1012` | Letter | CONSONANT | _null_ | ဒ Da |
|`U+1013` | Letter | CONSONANT | _null_ | ဓ Dha |
|`U+1014` | Letter | CONSONANT | _null_ | န Na |
|`U+1015` | Letter | CONSONANT | _null_ | ပ Pa |
|`U+1016` | Letter | CONSONANT | _null_ | ဖ Pha |
|`U+1017` | Letter | CONSONANT | _null_ | ဗ Ba |
|`U+1018` | Letter | CONSONANT | _null_ | ဘ Bha |
|`U+1019` | Letter | CONSONANT | _null_ | မ Ma |
|`U+101A` | Letter | CONSONANT | _null_ | ယ Ya |
|`U+101B` | Letter | CONSONANT | _null_ | ရ Ra |
|`U+101C` | Letter | CONSONANT | _null_ | လ La |
|`U+101D` | Letter | CONSONANT | _null_ | ဝ Wa |
|`U+101E` | Letter | CONSONANT | _null_ | သ Sa |
|`U+101F` | Letter | CONSONANT | _null_ | ဟ Ha |
| | | | |
|`U+1020` | Letter | CONSONANT | _null_ | ဠ Lla |
|`U+1021` | Letter | VOWEL_INDEPENDENT | _null_ | အ A |
|`U+1022` | Letter | VOWEL_INDEPENDENT | _null_ | ဢ Shan A |
|`U+1023` | Letter | VOWEL_INDEPENDENT | _null_ | ဣ I |
|`U+1024` | Letter | VOWEL_INDEPENDENT | _null_ | ဤ Ii |
|`U+1025` | Letter | VOWEL_INDEPENDENT | _null_ | ဥ U |
|`U+1026` | Letter | VOWEL_INDEPENDENT | _null_ | ဦ Uu |
|`U+1027` | Letter | VOWEL_INDEPENDENT | _null_ | ဧ E |
|`U+1028` | Letter | VOWEL_INDEPENDENT | _null_ | ဨ Mon E |
|`U+1029` | Letter | VOWEL_INDEPENDENT | _null_ | ဩ O |
|`U+102A` | Letter | VOWEL_INDEPENDENT | _null_ | ဪ Au |
|`U+102B` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ါ Sign Tall Aa |
|`U+102C` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ာ Sign Aa |
|`U+102D` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ိ Sign I |
|`U+102E` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ီ Sign Ii |
|`U+102F` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ု Sign U |
| | | | |
|`U+1030` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ူ Sign Uu |
|`U+1031` | Mark [Mc] | VOWEL_DEPENDENT | LEFT_POSITION | ေ Sign E |
|`U+1032` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဲ Sign Ai |
|`U+1033` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဳ Sign Mon Ii |
|`U+1034` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဴ Sign Mon O |
|`U+1035` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဵ Sign E Above |
|`U+1036` | Mark [Mn] | BINDU | TOP_POSITION | ံ Anusvara |
|`U+1037` | Mark [Mn] | TONE_MARKER | BOTTOM_POSITION | ့ Dot Below |
|`U+1038` | Mark [Mc] | VISARGA | RIGHT_POSITION | း Visarga |
|`U+1039` | Mark [Mn] | INVISIBLE_STACKER | _null_ | ္ Virama |
|`U+103A` | Mark [Mn] | PURE_KILLER | TOP_POSITION | ် Asat |
|`U+103B` | Mark [Mc] | CONSONANT_MEDIAL | RIGHT_POSITION | ျ Sign Medial Ya |
|`U+103C` | Mark [Mc] | CONSONANT_MEDIAL | TOP_LEFT_AND_BOTTOM_POSITION | ြ Sign Medial Ra |
|`U+103D` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ွ Sign Medial Wa |
|`U+103E` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ှ Sign Medial Ha |
|`U+103F` | Letter | CONSONANT | _null_ | ဿ Great Sa |
| | | | |
|`U+1040` | Number | NUMBER | _null_ | ၀ Digit Zero |
|`U+1041` | Number | NUMBER | _null_ | ၁ Digit One |
|`U+1042` | Number | NUMBER | _null_ | ၂ Digit Two |
|`U+1043` | Number | NUMBER | _null_ | ၃ Digit Three |
|`U+1044` | Number | NUMBER | _null_ | ၄ Digit Four |
|`U+1045` | Number | NUMBER | _null_ | ၅ Digit Five |
|`U+1046` | Number | NUMBER | _null_ | ၆ Digit Six |
|`U+1047` | Number | NUMBER | _null_ | ၇ Digit Seven |
|`U+1048` | Number | NUMBER | _null_ | ၈ Digit Eight |
|`U+1049` | Number | NUMBER | _null_ | ၉ Digit Nine |
|`U+104A` | Punctuation | _null_ | _null_ | ၊ Little Section |
|`U+104B` | Punctuation | _null_ | _null_ | ။ Section |
|`U+104C` | Punctuation | _null_ | _null_ | ၌ Locative |
|`U+104D` | Punctuation | _null_ | _null_ | ၍ Completed |
|`U+104E` | Punctuation | CONSONANT_PLACEHOLDER| _null_ | ၎ Aforementioned |
|`U+104F` | Punctuation | _null_ | _null_ | ၏ Genitive |
| | | | |
|`U+1050` | Letter | CONSONANT | _null_ | ၐ Sha |
|`U+1051` | Letter | CONSONANT | _null_ | ၑ Ssa |
|`U+1052` | Letter | VOWEL_INDEPENDENT | _null_ | ၒ Vocalic R |
|`U+1053` | Letter | VOWEL_INDEPENDENT | _null_ | ၓ Vocalic Rr |
|`U+1054` | Letter | VOWEL_INDEPENDENT | _null_ | ၔ Vocalic L |
|`U+1055` | Letter | VOWEL_INDEPENDENT | _null_ | ၕ Vocalic Ll |
|`U+1056` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၖ Sign Vocalic R |
|`U+1057` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၗ Sign Vocalic Rr |
|`U+1058` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ၘ Sign Vocalic L |
|`U+1059` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ၙ Sign Vocalic Ll |
|`U+105A` | Letter | CONSONANT | _null_ | ၚ Mon Nga |
|`U+105B` | Letter | CONSONANT | _null_ | ၛ Mon Jha |
|`U+105C` | Letter | CONSONANT | _null_ | ၜ Mon Bba |
|`U+105D` | Letter | CONSONANT | _null_ | ၝ Mon Bbe |
|`U+105E` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ၞ Sign Mon Medial Na |
|`U+105F` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ၟ Sign Mon Medial Ma |
| | | | |
|`U+1060` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ၠ Sign Mon Medial La |
|`U+1061` | Letter | CONSONANT | _null_ | ၡ Sgaw Karen Sha |
|`U+1062` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၢ Sign Sgaw Karen Eu |
|`U+1063` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၣ Tone Sgaw Karen Hathi|
|`U+1064` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၤ Tone Sgaw Karen Ke Pho|
|`U+1065` | Letter | CONSONANT | _null_ | ၥ Western Pwo Karen Tha|
|`U+1066` | Letter | CONSONANT | _null_ | ၦ Western Pwo Karen Pwa|
|`U+1067` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၧ Sign Western Pwo Karen Eu|
|`U+1068` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၨ Sign Western Pwo Karen Ue|
|`U+1069` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၩ Sign Western Pwo Karen Tone 1|
|`U+106A` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၪ Sign Western Pwo Karen Tone 2|
|`U+106B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၫ Sign Western Pwo Karen Tone 3|
|`U+106C` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၬ Sign Western Pwo Karen Tone 4|
|`U+106D` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၭ Sign Western Pwo Karen Tone 5|
|`U+106E` | Letter | CONSONANT | _null_ | ၮ Eastern Pwo Karen Nna|
|`U+106F` | Letter | CONSONANT | _null_ | ၯ Eastern Pwo Karen Ywa|
| | | | |
|`U+1070` | Letter | CONSONANT | _null_ | ၰ Eastern Pwo Karen Ghwa|
|`U+1071` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၱ Sign Geba Karen I |
|`U+1072` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၲ Sign Kayah Oe |
|`U+1073` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၳ Sign Kayah U |
|`U+1074` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၴ Sign Kayah Ee |
|`U+1075` | Letter | CONSONANT | _null_ | ၵ Shan Ka |
|`U+1076` | Letter | CONSONANT | _null_ | ၶ Shan Kha |
|`U+1077` | Letter | CONSONANT | _null_ | ၷ Shan Ga |
|`U+1078` | Letter | CONSONANT | _null_ | ၸ Shan Ca |
|`U+1079` | Letter | CONSONANT | _null_ | ၹ Shan Za |
|`U+107A` | Letter | CONSONANT | _null_ | ၺ Shan Nya |
|`U+107B` | Letter | CONSONANT | _null_ | ၻ Shan Da |
|`U+107C` | Letter | CONSONANT | _null_ | ၼ Shan Na |
|`U+107D` | Letter | CONSONANT | _null_ | ၽ Shan Pha |
|`U+107E` | Letter | CONSONANT | _null_ | ၾ Shan Fa |
|`U+107F` | Letter | CONSONANT | _null_ | ၿ Shan Ba |
| | | | |
|`U+1080` | Letter | CONSONANT | _null_ | ႀ Shan Tha |
|`U+1081` | Letter | CONSONANT | _null_ | ႁ Shan Ha |
|`U+1082` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ႂ Sign Shan Medial Wa |
|`U+1083` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ႃ Sign Shan Aa |
|`U+1084` | Mark [Mc] | VOWEL_DEPENDENT | LEFT_POSITION | ႄ Sign Shan E |
|`U+1085` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ႅ Sign Shan E Above |
|`U+1086` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ႆ Sign Shan Final Y |
|`U+1087` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႇ Sign Shan Tone 2 |
|`U+1088` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႈ Sign Shan Tone 3 |
|`U+1089` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႉ Sign Shan Tone 5 |
|`U+108A` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႊ Sign Shan Tone 6 |
|`U+108B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႋ Sign Shan Council Tone 2|
|`U+108C` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႌ Sign Shan Council Tone 3|
|`U+108D` | Mark [Mn] | TONE_MARKER | BOTTOM_POSITION | ႍ Sign Shan Council Emphatic Tone|
|`U+108E` | Letter | CONSONANT | _null_ | ႎ Rumai Palaung Fa |
|`U+108F` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႏ Sign Rumai Palaung Tone 5|
| | | | |
|`U+1090` | Number | NUMBER | _null_ | ႐ Shan Digit Zero |
|`U+1091` | Number | NUMBER | _null_ | ႑ Shan Digit One |
|`U+1092` | Number | NUMBER | _null_ | ႒ Shan Digit Two |
|`U+1093` | Number | NUMBER | _null_ | ႓ Shan Digit Three |
|`U+1094` | Number | NUMBER | _null_ | ႔ Shan Digit Four |
|`U+1095` | Number | NUMBER | _null_ | ႕ Shan Digit Five |
|`U+1096` | Number | NUMBER | _null_ | ႖ Shan Digit Six |
|`U+1097` | Number | NUMBER | _null_ | ႗ Shan Digit Seven |
|`U+1098` | Number | NUMBER | _null_ | ႘ Shan Digit Eight |
|`U+1099` | Number | NUMBER | _null_ | ႙ Shan Digit Nine |
|`U+109A` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႚ Sign Khamti Tone 1 |
|`U+109B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႛ Sign Khamti Tone 3 |
|`U+109C` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ႜ Sign Aiton A |
|`U+109D` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ႝ Sign Aiton Ai |
|`U+109E` | Symbol | SYMBOL | _null_ | ႞ Shan One |
|`U+109F` | Symbol | SYMBOL | _null_ | ႟ Shan Exclamation |
:::
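
The classification above could be stored as a range table and queried per codepoint. The following is a minimal sketch under stated assumptions: it reproduces only an excerpt of the rows (U+1000 through U+1049), and the names `SHAPING_CLASS_RANGES` and `shaping_class` are hypothetical.

```python
from typing import Optional

# Excerpt of the Myanmar character table: (first, last, shaping class).
# Only U+1000..U+1049 is covered; a real engine needs every row.
SHAPING_CLASS_RANGES = [
    (0x1000, 0x1020, "CONSONANT"),
    (0x1021, 0x102A, "VOWEL_INDEPENDENT"),
    (0x102B, 0x1035, "VOWEL_DEPENDENT"),
    (0x1036, 0x1036, "BINDU"),
    (0x1037, 0x1037, "TONE_MARKER"),
    (0x1038, 0x1038, "VISARGA"),
    (0x1039, 0x1039, "INVISIBLE_STACKER"),
    (0x103A, 0x103A, "PURE_KILLER"),
    (0x103B, 0x103E, "CONSONANT_MEDIAL"),
    (0x103F, 0x103F, "CONSONANT"),
    (0x1040, 0x1049, "NUMBER"),
]

def shaping_class(codepoint: int) -> Optional[str]:
    """Return the shaping class, or None for _null_ or uncovered codepoints."""
    for first, last, cls in SHAPING_CLASS_RANGES:
        if first <= codepoint <= last:
            return cls
    return None

print(shaping_class(0x1031))  # Sign E
print(shaping_class(0x104A))  # Little Section has a _null_ shaping class
```

Note that this sketch returns `None` both for _null_ rows and for codepoints outside the excerpt; a complete implementation would probably want to distinguish the two.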
## Myanmar Extended character tables ##
### Myanmar Extended A character table ###
:::{table} Myanmar Extended-A character table
| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+AA60` | Letter | CONSONANT | _null_ | ꩠ Khamti Ga |
|`U+AA61` | Letter | CONSONANT | _null_ | ꩡ Khamti Ca |
|`U+AA62` | Letter | CONSONANT | _null_ | ꩢ Khamti Cha |
|`U+AA63` | Letter | CONSONANT | _null_ | ꩣ Khamti Ja |
|`U+AA64` | Letter | CONSONANT | _null_ | ꩤ Khamti Jha |
|`U+AA65` | Letter | CONSONANT | _null_ | ꩥ Khamti Nya |
|`U+AA66` | Letter | CONSONANT | _null_ | ꩦ Khamti Tta |
|`U+AA67` | Letter | CONSONANT | _null_ | ꩧ Khamti Ttha |
|`U+AA68` | Letter | CONSONANT | _null_ | ꩨ Khamti Dda |
|`U+AA69` | Letter | CONSONANT | _null_ | ꩩ Khamti Ddha |
|`U+AA6A` | Letter | CONSONANT | _null_ | ꩪ Khamti Dha |
|`U+AA6B` | Letter | CONSONANT | _null_ | ꩫ Khamti Na |
|`U+AA6C` | Letter | CONSONANT | _null_ | ꩬ Khamti Sa |
|`U+AA6D` | Letter | CONSONANT | _null_ | ꩭ Khamti Ha |
|`U+AA6E` | Letter | CONSONANT | _null_ | ꩮ Khamti Hha |
|`U+AA6F` | Letter | CONSONANT | _null_ | ꩯ Khamti Fa |
| | | | |
|`U+AA70` | Letter | _null_ | _null_ | ꩰ Khamti Reduplication|
|`U+AA71` | Letter | CONSONANT | _null_ | ꩱ Khamti Xa |
|`U+AA72` | Letter | CONSONANT | _null_ | ꩲ Khamti Za |
|`U+AA73` | Letter | CONSONANT | _null_ | ꩳ Khamti Ra |
|`U+AA74` | Letter | CONSONANT_PLACEHOLDER| _null_ | ꩴ Khamti Oay |
|`U+AA75` | Letter | CONSONANT_PLACEHOLDER| _null_ | ꩵ Khamti Qn |
|`U+AA76` | Letter | CONSONANT_PLACEHOLDER| _null_ | ꩶ Khamti Hm |
|`U+AA77` | Symbol | SYMBOL | _null_ | ꩷ Aiton Exclamation |
|`U+AA78` | Symbol | SYMBOL | _null_ | ꩸ Aiton One |
|`U+AA79` | Symbol | SYMBOL | _null_ | ꩹ Aiton Two |
|`U+AA7A` | Letter | CONSONANT | _null_ | ꩺ Aiton Ra |
|`U+AA7B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ꩻ Sign Pao Karen Tone |
|`U+AA7C` | Mark [Mn] | TONE_MARKER | TOP_POSITION | ꩼ Sign Tai Laing Tone 2|
|`U+AA7D` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ꩽ Sign Tai Laing Tone 5|
|`U+AA7E` | Letter | CONSONANT | _null_ | ꩾ Shwe Palaung Cha |
|`U+AA7F` | Letter | CONSONANT | _null_ | ꩿ Shwe Palaung Sha |
:::
### Myanmar Extended B character table ###
:::{table} Myanmar Extended-B character table
| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+A9E0` | Letter | CONSONANT | _null_ | ꧠ Shan Gha |
|`U+A9E1` | Letter | CONSONANT | _null_ | ꧡ Shan Cha |
|`U+A9E2` | Letter | CONSONANT | _null_ | ꧢ Shan Jha |
|`U+A9E3` | Letter | CONSONANT | _null_ | ꧣ Shan Nna |
|`U+A9E4` | Letter | CONSONANT | _null_ | ꧤ Shan Bha |
|`U+A9E5` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ꧥ Sign Shan Saw |
|`U+A9E6` | Letter | _null_ | _null_ | ꧦ Shan Reduplication |
|`U+A9E7` | Letter | CONSONANT | _null_ | ꧧ Tai Laing Nya |
|`U+A9E8` | Letter | CONSONANT | _null_ | ꧨ Tai Laing Fa |
|`U+A9E9` | Letter | CONSONANT | _null_ | ꧩ Tai Laing Ga |
|`U+A9EA` | Letter | CONSONANT | _null_ | ꧪ Tai Laing Gha |
|`U+A9EB` | Letter | CONSONANT | _null_ | ꧫ Tai Laing Ja |
|`U+A9EC` | Letter | CONSONANT | _null_ | ꧬ Tai Laing Jha |
|`U+A9ED` | Letter | CONSONANT | _null_ | ꧭ Tai Laing Dda |
|`U+A9EE` | Letter | CONSONANT | _null_ | ꧮ Tai Laing Ddha |
|`U+A9EF` | Letter | CONSONANT | _null_ | ꧯ Tai Laing Nna |
| | | | |
|`U+A9F0` | Number | NUMBER | _null_ | ꧰ Tai Laing Digit Zero|
|`U+A9F1` | Number | NUMBER | _null_ | ꧱ Tai Laing Digit One |
|`U+A9F2` | Number | NUMBER | _null_ | ꧲ Tai Laing Digit Two |
|`U+A9F3` | Number | NUMBER | _null_ | ꧳ Tai Laing Digit Three|
|`U+A9F4` | Number | NUMBER | _null_ | ꧴ Tai Laing Digit Four|
|`U+A9F5` | Number | NUMBER | _null_ | ꧵ Tai Laing Digit Five|
|`U+A9F6` | Number | NUMBER | _null_ | ꧶ Tai Laing Digit Six |
|`U+A9F7` | Number | NUMBER | _null_ | ꧷ Tai Laing Digit Seven|
|`U+A9F8` | Number | NUMBER | _null_ | ꧸ Tai Laing Digit Eight|
|`U+A9F9` | Number | NUMBER | _null_ | ꧹ Tai Laing Digit Nine|
|`U+A9FA` | Letter | CONSONANT | _null_ | ꧺ Tai Laing Lla |
|`U+A9FB` | Letter | CONSONANT | _null_ | ꧻ Tai Laing Da |
|`U+A9FC` | Letter | CONSONANT | _null_ | ꧼ Tai Laing Dha |
|`U+A9FD` | Letter | CONSONANT | _null_ | ꧽ Tai Laing Ba |
|`U+A9FE` | Letter | CONSONANT | _null_ | ꧾ Tai Laing Bha |
|`U+A9FF` | _unassigned_ | | | |
:::
### Myanmar Extended C character table ###
:::{table} Myanmar Extended-C character table
| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+116D0` | Number | NUMBER | _null_ | Pao Digit Zero |
|`U+116D1` | Number | NUMBER | _null_ | Pao Digit One |
|`U+116D2` | Number | NUMBER | _null_ | Pao Digit Two |
|`U+116D3` | Number | NUMBER | _null_ | Pao Digit Three |
|`U+116D4` | Number | NUMBER | _null_ | Pao Digit Four |
|`U+116D5` | Number | NUMBER | _null_ | Pao Digit Five |
|`U+116D6` | Number | NUMBER | _null_ | Pao Digit Six |
|`U+116D7` | Number | NUMBER | _null_ | Pao Digit Seven |
|`U+116D8` | Number | NUMBER | _null_ | Pao Digit Eight |
|`U+116D9` | Number | NUMBER | _null_ | Pao Digit Nine |
|`U+116DA` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Zero|
|`U+116DB` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit One|
|`U+116DC` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Two|
|`U+116DD` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Three|
|`U+116DE` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Four|
|`U+116DF` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Five|
| | | | |
|`U+116E0` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Six|
|`U+116E1` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Seven|
|`U+116E2` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Eight|
|`U+116E3` | Number | NUMBER | _null_ | Eastern Pwo Karen Digit Nine|
|`U+116E4` | _unassigned_ | | | |
|`U+116E5` | _unassigned_ | | | |
|`U+116E6` | _unassigned_ | | | |
|`U+116E7` | _unassigned_ | | | |
|`U+116E8` | _unassigned_ | | | |
|`U+116E9` | _unassigned_ | | | |
|`U+116EA` | _unassigned_ | | | |
|`U+116EB` | _unassigned_ | | | |
|`U+116EC` | _unassigned_ | | | |
|`U+116ED` | _unassigned_ | | | |
|`U+116EE` | _unassigned_ | | | |
|`U+116EF` | _unassigned_ | | | |
| | | | |
|`U+116F0` | _unassigned_ | | | |
|`U+116F1` | _unassigned_ | | | |
|`U+116F2` | _unassigned_ | | | |
|`U+116F3` | _unassigned_ | | | |
|`U+116F4` | _unassigned_ | | | |
|`U+116F5` | _unassigned_ | | | |
|`U+116F6` | _unassigned_ | | | |
|`U+116F7` | _unassigned_ | | | |
|`U+116F8` | _unassigned_ | | | |
|`U+116F9` | _unassigned_ | | | |
|`U+116FA` | _unassigned_ | | | |
|`U+116FB` | _unassigned_ | | | |
|`U+116FC` | _unassigned_ | | | |
|`U+116FD` | _unassigned_ | | | |
|`U+116FE` | _unassigned_ | | | |
|`U+116FF` | _unassigned_ | | | |
:::
## Vedic Extensions character table ##
Sanskrit runs written in the Myanmar script may also include
characters from the Vedic Extensions block. These characters should be
classified as follows.
> Note: See the [Vedic Extensions](../opentype-shaping-vedic-extensions.md)
> document for additional information.
:::{table} Vedic Extensions character table
| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+1CD0` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳐ Tone Karshana |
|`U+1CD1` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳑ Tone Shara |
|`U+1CD2` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳒ Tone Prenkha |
|`U+1CD3` | Punctuation | _null_ | _null_ | ᳓ Sign Nihshvasa |
|`U+1CD4` | Mark [Mn] | CANTILLATION | OVERSTRUCK | ᳔ Tone Midline Svarita |
|`U+1CD5` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳕ Tone Aggravated Independent Svarita |
|`U+1CD6` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳖ Tone Independent Svarita |
|`U+1CD7` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳗ Tone Kathaka Independent Svarita |
|`U+1CD8` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳘ Tone Candra Below |
|`U+1CD9` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳙ Tone Kathaka Independent Svarita Schroeder |
|`U+1CDA` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳚ Tone Double Svarita |
|`U+1CDB` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳛ Tone Triple Svarita |
|`U+1CDC` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳜ Tone Kathaka Anudatta |
|`U+1CDD` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳝ Tone Dot Below |
|`U+1CDE` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳞ Tone Two Dots Below |
|`U+1CDF` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳟ Tone Three Dots Below |
| | | | |
|`U+1CE0` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳠ Tone Rigvedic Kashmiri Independent Svarita |
|`U+1CE1` | Mark [Mc] | CANTILLATION | RIGHT_POSITION | ᳡ Tone Atharvavedic Independent Svarita |
|`U+1CE2` | Mark [Mn] | AVAGRAHA | OVERSTRUCK | ᳢ Sign Visarga Svarita |
|`U+1CE3` | Mark [Mn] | _null_ | OVERSTRUCK | ᳣ Sign Visarga Udatta |
|`U+1CE4` | Mark [Mn] | _null_ | OVERSTRUCK | ᳤ Sign Reversed Visarga Udatta |
|`U+1CE5` | Mark [Mn] | _null_ | OVERSTRUCK | ᳥ Sign Visarga Anudatta |
|`U+1CE6` | Mark [Mn] | _null_ | OVERSTRUCK | ᳦ Sign Reversed Visarga Anudatta |
|`U+1CE7` | Mark [Mn] | _null_ | OVERSTRUCK | ᳧ Sign Visarga Udatta With Tail |
|`U+1CE8` | Mark [Mn] | AVAGRAHA | OVERSTRUCK | ᳨ Sign Visarga Anudatta With Tail |
|`U+1CE9` | Letter | SYMBOL | _null_ | ᳩ Sign Anusvara Antargomukha |
|`U+1CEA` | Letter | _null_ | _null_ | ᳪ Sign Anusvara Bahirgomukha |
|`U+1CEB` | Letter | _null_ | _null_ | ᳫ Sign Anusvara Vamagomukha |
|`U+1CEC` | Letter | SYMBOL | _null_ | ᳬ Sign Anusvara Vamagomukha With Tail |
|`U+1CED` | Mark [Mn] | AVAGRAHA | BOTTOM_POSITION | ᳭ Sign Tiryak |
|`U+1CEE` | Letter | SYMBOL | _null_ | ᳮ Sign Hexiform Long Anusvara |
|`U+1CEF` | Letter | _null_ | _null_ | ᳯ Sign Long Anusvara |
| | | | |
|`U+1CF0` | Letter | _null_ | _null_ | ᳰ Sign Rthang Long Anusvara |
|`U+1CF2` | Letter | CONSONANT_DEAD | _null_ | ᳲ Sign Ardhavisarga |
|`U+1CF3` | Letter | CONSONANT_DEAD | _null_ | ᳳ Sign Rotated Ardhavisarga |
|`U+1CF4` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳴ Tone Candra Above |
|`U+1CF5` | Letter | CONSONANT_WITH_STACKER | _null_ | ᳵ Sign Jihvamuliya |
|`U+1CF6` | Letter | CONSONANT_WITH_STACKER | _null_ | ᳶ Sign Upadhmaniya |
|`U+1CF7` | Mark [Mc] | _null_ | _null_ | ᳷ Sign Atikrama |
|`U+1CF8` | Mark [Mn] | CANTILLATION | _null_ | ᳸ Tone Ring Above |
|`U+1CF9` | Mark [Mn] | CANTILLATION | _null_ | ᳹ Tone Double Ring Above |
|`U+1CFA` | Letter | PLACEHOLDER | _null_ | ᳺ Sign Double Anusvara Antargomukha |
|`U+1CFB` | _unassigned_ | | | |
|`U+1CFC` | _unassigned_ | | | |
|`U+1CFD` | _unassigned_ | | | |
|`U+1CFE` | _unassigned_ | | | |
|`U+1CFF` | _unassigned_ | | | |
:::
## Miscellaneous character table ##
Other important characters that may be encountered when shaping runs
of Myanmar text include the dotted-circle placeholder (`U+25CC`), the
zero-width joiner (`U+200D`) and zero-width non-joiner (`U+200C`), and
the no-break space (`U+00A0`).
The dotted-circle placeholder is frequently used when displaying a
dependent vowel (matra) or a combining mark in isolation. Real-world
text syllables may also use other characters, such as hyphens or dashes,
in a similar placeholder fashion; shaping engines should cope with
this situation gracefully.
:::{table} Miscellaneous character table
| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-------------------------------|
|`U+00A0` | Separator | PLACEHOLDER | _null_ | No-break space |
|`U+200C` | Other | NON_JOINER | _null_ | Zero-width non-joiner |
|`U+200D` | Other | JOINER | _null_ | Zero-width joiner |
|`U+2010` | Punctuation | PLACEHOLDER | _null_ | ‐ Hyphen |
|`U+2011` | Punctuation | PLACEHOLDER | _null_ | ‑ No-break hyphen |
|`U+2012` | Punctuation | PLACEHOLDER | _null_ | ‒ Figure dash |
|`U+2013` | Punctuation | PLACEHOLDER | _null_ | – En dash |
|`U+2014` | Punctuation | PLACEHOLDER | _null_ | — Em dash |
|`U+25CC` | Symbol | DOTTED_CIRCLE | _null_ | ◌ Dotted circle |
:::
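
As a rough sketch, the table above can be held in a small dictionary so that a shaping engine can test whether a character may serve as a base for an isolated mark. The names `MISC_SHAPING_CLASS` and `can_carry_isolated_mark` are hypothetical, and treating `PLACEHOLDER` and `DOTTED_CIRCLE` alike is an assumption made for illustration.

```python
# Shaping classes copied from the miscellaneous character table.
MISC_SHAPING_CLASS = {
    0x00A0: "PLACEHOLDER",    # No-break space
    0x200C: "NON_JOINER",     # Zero-width non-joiner
    0x200D: "JOINER",         # Zero-width joiner
    0x2010: "PLACEHOLDER",    # Hyphen
    0x2011: "PLACEHOLDER",    # No-break hyphen
    0x2012: "PLACEHOLDER",    # Figure dash
    0x2013: "PLACEHOLDER",    # En dash
    0x2014: "PLACEHOLDER",    # Em dash
    0x25CC: "DOTTED_CIRCLE",  # Dotted circle
}

def can_carry_isolated_mark(ch: str) -> bool:
    """True if the table classes `ch` as PLACEHOLDER or DOTTED_CIRCLE."""
    return MISC_SHAPING_CLASS.get(ord(ch)) in ("PLACEHOLDER", "DOTTED_CIRCLE")

print(can_carry_isolated_mark("\u25CC"))  # dotted circle
print(can_carry_isolated_mark("\u200D"))  # zero-width joiner
```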
The zero-width joiner (ZWJ) is primarily used to prevent the formation
of a conjunct from a "_Consonant_,Halant,_Consonant_" sequence. The
sequence "_Consonant_,Halant,ZWJ,_Consonant_" blocks the formation of a
conjunct between the two consonants.
Note, however, that the "_Consonant_,Halant" subsequence in the above
example may still trigger a half-forms feature. To prevent the
application of the half-forms feature in addition to preventing the
conjunct, the zero-width non-joiner (ZWNJ) must be used instead. The
sequence "_Consonant_,Halant,ZWNJ,_Consonant_" should produce the first
consonant in its standard form, followed by an explicit "Halant".
A secondary usage of the zero-width joiner is to prevent the formation of
"Reph". An initial "Ra,Halant,ZWJ" sequence should not produce a "Reph",
where an initial "Ra,Halant" sequence without the zero-width joiner
otherwise would.
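
The joiner rules described above can be summarized in a small sketch. It operates on lists of class names rather than real glyphs, and the function names `joiner_effect` and `reph_allowed` are hypothetical; an actual shaping engine would apply these rules during syllable analysis and feature application.

```python
from typing import Dict, List

def joiner_effect(classes: List[str]) -> Dict[str, bool]:
    """Report which forms a 'Consonant,Halant[,joiner],...' run may take."""
    allowed = {"conjunct": True, "half_form": True}
    if len(classes) >= 3 and classes[:2] == ["Consonant", "Halant"]:
        if classes[2] == "ZWJ":
            # ZWJ blocks the conjunct; a half form may still apply.
            allowed["conjunct"] = False
        elif classes[2] == "ZWNJ":
            # ZWNJ blocks both, leaving an explicit Halant.
            allowed["conjunct"] = False
            allowed["half_form"] = False
    return allowed

def reph_allowed(classes: List[str]) -> bool:
    """An initial 'Ra,Halant' may form Reph unless a ZWJ follows it."""
    if classes[:2] != ["Ra", "Halant"]:
        return False
    return not (len(classes) > 2 and classes[2] == "ZWJ")

print(joiner_effect(["Consonant", "Halant", "ZWJ", "Consonant"]))
print(reph_allowed(["Ra", "Halant", "ZWJ"]))
```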
The no-break space (NBSP) is primarily used to display codepoints
that are defined as non-spacing (marks, dependent vowels or matras,
below-base consonant forms, and post-base consonant forms) in an
isolated context, as an alternative to displaying them superimposed
on the dotted-circle placeholder. Such sequences match
"NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".
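
A minimal sketch of recognizing the three NBSP patterns listed above, again working on class names rather than codepoints. The function name `is_nbsp_isolated_form` is hypothetical, and exact whole-sequence matching is an assumption made to keep the example short.

```python
from typing import List

# The three isolated-display patterns named in the text.
NBSP_PATTERNS = (
    ["NBSP", "ZWJ", "Halant", "Consonant"],
    ["NBSP", "mark"],
    ["NBSP", "matra"],
)

def is_nbsp_isolated_form(classes: List[str]) -> bool:
    """True if the class sequence is exactly one of the NBSP patterns."""
    return any(classes == pattern for pattern in NBSP_PATTERNS)

print(is_nbsp_isolated_form(["NBSP", "matra"]))
print(is_nbsp_isolated_form(["Consonant", "matra"]))
```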