# Myanmar character tables #

This document lists the per-character shaping information needed to
[shape Myanmar text](../opentype-shaping-myanmar.md).

**Contents**

- [Myanmar character table](#myanmar-character-table)
- [Myanmar Extended character tables](#myanmar-extended-character-tables)
- [Vedic Extensions character table](#vedic-extensions-character-table)
- [Miscellaneous character table](#miscellaneous-character-table)

## Myanmar character table ##

Myanmar glyphs should be classified as in the following table.
Codepoints in the Myanmar block with no assigned meaning are
designated as _unassigned_ in the _Unicode category_ column.

Assigned codepoints with a _null_ in the _Shaping class_ column evoke
no special behavior from the shaping engine. Note that this does
include some valid codepoints, such as currency marks, punctuation,
and other symbols.

> Note: the `NUMBER` and `SYMBOL` _Shaping classes_ are important
> during syllable identification, but generally evoke no further
> special behavior during the rest of the shaping process.

The _Mark-placement subclass_ column indicates mark-placement
positioning for codepoints in the _Mark_ category. Assigned, non-mark
codepoints have a _null_ in this column and evoke no special
mark-placement behavior. Marks tagged with [Mn] in the _Unicode
category_ column are categorized as non-spacing; marks tagged with [Mc]
are categorized as spacing-combining.

Some codepoints in the following table use a _Shaping class_ that
differs from the codepoint's Unicode _General Category_. The _Shaping
class_ takes precedence during OpenType shaping, as it captures more
specific, script-aware behavior.
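The precedence of the _Shaping class_ over the Unicode _General
Category_ can be illustrated with a small lookup. This is only a
minimal sketch: the `SHAPING_CLASS_RANGES` list and the
`shaping_class` and `describe` helpers are hypothetical names, and the
ranges cover just a handful of entries taken from the Myanmar
character table, not the full set.

```python
import unicodedata

# A few (first, last, shaping class) ranges copied from the Myanmar
# character table. Deliberately incomplete; for illustration only.
SHAPING_CLASS_RANGES = [
    (0x1000, 0x1020, "CONSONANT"),
    (0x1021, 0x102A, "VOWEL_INDEPENDENT"),
    (0x1036, 0x1036, "BINDU"),
    (0x1039, 0x1039, "INVISIBLE_STACKER"),
    (0x103A, 0x103A, "PURE_KILLER"),
    (0x103B, 0x103E, "CONSONANT_MEDIAL"),
    (0x1040, 0x1049, "NUMBER"),
    (0x104E, 0x104E, "CONSONANT_PLACEHOLDER"),
]


def shaping_class(ch):
    """Return the shaping class for ch, or None for a _null_ entry."""
    cp = ord(ch)
    for first, last, name in SHAPING_CLASS_RANGES:
        if first <= cp <= last:
            return name
    return None


def describe(ch):
    """Pair the Unicode General Category with the shaping class."""
    return (f"U+{ord(ch):04X}", unicodedata.category(ch), shaping_class(ch))


if __name__ == "__main__":
    # U+104E is punctuation in Unicode, but the table assigns it the
    # CONSONANT_PLACEHOLDER shaping class, which takes precedence.
    for ch in ("\u1000", "\u1039", "\u104A", "\u104E"):
        print(describe(ch))
```

A shaping engine would consult the shaping class first and fall back
to the General Category only for codepoints whose shaping class is
_null_.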
:::{table} Myanmar character table

| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+1000` | Letter | CONSONANT | _null_ | က Ka |
|`U+1001` | Letter | CONSONANT | _null_ | ခ Kha |
|`U+1002` | Letter | CONSONANT | _null_ | ဂ Ga |
|`U+1003` | Letter | CONSONANT | _null_ | ဃ Gha |
|`U+1004` | Letter | CONSONANT | _null_ | င Nga |
|`U+1005` | Letter | CONSONANT | _null_ | စ Ca |
|`U+1006` | Letter | CONSONANT | _null_ | ဆ Cha |
|`U+1007` | Letter | CONSONANT | _null_ | ဇ Ja |
|`U+1008` | Letter | CONSONANT | _null_ | ဈ Jha |
|`U+1009` | Letter | CONSONANT | _null_ | ဉ Nya |
|`U+100A` | Letter | CONSONANT | _null_ | ည Nnya |
|`U+100B` | Letter | CONSONANT | _null_ | ဋ Tta |
|`U+100C` | Letter | CONSONANT | _null_ | ဌ Ttha |
|`U+100D` | Letter | CONSONANT | _null_ | ဍ Dda |
|`U+100E` | Letter | CONSONANT | _null_ | ဎ Ddha |
|`U+100F` | Letter | CONSONANT | _null_ | ဏ Nna |
| | | | | |
|`U+1010` | Letter | CONSONANT | _null_ | တ Ta |
|`U+1011` | Letter | CONSONANT | _null_ | ထ Tha |
|`U+1012` | Letter | CONSONANT | _null_ | ဒ Da |
|`U+1013` | Letter | CONSONANT | _null_ | ဓ Dha |
|`U+1014` | Letter | CONSONANT | _null_ | န Na |
|`U+1015` | Letter | CONSONANT | _null_ | ပ Pa |
|`U+1016` | Letter | CONSONANT | _null_ | ဖ Pha |
|`U+1017` | Letter | CONSONANT | _null_ | ဗ Ba |
|`U+1018` | Letter | CONSONANT | _null_ | ဘ Bha |
|`U+1019` | Letter | CONSONANT | _null_ | မ Ma |
|`U+101A` | Letter | CONSONANT | _null_ | ယ Ya |
|`U+101B` | Letter | CONSONANT | _null_ | ရ Ra |
|`U+101C` | Letter | CONSONANT | _null_ | လ La |
|`U+101D` | Letter | CONSONANT | _null_ | ဝ Wa |
|`U+101E` | Letter | CONSONANT | _null_ | သ Sa |
|`U+101F` | Letter | CONSONANT | _null_ | ဟ Ha |
| | | | | |
|`U+1020` | Letter | CONSONANT | _null_ | ဠ Lla |
|`U+1021` | Letter | VOWEL_INDEPENDENT | _null_ | အ A |
|`U+1022` | Letter | VOWEL_INDEPENDENT | _null_ | ဢ Shan A |
|`U+1023` | Letter | VOWEL_INDEPENDENT | _null_ | ဣ I |
|`U+1024` | Letter | VOWEL_INDEPENDENT | _null_ | ဤ Ii |
|`U+1025` | Letter | VOWEL_INDEPENDENT | _null_ | ဥ U |
|`U+1026` | Letter | VOWEL_INDEPENDENT | _null_ | ဦ Uu |
|`U+1027` | Letter | VOWEL_INDEPENDENT | _null_ | ဧ E |
|`U+1028` | Letter | VOWEL_INDEPENDENT | _null_ | ဨ Mon E |
|`U+1029` | Letter | VOWEL_INDEPENDENT | _null_ | ဩ O |
|`U+102A` | Letter | VOWEL_INDEPENDENT | _null_ | ဪ Au |
|`U+102B` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ါ Sign Tall Aa |
|`U+102C` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ာ Sign Aa |
|`U+102D` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ိ Sign I |
|`U+102E` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ီ Sign Ii |
|`U+102F` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ု Sign U |
| | | | | |
|`U+1030` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ူ Sign Uu |
|`U+1031` | Mark [Mc] | VOWEL_DEPENDENT | LEFT_POSITION | ေ Sign E |
|`U+1032` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဲ Sign Ai |
|`U+1033` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဳ Sign Mon Ii |
|`U+1034` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဴ Sign Mon O |
|`U+1035` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ဵ Sign E Above |
|`U+1036` | Mark [Mn] | BINDU | TOP_POSITION | ံ Anusvara |
|`U+1037` | Mark [Mn] | TONE_MARKER | BOTTOM_POSITION | ့ Dot Below |
|`U+1038` | Mark [Mc] | VISARGA | RIGHT_POSITION | း Visarga |
|`U+1039` | Mark [Mn] | INVISIBLE_STACKER | _null_ | ္ Virama |
|`U+103A` | Mark [Mn] | PURE_KILLER | TOP_POSITION | ် Asat |
|`U+103B` | Mark [Mc] | CONSONANT_MEDIAL | RIGHT_POSITION | ျ Sign Medial Ya |
|`U+103C` | Mark [Mc] | CONSONANT_MEDIAL | TOP_LEFT_AND_BOTTOM_POSITION | ြ Sign Medial Ra |
|`U+103D` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ွ Sign Medial Wa |
|`U+103E` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ှ Sign Medial Ha |
|`U+103F` | Letter | CONSONANT | _null_ | ဿ Great Sa |
| | | | | |
|`U+1040` | Number | NUMBER | _null_ | ၀ Digit Zero |
|`U+1041` | Number | NUMBER | _null_ | ၁ Digit One |
|`U+1042` | Number | NUMBER | _null_ | ၂ Digit Two |
|`U+1043` | Number | NUMBER | _null_ | ၃ Digit Three |
|`U+1044` | Number | NUMBER | _null_ | ၄ Digit Four |
|`U+1045` | Number | NUMBER | _null_ | ၅ Digit Five |
|`U+1046` | Number | NUMBER | _null_ | ၆ Digit Six |
|`U+1047` | Number | NUMBER | _null_ | ၇ Digit Seven |
|`U+1048` | Number | NUMBER | _null_ | ၈ Digit Eight |
|`U+1049` | Number | NUMBER | _null_ | ၉ Digit Nine |
|`U+104A` | Punctuation | _null_ | _null_ | ၊ Little Section |
|`U+104B` | Punctuation | _null_ | _null_ | ။ Section |
|`U+104C` | Punctuation | _null_ | _null_ | ၌ Locative |
|`U+104D` | Punctuation | _null_ | _null_ | ၍ Completed |
|`U+104E` | Punctuation | CONSONANT_PLACEHOLDER | _null_ | ၎ Aforementioned |
|`U+104F` | Punctuation | _null_ | _null_ | ၏ Genitive |
| | | | | |
|`U+1050` | Letter | CONSONANT | _null_ | ၐ Sha |
|`U+1051` | Letter | CONSONANT | _null_ | ၑ Ssa |
|`U+1052` | Letter | VOWEL_INDEPENDENT | _null_ | ၒ Vocalic R |
|`U+1053` | Letter | VOWEL_INDEPENDENT | _null_ | ၓ Vocalic Rr |
|`U+1054` | Letter | VOWEL_INDEPENDENT | _null_ | ၔ Vocalic L |
|`U+1055` | Letter | VOWEL_INDEPENDENT | _null_ | ၕ Vocalic Ll |
|`U+1056` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၖ Sign Vocalic R |
|`U+1057` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၗ Sign Vocalic Rr |
|`U+1058` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ၘ Sign Vocalic L |
|`U+1059` | Mark [Mn] | VOWEL_DEPENDENT | BOTTOM_POSITION | ၙ Sign Vocalic Ll |
|`U+105A` | Letter | CONSONANT | _null_ | ၚ Mon Nga |
|`U+105B` | Letter | CONSONANT | _null_ | ၛ Mon Jha |
|`U+105C` | Letter | CONSONANT | _null_ | ၜ Mon Bba |
|`U+105D` | Letter | CONSONANT | _null_ | ၝ Mon Bbe |
|`U+105E` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ၞ Sign Mon Medial Na |
|`U+105F` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ၟ Sign Mon Medial Ma |
| | | | | |
|`U+1060` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ၠ Sign Mon Medial La |
|`U+1061` | Letter | CONSONANT | _null_ | ၡ Sgaw Karen Sha |
|`U+1062` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၢ Sign Sgaw Karen Eu |
|`U+1063` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၣ Tone Sgaw Karen Hathi |
|`U+1064` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၤ Tone Sgaw Karen Ke Pho |
|`U+1065` | Letter | CONSONANT | _null_ | ၥ Western Pwo Karen Tha |
|`U+1066` | Letter | CONSONANT | _null_ | ၦ Western Pwo Karen Pwa |
|`U+1067` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၧ Sign Western Pwo Karen Eu |
|`U+1068` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ၨ Sign Western Pwo Karen Ue |
|`U+1069` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၩ Sign Western Pwo Karen Tone 1 |
|`U+106A` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၪ Sign Western Pwo Karen Tone 2 |
|`U+106B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၫ Sign Western Pwo Karen Tone 3 |
|`U+106C` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၬ Sign Western Pwo Karen Tone 4 |
|`U+106D` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ၭ Sign Western Pwo Karen Tone 5 |
|`U+106E` | Letter | CONSONANT | _null_ | ၮ Eastern Pwo Karen Nna |
|`U+106F` | Letter | CONSONANT | _null_ | ၯ Eastern Pwo Karen Ywa |
| | | | | |
|`U+1070` | Letter | CONSONANT | _null_ | ၰ Eastern Pwo Karen Ghwa |
|`U+1071` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၱ Sign Geba Karen I |
|`U+1072` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၲ Sign Kayah Oe |
|`U+1073` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၳ Sign Kayah U |
|`U+1074` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ၴ Sign Kayah Ee |
|`U+1075` | Letter | CONSONANT | _null_ | ၵ Shan Ka |
|`U+1076` | Letter | CONSONANT | _null_ | ၶ Shan Kha |
|`U+1077` | Letter | CONSONANT | _null_ | ၷ Shan Ga |
|`U+1078` | Letter | CONSONANT | _null_ | ၸ Shan Ca |
|`U+1079` | Letter | CONSONANT | _null_ | ၹ Shan Za |
|`U+107A` | Letter | CONSONANT | _null_ | ၺ Shan Nya |
|`U+107B` | Letter | CONSONANT | _null_ | ၻ Shan Da |
|`U+107C` | Letter | CONSONANT | _null_ | ၼ Shan Na |
|`U+107D` | Letter | CONSONANT | _null_ | ၽ Shan Pha |
|`U+107E` | Letter | CONSONANT | _null_ | ၾ Shan Fa |
|`U+107F` | Letter | CONSONANT | _null_ | ၿ Shan Ba |
| | | | | |
|`U+1080` | Letter | CONSONANT | _null_ | ႀ Shan Tha |
|`U+1081` | Letter | CONSONANT | _null_ | ႁ Shan Ha |
|`U+1082` | Mark [Mn] | CONSONANT_MEDIAL | BOTTOM_POSITION | ႂ Sign Shan Medial Wa |
|`U+1083` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ႃ Sign Shan Aa |
|`U+1084` | Mark [Mc] | VOWEL_DEPENDENT | LEFT_POSITION | ႄ Sign Shan E |
|`U+1085` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ႅ Sign Shan E Above |
|`U+1086` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ႆ Sign Shan Final Y |
|`U+1087` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႇ Sign Shan Tone 2 |
|`U+1088` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႈ Sign Shan Tone 3 |
|`U+1089` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႉ Sign Shan Tone 5 |
|`U+108A` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႊ Sign Shan Tone 6 |
|`U+108B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႋ Sign Shan Council Tone 2 |
|`U+108C` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႌ Sign Shan Council Tone 3 |
|`U+108D` | Mark [Mn] | TONE_MARKER | BOTTOM_POSITION | ႍ Sign Shan Council Emphatic Tone |
|`U+108E` | Letter | CONSONANT | _null_ | ႎ Rumai Palaung Fa |
|`U+108F` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႏ Sign Rumai Palaung Tone 5 |
| | | | | |
|`U+1090` | Number | NUMBER | _null_ | ႐ Shan Digit Zero |
|`U+1091` | Number | NUMBER | _null_ | ႑ Shan Digit One |
|`U+1092` | Number | NUMBER | _null_ | ႒ Shan Digit Two |
|`U+1093` | Number | NUMBER | _null_ | ႓ Shan Digit Three |
|`U+1094` | Number | NUMBER | _null_ | ႔ Shan Digit Four |
|`U+1095` | Number | NUMBER | _null_ | ႕ Shan Digit Five |
|`U+1096` | Number | NUMBER | _null_ | ႖ Shan Digit Six |
|`U+1097` | Number | NUMBER | _null_ | ႗ Shan Digit Seven |
|`U+1098` | Number | NUMBER | _null_ | ႘ Shan Digit Eight |
|`U+1099` | Number | NUMBER | _null_ | ႙ Shan Digit Nine |
|`U+109A` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႚ Sign Khamti Tone 1 |
|`U+109B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ႛ Sign Khamti Tone 3 |
|`U+109C` | Mark [Mc] | VOWEL_DEPENDENT | RIGHT_POSITION | ႜ Sign Aiton A |
|`U+109D` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ႝ Sign Aiton Ai |
|`U+109E` | Symbol | SYMBOL | _null_ | ႞ Shan One |
|`U+109F` | Symbol | SYMBOL | _null_ | ႟ Shan Exclamation |

:::

## Myanmar Extended character tables ##

### Myanmar Extended A character table ###

:::{table} Myanmar Extended-A character table

| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+AA60` | Letter | CONSONANT | _null_ | ꩠ Khamti Ga |
|`U+AA61` | Letter | CONSONANT | _null_ | ꩡ Khamti Ca |
|`U+AA62` | Letter | CONSONANT | _null_ | ꩢ Khamti Cha |
|`U+AA63` | Letter | CONSONANT | _null_ | ꩣ Khamti Ja |
|`U+AA64` | Letter | CONSONANT | _null_ | ꩤ Khamti Jha |
|`U+AA65` | Letter | CONSONANT | _null_ | ꩥ Khamti Nya |
|`U+AA66` | Letter | CONSONANT | _null_ | ꩦ Khamti Tta |
|`U+AA67` | Letter | CONSONANT | _null_ | ꩧ Khamti Ttha |
|`U+AA68` | Letter | CONSONANT | _null_ | ꩨ Khamti Dda |
|`U+AA69` | Letter | CONSONANT | _null_ | ꩩ Khamti Ddha |
|`U+AA6A` | Letter | CONSONANT | _null_ | ꩪ Khamti Dha |
|`U+AA6B` | Letter | CONSONANT | _null_ | ꩫ Khamti Na |
|`U+AA6C` | Letter | CONSONANT | _null_ | ꩬ Khamti Sa |
|`U+AA6D` | Letter | CONSONANT | _null_ | ꩭ Khamti Ha |
|`U+AA6E` | Letter | CONSONANT | _null_ | ꩮ Khamti Hha |
|`U+AA6F` | Letter | CONSONANT | _null_ | ꩯ Khamti Fa |
| | | | | |
|`U+AA70` | Letter | _null_ | _null_ | ꩰ Khamti Reduplication |
|`U+AA71` | Letter | CONSONANT | _null_ | ꩱ Khamti Xa |
|`U+AA72` | Letter | CONSONANT | _null_ | ꩲ Khamti Za |
|`U+AA73` | Letter | CONSONANT | _null_ | ꩳ Khamti Ra |
|`U+AA74` | Letter | CONSONANT_PLACEHOLDER | _null_ | ꩴ Khamti Oay |
|`U+AA75` | Letter | CONSONANT_PLACEHOLDER | _null_ | ꩵ Khamti Qn |
|`U+AA76` | Letter | CONSONANT_PLACEHOLDER | _null_ | ꩶ Khamti Hm |
|`U+AA77` | Symbol | SYMBOL | _null_ | ꩷ Khamti Aiton Exclamation |
|`U+AA78` | Symbol | SYMBOL | _null_ | ꩸ Khamti Aiton One |
|`U+AA79` | Symbol | SYMBOL | _null_ | ꩹ Khamti Aiton Two |
|`U+AA7A` | Letter | CONSONANT | _null_ | ꩺ Khamti Aiton Ra |
|`U+AA7B` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ꩻ Sign Pao Karen Tone |
|`U+AA7C` | Mark [Mn] | TONE_MARKER | TOP_POSITION | ꩼ Sign Tai Laing Tone 2 |
|`U+AA7D` | Mark [Mc] | TONE_MARKER | RIGHT_POSITION | ꩽ Sign Tai Laing Tone 5 |
|`U+AA7E` | Letter | CONSONANT | _null_ | ꩾ Shwe Palaung Cha |
|`U+AA7F` | Letter | CONSONANT | _null_ | ꩿ Shwe Palaung Sha |

:::

### Myanmar Extended B character table ###

:::{table} Myanmar Extended-B character table

| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+A9E0` | Letter | CONSONANT | _null_ | ꧠ Shan Gha |
|`U+A9E1` | Letter | CONSONANT | _null_ | ꧡ Shan Cha |
|`U+A9E2` | Letter | CONSONANT | _null_ | ꧢ Shan Jha |
|`U+A9E3` | Letter | CONSONANT | _null_ | ꧣ Shan Nna |
|`U+A9E4` | Letter | CONSONANT | _null_ | ꧤ Shan Bha |
|`U+A9E5` | Mark [Mn] | VOWEL_DEPENDENT | TOP_POSITION | ꧥ Sign Shan Saw |
|`U+A9E6` | Letter | _null_ | _null_ | ꧦ Shan Reduplication |
|`U+A9E7` | Letter | CONSONANT | _null_ | ꧧ Tai Laing Nya |
|`U+A9E8` | Letter | CONSONANT | _null_ | ꧨ Tai Laing Fa |
|`U+A9E9` | Letter | CONSONANT | _null_ | ꧩ Tai Laing Ga |
|`U+A9EA` | Letter | CONSONANT | _null_ | ꧪ Tai Laing Gha |
|`U+A9EB` | Letter | CONSONANT | _null_ | ꧫ Tai Laing Ja |
|`U+A9EC` | Letter | CONSONANT | _null_ | ꧬ Tai Laing Jha |
|`U+A9ED` | Letter | CONSONANT | _null_ | ꧭ Tai Laing Dda |
|`U+A9EE` | Letter | CONSONANT | _null_ | ꧮ Tai Laing Ddha |
|`U+A9EF` | Letter | CONSONANT | _null_ | ꧯ Tai Laing Nna |
| | | | | |
|`U+A9F0` | Number | NUMBER | _null_ | ꧰ Tai Laing Digit Zero |
|`U+A9F1` | Number | NUMBER | _null_ | ꧱ Tai Laing Digit One |
|`U+A9F2` | Number | NUMBER | _null_ | ꧲ Tai Laing Digit Two |
|`U+A9F3` | Number | NUMBER | _null_ | ꧳ Tai Laing Digit Three |
|`U+A9F4` | Number | NUMBER | _null_ | ꧴ Tai Laing Digit Four |
|`U+A9F5` | Number | NUMBER | _null_ | ꧵ Tai Laing Digit Five |
|`U+A9F6` | Number | NUMBER | _null_ | ꧶ Tai Laing Digit Six |
|`U+A9F7` | Number | NUMBER | _null_ | ꧷ Tai Laing Digit Seven |
|`U+A9F8` | Number | NUMBER | _null_ | ꧸ Tai Laing Digit Eight |
|`U+A9F9` | Number | NUMBER | _null_ | ꧹ Tai Laing Digit Nine |
|`U+A9FA` | Letter | CONSONANT | _null_ | ꧺ Tai Laing Lla |
|`U+A9FB` | Letter | CONSONANT | _null_ | ꧻ Tai Laing Da |
|`U+A9FC` | Letter | CONSONANT | _null_ | ꧼ Tai Laing Dha |
|`U+A9FD` | Letter | CONSONANT | _null_ | ꧽ Tai Laing Ba |
|`U+A9FE` | Letter | CONSONANT | _null_ | ꧾ Tai Laing Bha |
|`U+A9FF` | _unassigned_ | | | |

:::

### Myanmar Extended C character table ###

:::{table} Myanmar Extended-C character table

| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+116D0` | Number | NUMBER | _null_ | 𑛐 Pao Digit Zero |
|`U+116D1` | Number | NUMBER | _null_ | 𑛑 Pao Digit One |
|`U+116D2` | Number | NUMBER | _null_ | 𑛒 Pao Digit Two |
|`U+116D3` | Number | NUMBER | _null_ | 𑛓 Pao Digit Three |
|`U+116D4` | Number | NUMBER | _null_ | 𑛔 Pao Digit Four |
|`U+116D5` | Number | NUMBER | _null_ | 𑛕 Pao Digit Five |
|`U+116D6` | Number | NUMBER | _null_ | 𑛖 Pao Digit Six |
|`U+116D7` | Number | NUMBER | _null_ | 𑛗 Pao Digit Seven |
|`U+116D8` | Number | NUMBER | _null_ | 𑛘 Pao Digit Eight |
|`U+116D9` | Number | NUMBER | _null_ | 𑛙 Pao Digit Nine |
|`U+116DA` | Number | NUMBER | _null_ | 𑛚 Eastern Pwo Karen Digit Zero |
|`U+116DB` | Number | NUMBER | _null_ | 𑛛 Eastern Pwo Karen Digit One |
|`U+116DC` | Number | NUMBER | _null_ | 𑛜 Eastern Pwo Karen Digit Two |
|`U+116DD` | Number | NUMBER | _null_ | 𑛝 Eastern Pwo Karen Digit Three |
|`U+116DE` | Number | NUMBER | _null_ | 𑛞 Eastern Pwo Karen Digit Four |
|`U+116DF` | Number | NUMBER | _null_ | 𑛟 Eastern Pwo Karen Digit Five |
| | | | | |
|`U+116E0` | Number | NUMBER | _null_ | 𑛠 Eastern Pwo Karen Digit Six |
|`U+116E1` | Number | NUMBER | _null_ | 𑛡 Eastern Pwo Karen Digit Seven |
|`U+116E2` | Number | NUMBER | _null_ | 𑛢 Eastern Pwo Karen Digit Eight |
|`U+116E3` | Number | NUMBER | _null_ | 𑛣 Eastern Pwo Karen Digit Nine |
|`U+116E4` | _unassigned_ | | | |
|`U+116E5` | _unassigned_ | | | |
|`U+116E6` | _unassigned_ | | | |
|`U+116E7` | _unassigned_ | | | |
|`U+116E8` | _unassigned_ | | | |
|`U+116E9` | _unassigned_ | | | |
|`U+116EA` | _unassigned_ | | | |
|`U+116EB` | _unassigned_ | | | |
|`U+116EC` | _unassigned_ | | | |
|`U+116ED` | _unassigned_ | | | |
|`U+116EE` | _unassigned_ | | | |
|`U+116EF` | _unassigned_ | | | |
| | | | | |
|`U+116F0` | _unassigned_ | | | |
|`U+116F1` | _unassigned_ | | | |
|`U+116F2` | _unassigned_ | | | |
|`U+116F3` | _unassigned_ | | | |
|`U+116F4` | _unassigned_ | | | |
|`U+116F5` | _unassigned_ | | | |
|`U+116F6` | _unassigned_ | | | |
|`U+116F7` | _unassigned_ | | | |
|`U+116F8` | _unassigned_ | | | |
|`U+116F9` | _unassigned_ | | | |
|`U+116FA` | _unassigned_ | | | |
|`U+116FB` | _unassigned_ | | | |
|`U+116FC` | _unassigned_ | | | |
|`U+116FD` | _unassigned_ | | | |
|`U+116FE` | _unassigned_ | | | |
|`U+116FF` | _unassigned_ | | | |

:::

## Vedic Extensions character table ##

Sanskrit runs written in the Myanmar script may also include
characters from the Vedic Extensions block. These characters should be
classified as follows.

> Note: See the [Vedic Extensions](../opentype-shaping-vedic-extensions.md)
> document for additional information.

:::{table} Vedic Extensions character table

| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-----------------------------|
|`U+1CD0` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳐ Tone Karshana |
|`U+1CD1` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳑ Tone Shara |
|`U+1CD2` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳒ Tone Prenkha |
|`U+1CD3` | Punctuation | _null_ | _null_ | ᳓ Sign Nihshvasa |
|`U+1CD4` | Mark [Mn] | CANTILLATION | OVERSTRUCK | ᳔ Tone Midline Svarita |
|`U+1CD5` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳕ Tone Aggravated Independent Svarita |
|`U+1CD6` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳖ Tone Independent Svarita |
|`U+1CD7` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳗ Tone Kathaka Independent Svarita |
|`U+1CD8` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳘ Tone Candra Below |
|`U+1CD9` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳙ Tone Kathaka Independent Svarita Schroeder |
|`U+1CDA` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳚ Tone Double Svarita |
|`U+1CDB` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳛ Tone Triple Svarita |
|`U+1CDC` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳜ Tone Kathaka Anudatta |
|`U+1CDD` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳝ Tone Dot Below |
|`U+1CDE` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳞ Tone Two Dots Below |
|`U+1CDF` | Mark [Mn] | CANTILLATION | BOTTOM_POSITION | ᳟ Tone Three Dots Below |
| | | | | |
|`U+1CE0` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳠ Tone Rigvedic Kashmiri Independent Svarita |
|`U+1CE1` | Mark [Mc] | CANTILLATION | RIGHT_POSITION | ᳡ Tone Atharvavedic Independent Svarita |
|`U+1CE2` | Mark [Mn] | AVAGRAHA | OVERSTRUCK | ᳢ Sign Visarga Svarita |
|`U+1CE3` | Mark [Mn] | _null_ | OVERSTRUCK | ᳣ Sign Visarga Udatta |
|`U+1CE4` | Mark [Mn] | _null_ | OVERSTRUCK | ᳤ Sign Reversed Visarga Udatta |
|`U+1CE5` | Mark [Mn] | _null_ | OVERSTRUCK | ᳥ Sign Visarga Anudatta |
|`U+1CE6` | Mark [Mn] | _null_ | OVERSTRUCK | ᳦ Sign Reversed Visarga Anudatta |
|`U+1CE7` | Mark [Mn] | _null_ | OVERSTRUCK | ᳧ Sign Visarga Udatta With Tail |
|`U+1CE8` | Mark [Mn] | AVAGRAHA | OVERSTRUCK | ᳨ Sign Visarga Anudatta With Tail |
|`U+1CE9` | Letter | SYMBOL | _null_ | ᳩ Sign Anusvara Antargomukha |
|`U+1CEA` | Letter | _null_ | _null_ | ᳪ Sign Anusvara Bahirgomukha |
|`U+1CEB` | Letter | _null_ | _null_ | ᳫ Sign Anusvara Vamagomukha |
|`U+1CEC` | Letter | SYMBOL | _null_ | ᳬ Sign Anusvara Vamagomukha With Tail |
|`U+1CED` | Mark [Mn] | AVAGRAHA | BOTTOM_POSITION | ᳭ Sign Tiryak |
|`U+1CEE` | Letter | SYMBOL | _null_ | ᳮ Sign Hexiform Long Anusvara |
|`U+1CEF` | Letter | _null_ | _null_ | ᳯ Sign Long Anusvara |
| | | | | |
|`U+1CF0` | Letter | _null_ | _null_ | ᳰ Sign Rthang Long Anusvara |
|`U+1CF2` | Letter | CONSONANT_DEAD | _null_ | ᳲ Sign Ardhavisarga |
|`U+1CF3` | Letter | CONSONANT_DEAD | _null_ | ᳳ Sign Rotated Ardhavisarga |
|`U+1CF4` | Mark [Mn] | CANTILLATION | TOP_POSITION | ᳴ Tone Candra Above |
|`U+1CF5` | Letter | CONSONANT_WITH_STACKER | _null_ | ᳵ Sign Jihvamuliya |
|`U+1CF6` | Letter | CONSONANT_WITH_STACKER | _null_ | ᳶ Sign Upadhmaniya |
|`U+1CF7` | Mark [Mc] | _null_ | _null_ | ᳷ Sign Atikrama |
|`U+1CF8` | Mark [Mn] | CANTILLATION | _null_ | ᳸ Tone Ring Above |
|`U+1CF9` | Mark [Mn] | CANTILLATION | _null_ | ᳹ Tone Double Ring Above |
|`U+1CFA` | Letter | PLACEHOLDER | _null_ | ᳺ Sign Double Anusvara Antargomukha |
|`U+1CFB` | _unassigned_ | | | |
|`U+1CFC` | _unassigned_ | | | |
|`U+1CFD` | _unassigned_ | | | |
|`U+1CFE` | _unassigned_ | | | |
|`U+1CFF` | _unassigned_ | | | |

:::

## Miscellaneous character table ##

Other important characters that may be encountered when shaping runs
of Myanmar text include the dotted-circle placeholder (`U+25CC`), the
zero-width joiner (`U+200D`) and zero-width non-joiner (`U+200C`), and
the no-break space (`U+00A0`).

The dotted-circle placeholder is frequently used when displaying a
dependent vowel (matra) or a combining mark in isolation. Real-world
text syllables may also use other characters, such as hyphens or dashes,
in a similar placeholder fashion; shaping engines should cope with
this situation gracefully.

:::{table} Miscellaneous character table

| Codepoint | Unicode category | Shaping class | Mark-placement subclass | Glyph |
|:----------|:-----------------|:------------------|:---------------------------|:-------------------------------|
|`U+00A0` | Separator | PLACEHOLDER | _null_ |   No-break space |
|`U+200C` | Other | NON_JOINER | _null_ | ‌ Zero-width non-joiner |
|`U+200D` | Other | JOINER | _null_ | ‍ Zero-width joiner |
|`U+2010` | Punctuation | PLACEHOLDER | _null_ | ‐ Hyphen |
|`U+2011` | Punctuation | PLACEHOLDER | _null_ | ‑ No-break hyphen |
|`U+2012` | Punctuation | PLACEHOLDER | _null_ | ‒ Figure dash |
|`U+2013` | Punctuation | PLACEHOLDER | _null_ | – En dash |
|`U+2014` | Punctuation | PLACEHOLDER | _null_ | — Em dash |
|`U+25CC` | Symbol | DOTTED_CIRCLE | _null_ | ◌ Dotted circle |

:::

The zero-width joiner (ZWJ) is primarily used to prevent the formation
of a conjunct from a "_Consonant_,Halant,_Consonant_" sequence. The sequence
"_Consonant_,Halant,ZWJ,_Consonant_" blocks the formation of a
conjunct between the two consonants.

Note, however, that the "_Consonant_,Halant" subsequence in the above
example may still trigger a half-forms feature. To prevent the
application of the half-forms feature in addition to preventing the
conjunct, the zero-width non-joiner (ZWNJ) must be used instead. The sequence
"_Consonant_,Halant,ZWNJ,_Consonant_" should produce the first
consonant in its standard form, followed by an explicit "Halant".

A secondary usage of the zero-width joiner is to prevent the formation of
"Reph". An initial "Ra,Halant,ZWJ" sequence should not produce a "Reph",
where an initial "Ra,Halant" sequence without the zero-width joiner
otherwise would.

The no-break space (NBSP) is primarily used to display those codepoints that
are defined as non-spacing (marks, dependent vowels (matras),
below-base consonant forms, and post-base consonant forms) in an
isolated context, as an alternative to displaying them superimposed on
the dotted-circle placeholder. These sequences will match
"NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".
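
The joiner and no-break-space rules above can be summarized in a few
lines of code. The following is a minimal sketch, not a shaping-engine
implementation: the function names, the dictionary keys, and the
class-name tokens (`"NBSP"`, `"HALANT"`, `"MARK"`, `"MATRA"`, and so on)
are hypothetical and chosen only for illustration. It records what is
permitted after a "_Consonant_,Halant" pair and checks a sequence of
class names against the three NBSP patterns listed above.

```python
ZWNJ = "\u200C"  # zero-width non-joiner
ZWJ = "\u200D"   # zero-width joiner


def after_halant_behavior(next_char):
    """Describe what may happen to a Consonant,Halant pair,
    based on the character that immediately follows it."""
    if next_char == ZWJ:
        # ZWJ blocks the conjunct but may still allow a half form.
        return {"conjunct_allowed": False, "half_form_allowed": True}
    if next_char == ZWNJ:
        # ZWNJ blocks both, leaving the consonant in its standard
        # form followed by an explicit Halant.
        return {"conjunct_allowed": False, "half_form_allowed": False}
    # No joiner present: nothing is blocked at this stage.
    return {"conjunct_allowed": True, "half_form_allowed": True}


# The three isolated-display patterns, written as tuples of
# hypothetical class-name tokens.
NBSP_PATTERNS = {
    ("NBSP", "ZWJ", "HALANT", "CONSONANT"),
    ("NBSP", "MARK"),
    ("NBSP", "MATRA"),
}


def is_isolated_nbsp_sequence(classes):
    """Return True if the class sequence matches an NBSP pattern."""
    return tuple(classes) in NBSP_PATTERNS


if __name__ == "__main__":
    print(after_halant_behavior(ZWJ))
    print(after_halant_behavior(ZWNJ))
    print(is_isolated_nbsp_sequence(["NBSP", "MATRA"]))
```

Whether a conjunct or half form is actually produced when it is
allowed still depends on the lookups available in the active font.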