Arabic character tables

This document lists the per-character shaping information needed to shape Arabic text.

Contents

Arabic character table

Arabic glyphs should be classified as in the following table. Codepoints in the Arabic block with no assigned meaning are designated as unassigned in the Unicode category column.

The Joining type column indicates whether each codepoint is defined as joining with adjacent characters on the left side, right side, left and right sides (“DUAL”), or neither side (“NON_JOINING”). Codepoints designated TRANSPARENT in the Joining type column do not join with adjacent characters and, in addition, do not affect the joining behavior of surrounding characters. Non-spacing marks are of type TRANSPARENT. Codepoints designated JOIN_CAUSING force adjacent characters to join.

The Joining group column lists the fundamental letter that the listed codepoint behaves like for joining purposes.

Assigned codepoints with a null in the Joining group column evoke no special behavior from the shaping engine during the join-computation stage.

The Mark class column indicates the Canonical Combining Class for the codepoint. Marks are assigned non-zero combining classes so that sequences of adjacent marks can be reordered as required by the orthography.

For Arabic, a subset of marks in the 220 and 230 classes are also designated Modifier Combining Marks (MCM). These are denoted with 220_MCM and 230_MCM in the Mark class column. The MCM marks are treated differently during the mark-reordering stage.

Table 2 Arabic block table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+0600

Other

NON_JOINING

null

0

؀ Number Sign

U+0601

Other

NON_JOINING

null

0

؁ Sign Sanah

U+0602

Other

NON_JOINING

null

0

؂ Footnote Marker

U+0603

Other

NON_JOINING

null

0

؃ Sign Safha

U+0604

Other

NON_JOINING

null

0

؄ Sign Samvat

U+0605

Other

NON_JOINING

null

0

؅ Number Mark Above

U+0606

Symbol

NON_JOINING

null

0

؆ Cube Root

U+0607

Symbol

NON_JOINING

null

0

؇ Fourth Root

U+0608

Symbol

NON_JOINING

null

0

؈ Ray

U+0609

Punctuation

NON_JOINING

null

0

؉ Per Mille

U+060A

Punctuation

NON_JOINING

null

0

؊ Per Ten Thousand

U+060B

Symbol

NON_JOINING

null

0

؋ Afghani Sign

U+060C

Punctuation

NON_JOINING

null

0

، Comma

U+060D

Punctuation

NON_JOINING

null

0

؍ Date Separator

U+060E

Symbol

NON_JOINING

null

0

؎ Poetic Verse Sign

U+060F

Symbol

NON_JOINING

null

0

؏ Sign Misra

U+0610

Mark [Mn]

TRANSPARENT

null

230

ؐ Sign Sallallahou Alayhe Wassallam

U+0611

Mark [Mn]

TRANSPARENT

null

230

ؑ Sign Alayhe Assallam

U+0612

Mark [Mn]

TRANSPARENT

null

230

ؒ Sign Rahmatullah Alayhe

U+0613

Mark [Mn]

TRANSPARENT

null

230

ؓ Sign Radi Allahou Anhu

U+0614

Mark [Mn]

TRANSPARENT

null

230

ؔ Sign Takhallus

U+0615

Mark [Mn]

TRANSPARENT

null

230

ؕ Small High Tah

U+0616

Mark [Mn]

TRANSPARENT

null

230

ؖ Small High Alef Lam Yeh

U+0617

Mark [Mn]

TRANSPARENT

null

230

ؗ Small High Zain

U+0618

Mark [Mn]

TRANSPARENT

null

30

ؘ Small Fatha

U+0619

Mark [Mn]

TRANSPARENT

null

31

ؙ Small Damma

U+061A

Mark [Mn]

TRANSPARENT

null

32

ؚ Small Kasra

U+061B

Punctuation

NON_JOINING

null

0

؛ Semicolon

U+061C

Other

TRANSPARENT

null

0

؜ Arabic Letter Mark

U+061D

Punctuation

NON_JOINING

null

0

؝ End Of Text Mark

U+061E

Punctuation

NON_JOINING

null

0

؞ Triple Dot Punctuation Mark

U+061F

Punctuation

NON_JOINING

null

0

؟ Question Mark

U+0620

Letter

DUAL

YEH

0

ؠ Kashmiri Yeh

U+0621

Letter

NON_JOINING

null

0

ء Hamza

U+0622

Letter

RIGHT

ALEF

0

آ Alef With Madda Above

U+0623

Letter

RIGHT

ALEF

0

أ Alef With Hamza Above

U+0624

Letter

RIGHT

WAW

0

ؤ Waw With Hamza Above

U+0625

Letter

RIGHT

ALEF

0

إ Alef With Hamza Below

U+0626

Letter

DUAL

YEH

0

ئ Dotless Yeh With Hamza Above

U+0627

Letter

RIGHT

ALEF

0

ا Alef

U+0628

Letter

DUAL

BEH

0

ب Beh

U+0629

Letter

RIGHT

TEH_MARBUTA

0

ة Teh Marbuta

U+062A

Letter

DUAL

BEH

0

ت Dotless Beh With 2 Dots Above

U+062B

Letter

DUAL

BEH

0

ث Dotless Beh With 3 Dots Above

U+062C

Letter

DUAL

HAH

0

ج Hah With Dot Below

U+062D

Letter

DUAL

HAH

0

ح Hah

U+062E

Letter

DUAL

HAH

0

خ Hah With Dot Above

U+062F

Letter

RIGHT

DAL

0

د Dal

U+0630

Letter

RIGHT

DAL

0

ذ Dal With Dot Above

U+0631

Letter

RIGHT

REH

0

ر Reh

U+0632

Letter

RIGHT

REH

0

ز Reh With Dot Above

U+0633

Letter

DUAL

SEEN

0

س Seen

U+0634

Letter

DUAL

SEEN

0

ش Seen With 3 Dots Above

U+0635

Letter

DUAL

SAD

0

ص Sad

U+0636

Letter

DUAL

SAD

0

ض Sad With Dot Above

U+0637

Letter

DUAL

TAH

0

ط Tah

U+0638

Letter

DUAL

TAH

0

ظ Tah With Dot Above

U+0639

Letter

DUAL

AIN

0

ع Ain

U+063A

Letter

DUAL

AIN

0

غ Ain With Dot Above

U+063B

Letter

DUAL

GAF

0

ػ Keheh With 2 Dots Above

U+063C

Letter

DUAL

GAF

0

ؼ Keheh With 3 Dots Below

U+063D

Letter

DUAL

FARSI_YEH

0

ؽ Farsi Yeh With Inverted V Above

U+063E

Letter

DUAL

FARSI_YEH

0

ؾ Farsi Yeh With 2 Dots Above

U+063F

Letter

DUAL

FARSI_YEH

0

ؿ Farsi Yeh With 3 Dots Above

U+0640

Letter modifier

JOIN_CAUSING

null

0

ـ Tatweel

U+0641

Letter

DUAL

FEH

0

ف Feh

U+0642

Letter

DUAL

QAF

0

ق Qaf

U+0643

Letter

DUAL

KAF

0

ك Kaf

U+0644

Letter

DUAL

LAM

0

ل Lam

U+0645

Letter

DUAL

MEEM

0

م Meem

U+0646

Letter

DUAL

NOON

0

ن Noon

U+0647

Letter

DUAL

HEH

0

ه Heh

U+0648

Letter

RIGHT

WAW

0

و Waw

U+0649

Letter

DUAL

YEH

0

ى Dotless Yeh

U+064A

Letter

DUAL

YEH

0

ي Yeh

U+064B

Mark [Mn]

TRANSPARENT

null

27

ً Fathatan

U+064C

Mark [Mn]

TRANSPARENT

null

28

ٌ Dammatan

U+064D

Mark [Mn]

TRANSPARENT

null

29

ٍ Kasratan

U+064E

Mark [Mn]

TRANSPARENT

null

30

َ Fatha

U+064F

Mark [Mn]

TRANSPARENT

null

31

ُ Damma

U+0650

Mark [Mn]

TRANSPARENT

null

32

ِ Kasra

U+0651

Mark [Mn]

TRANSPARENT

null

33

ّ Shadda

U+0652

Mark [Mn]

TRANSPARENT

null

34

ْ Sukun

U+0653

Mark [Mn]

TRANSPARENT

null

230

ٓ Maddah Above

U+0654

Mark [Mn]

TRANSPARENT

null

230_MCM

ٔ Hamza Above

U+0655

Mark [Mn]

TRANSPARENT

null

220_MCM

ٕ Hamza Below

U+0656

Mark [Mn]

TRANSPARENT

null

220

ٖ Subscript Alef

U+0657

Mark [Mn]

TRANSPARENT

null

230

ٗ Inverted Damma

U+0658

Mark [Mn]

TRANSPARENT

null

230_MCM

٘ Noon Ghunna

U+0659

Mark [Mn]

TRANSPARENT

null

230

ٙ Zwarakay

U+065A

Mark [Mn]

TRANSPARENT

null

230

ٚ Vowel Sign Small V Above

U+065B

Mark [Mn]

TRANSPARENT

null

230

ٛ Vowel Sign Inverted Small V Above

U+065C

Mark [Mn]

TRANSPARENT

null

220

ٜ Vowel Sign Dot Below

U+065D

Mark [Mn]

TRANSPARENT

null

230

ٝ Reversed Damma

U+065E

Mark [Mn]

TRANSPARENT

null

230

ٞ Fatha with Two Dots

U+065F

Mark [Mn]

TRANSPARENT

null

220

ٟ Wavy Hamza Below

U+0660

Number

NON_JOINING

null

0

٠ Digit Zero

U+0661

Number

NON_JOINING

null

0

١ Digit One

U+0662

Number

NON_JOINING

null

0

٢ Digit Two

U+0663

Number

NON_JOINING

null

0

٣ Digit Three

U+0664

Number

NON_JOINING

null

0

٤ Digit Four

U+0665

Number

NON_JOINING

null

0

٥ Digit Five

U+0666

Number

NON_JOINING

null

0

٦ Digit Six

U+0667

Number

NON_JOINING

null

0

٧ Digit Seven

U+0668

Number

NON_JOINING

null

0

٨ Digit Eight

U+0669

Number

NON_JOINING

null

0

٩ Digit Nine

U+066A

Punctuation

NON_JOINING

null

0

٪ Percent Sign

U+066B

Punctuation

NON_JOINING

null

0

٫ Decimal Separator

U+066C

Punctuation

NON_JOINING

null

0

٬ Thousands Separator

U+066D

Punctuation

NON_JOINING

null

0

٭ Five Pointed Star

U+066E

Letter

DUAL

BEH

0

ٮ Dotless Beh

U+066F

Letter

DUAL

QAF

0

ٯ Dotless Qaf

U+0670

Mark [Mn]

TRANSPARENT

null

35

ٰ Superscript Alef

U+0671

Letter

RIGHT

ALEF

0

ٱ Alef With Wasla Above

U+0672

Letter

RIGHT

ALEF

0

ٲ Alef With Wavy Hamza Above

U+0673

Letter

RIGHT

ALEF

0

ٳ Alef With Wavy Hamza Below

U+0674

Letter

NON_JOINING

null

0

ٴ High Hamza

U+0675

Letter

RIGHT

ALEF

0

ٵ High Hamza Alef

U+0676

Letter

RIGHT

WAW

0

ٶ High Hamza Waw

U+0677

Letter

RIGHT

WAW

0

ٷ High Hamza Waw With Damma Above

U+0678

Letter

DUAL

YEH

0

ٸ High Hamza Dotless Yeh

U+0679

Letter

DUAL

BEH

0

ٹ Dotless Beh With Tah Above

U+067A

Letter

DUAL

BEH

0

ٺ Dotless Beh With Vertical 2 Dots Above

U+067B

Letter

DUAL

BEH

0

ٻ Dotless Beh With Vertical 2 Dots Below

U+067C

Letter

DUAL

BEH

0

ټ Dotless Beh With Attached Ring Below And 2 Dots Above

U+067D

Letter

DUAL

BEH

0

ٽ Dotless Beh With Inverted 3 Dots Above

U+067E

Letter

DUAL

BEH

0

پ Dotless Beh With 3 Dots Below

U+067F

Letter

DUAL

BEH

0

ٿ Dotless Beh With 4 Dots Above

U+0680

Letter

DUAL

BEH

0

ڀ Dotless Beh With 4 Dots Below

U+0681

Letter

DUAL

HAH

0

ځ Hah With Hamza Above

U+0682

Letter

DUAL

HAH

0

ڂ Hah With Vertical 2 Dots Above

U+0683

Letter

DUAL

HAH

0

ڃ Hah With 2 Dots Below

U+0684

Letter

DUAL

HAH

0

ڄ Hah With Vertical 2 Dots Below

U+0685

Letter

DUAL

HAH

0

څ Hah With 3 Dots Above

U+0686

Letter

DUAL

HAH

0

چ Hah With 3 Dots Below

U+0687

Letter

DUAL

HAH

0

ڇ Hah With 4 Dots Below

U+0688

Letter

RIGHT

DAL

0

ڈ Dal With Tah Above

U+0689

Letter

RIGHT

DAL

0

ډ Dal With Attached Ring Below

U+068A

Letter

RIGHT

DAL

0

ڊ Dal With Dot Below

U+068B

Letter

RIGHT

DAL

0

ڋ Dal With Dot Below And Tah Above

U+068C

Letter

RIGHT

DAL

0

ڌ Dal With 2 Dots Above

U+068D

Letter

RIGHT

DAL

0

ڍ Dal With 2 Dots Below

U+068E

Letter

RIGHT

DAL

0

ڎ Dal With 3 Dots Above

U+068F

Letter

RIGHT

DAL

0

ڏ Dal With Inverted 3 Dots Above

U+0690

Letter

RIGHT

DAL

0

ڐ Dal With 4 Dots Above

U+0691

Letter

RIGHT

REH

0

ڑ Reh With Tah Above

U+0692

Letter

RIGHT

REH

0

ڒ Reh With V Above

U+0693

Letter

RIGHT

REH

0

ړ Reh With Attached Ring Below

U+0694

Letter

RIGHT

REH

0

ڔ Reh With Dot Below

U+0695

Letter

RIGHT

REH

0

ڕ Reh With V Below

U+0696

Letter

RIGHT

REH

0

ږ Reh With Dot Below And Dot Within

U+0697

Letter

RIGHT

REH

0

ڗ Reh With 2 Dots Above

U+0698

Letter

RIGHT

REH

0

ژ Reh With 3 Dots Above

U+0699

Letter

RIGHT

REH

0

ڙ Reh With 4 Dots Above

U+069A

Letter

DUAL

SEEN

0

ښ Seen With Dot Below And Dot Above

U+069B

Letter

DUAL

SEEN

0

ڛ Seen With 3 Dots Below

U+069C

Letter

DUAL

SEEN

0

ڜ Seen With 3 Dots Below And 3 Dots Above

U+069D

Letter

DUAL

SAD

0

ڝ Sad With 2 Dots Below

U+069E

Letter

DUAL

SAD

0

ڞ Sad With 3 Dots Above

U+069F

Letter

DUAL

TAH

0

ڟ Tah With 3 Dots Above

U+06A0

Letter

DUAL

AIN

0

ڠ Ain With 3 Dots Above

U+06A1

Letter

DUAL

FEH

0

ڡ Dotless Feh

U+06A2

Letter

DUAL

FEH

0

ڢ Dotless Feh With Dot Below

U+06A3

Letter

DUAL

FEH

0

ڣ Feh With Dot Below

U+06A4

Letter

DUAL

FEH

0

ڤ Dotless Feh With 3 Dots Above

U+06A5

Letter

DUAL

FEH

0

ڥ Dotless Feh With 3 Dots Below

U+06A6

Letter

DUAL

FEH

0

ڦ Dotless Feh With 4 Dots Above

U+06A7

Letter

DUAL

QAF

0

ڧ Dotless Qaf With Dot Above

U+06A8

Letter

DUAL

QAF

0

ڨ Dotless Qaf With 3 Dots Above

U+06A9

Letter

DUAL

GAF

0

ک Keheh

U+06AA

Letter

DUAL

SWASH_KAF

0

ڪ Swash Kaf

U+06AB

Letter

DUAL

GAF

0

ګ Keheh With Attached Ring Below

U+06AC

Letter

DUAL

KAF

0

ڬ Kaf With Dot Above

U+06AD

Letter

DUAL

KAF

0

ڭ Kaf With 3 Dots Above

U+06AE

Letter

DUAL

KAF

0

ڮ Kaf With 3 Dots Below

U+06AF

Letter

DUAL

GAF

0

گ Gaf

U+06B0

Letter

DUAL

GAF

0

ڰ Gaf With Attached Ring Below

U+06B1

Letter

DUAL

GAF

0

ڱ Gaf With 2 Dots Above

U+06B2

Letter

DUAL

GAF

0

ڲ Gaf With 2 Dots Below

U+06B3

Letter

DUAL

GAF

0

ڳ Gaf With Vertical 2 Dots Below

U+06B4

Letter

DUAL

GAF

0

ڴ Gaf With 3 Dots Above

U+06B5

Letter

DUAL

LAM

0

ڵ Lam With V Above

U+06B6

Letter

DUAL

LAM

0

ڶ Lam With Dot Above

U+06B7

Letter

DUAL

LAM

0

ڷ Lam With 3 Dots Above

U+06B8

Letter

DUAL

LAM

0

ڸ Lam With 3 Dots Below

U+06B9

Letter

DUAL

NOON

0

ڹ Noon With Dot Below

U+06BA

Letter

DUAL

NOON

0

ں Dotless Noon

U+06BB

Letter

DUAL

NOON

0

ڻ Dotless Noon With Tah Above

U+06BC

Letter

DUAL

NOON

0

ڼ Noon With Attached Ring Below

U+06BD

Letter

DUAL

NYA

0

ڽ Nya

U+06BE

Letter

DUAL

KNOTTED_HEH

0

ھ Knotted Heh

U+06BF

Letter

DUAL

HAH

0

ڿ Hah With 3 Dots Below And Dot Above

U+06C0

Letter

RIGHT

TEH_MARBUTA

0

ۀ Dotless Teh Marbuta With Hamza Above

U+06C1

Letter

DUAL

HEH_GOAL

0

ہ Heh Goal

U+06C2

Letter

DUAL

HEH_GOAL

0

ۂ Heh Goal With Hamza Above

U+06C3

Letter

RIGHT

TEH_MARBUTA_GOAL

0

ۃ Teh Marbuta Goal

U+06C4

Letter

RIGHT

WAW

0

ۄ Waw With Attached Ring Within

U+06C5

Letter

RIGHT

WAW

0

ۅ Waw With Bar

U+06C6

Letter

RIGHT

WAW

0

ۆ Waw With V Above

U+06C7

Letter

RIGHT

WAW

0

ۇ Waw With Damma Above

U+06C8

Letter

RIGHT

WAW

0

ۈ Waw With Alef Above

U+06C9

Letter

RIGHT

WAW

0

ۉ Waw With Inverted V Above

U+06CA

Letter

RIGHT

WAW

0

ۊ Waw With 2 Dots Above

U+06CB

Letter

RIGHT

WAW

0

ۋ Waw With 3 Dots Above

U+06CC

Letter

DUAL

FARSI_YEH

0

ی Farsi Yeh

U+06CD

Letter

RIGHT

YEH_WITH_TAIL

0

ۍ Yeh With Tail

U+06CE

Letter

DUAL

FARSI_YEH

0

ێ Farsi Yeh With V Above

U+06CF

Letter

RIGHT

WAW

0

ۏ Waw With Dot Above

U+06D0

Letter

DUAL

YEH

0

ې Dotless Yeh With Vertical 2 Dots Below

U+06D1

Letter

DUAL

YEH

0

ۑ Dotless Yeh With 3 Dots Below

U+06D2

Letter

RIGHT

YEH_BARREE

0

ے Yeh Barree

U+06D3

Letter

RIGHT

YEH_BARREE

0

ۓ Yeh Barree With Hamza Above

U+06D4

Punctuation

NON_JOINING

null

0

۔ Full Stop

U+06D5

Letter

NON_JOINING

TEH_MARBUTA

0

ە Dotless Teh Marbuta

U+06D6

Mark [Mn]

TRANSPARENT

null

230

ۖ Small High Sad Lam Alef Maksura

U+06D7

Mark [Mn]

TRANSPARENT

null

230

ۗ Small High Qaf Lam Alef Maksura

U+06D8

Mark [Mn]

TRANSPARENT

null

230

ۘ Small High Meem Initial Form

U+06D9

Mark [Mn]

TRANSPARENT

null

230

ۙ Small High Lam Alef

U+06DA

Mark [Mn]

TRANSPARENT

null

230

ۚ Small High Jeem

U+06DB

Mark [Mn]

TRANSPARENT

null

230

ۛ Small High Three Dots

U+06DC

Mark [Mn]

TRANSPARENT

null

230_MCM

ۜ Small High Seen

U+06DD

Other

NON_JOINING

null

0

۝ End Of Ayah

U+06DE

Other

NON_JOINING

null

0

۞ Start Of Rub El Hizb

U+06DF

Mark [Mn]

TRANSPARENT

null

230

۟ Small High Rounded Zero

U+06E0

Mark [Mn]

TRANSPARENT

null

230

۠ Small High Upright Rectangular Zero

U+06E1

Mark [Mn]

TRANSPARENT

null

230

ۡ Small High Dotless Head Of Khah

U+06E2

Mark [Mn]

TRANSPARENT

null

230

ۢ Small High Meem Isolated Form

U+06E3

Mark [Mn]

TRANSPARENT

null

220_MCM

ۣ Small Low Seen

U+06E4

Mark [Mn]

TRANSPARENT

null

230

ۤ Small High Madda

U+06E5

Letter modifier

NON_JOINING

null

0

ۥ Small Waw

U+06E6

Letter modifier

NON_JOINING

null

0

ۦ Small Yeh

U+06E7

Mark [Mn]

TRANSPARENT

null

230_MCM

ۧ Small High Yeh

U+06E8

Mark [Mn]

TRANSPARENT

null

230_MCM

ۨ Small High Noon

U+06E9

Symbol

NON_JOINING

null

0

۩ Place Of Sajdah

U+06EA

Mark [Mn]

TRANSPARENT

null

220

۪ Empty Centre Low Stop

U+06EB

Mark [Mn]

TRANSPARENT

null

230

۫ Empty Centre High Stop

U+06EC

Mark [Mn]

TRANSPARENT

null

230

۬ Rounded High Stop With Filled Centre

U+06ED

Mark [Mn]

TRANSPARENT

null

220

ۭ Small Low Meem

U+06EE

Letter

RIGHT

DAL

0

ۮ Dal With Inverted V Above

U+06EF

Letter

RIGHT

REH

0

ۯ Reh With Inverted V Above

U+06F0

Number

NON_JOINING

null

0

۰ Extended Digit Zero

U+06F1

Number

NON_JOINING

null

0

۱ Extended Digit One

U+06F2

Number

NON_JOINING

null

0

۲ Extended Digit Two

U+06F3

Number

NON_JOINING

null

0

۳ Extended Digit Three

U+06F4

Number

NON_JOINING

null

0

۴ Extended Digit Four

U+06F5

Number

NON_JOINING

null

0

۵ Extended Digit Five

U+06F6

Number

NON_JOINING

null

0

۶ Extended Digit Six

U+06F7

Number

NON_JOINING

null

0

۷ Extended Digit Seven

U+06F8

Number

NON_JOINING

null

0

۸ Extended Digit Eight

U+06F9

Number

NON_JOINING

null

0

۹ Extended Digit Nine

U+06FA

Letter

DUAL

SEEN

0

ۺ Sheen With Dot Below

U+06FB

Letter

DUAL

SAD

0

ۻ Dad With Dot Below

U+06FC

Letter

DUAL

AIN

0

ۼ Ghain With Dot Below

U+06FD

Symbol

NON_JOINING

null

0

۽ Sign Sindhi Ampersand

U+06FE

Symbol

NON_JOINING

null

0

۾ Sign Sindhi Postposition Men

U+06FF

Letter

DUAL

KNOTTED_HEH

0

ۿ Knotted Heh With Inverted V Above

Arabic Supplement character table

Table 3 Arabic Supplement block table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+0750

Letter

DUAL

BEH

0

ݐ Dotless Beh With Horizontal 3 Dots Below

U+0751

Letter

DUAL

BEH

0

ݑ Beh With 3 Dots Above

U+0752

Letter

DUAL

BEH

0

ݒ Dotless Beh With Inverted 3 Dots Below

U+0753

Letter

DUAL

BEH

0

ݓ Dotless Beh With Inverted 3 Dots Below And 2 Dots Above

U+0754

Letter

DUAL

BEH

0

ݔ Dotless Beh With 2 Dots Below And Dot Above

U+0755

Letter

DUAL

BEH

0

ݕ Dotless Beh With Inverted V Below

U+0756

Letter

DUAL

BEH

0

ݖ Dotless Beh With V Above

U+0757

Letter

DUAL

HAH

0

ݗ Hah With 2 Dots Above

U+0758

Letter

DUAL

HAH

0

ݘ Hah With Inverted 3 Dots Below

U+0759

Letter

RIGHT

DAL

0

ݙ Dal With Vertical 2 Dots Below And Tah Above

U+075A

Letter

RIGHT

DAL

0

ݚ Dal With Inverted V Below

U+075B

Letter

RIGHT

REH

0

ݛ Reh With Bar

U+075C

Letter

DUAL

SEEN

0

ݜ Seen With 4 Dots Above

U+075D

Letter

DUAL

AIN

0

ݝ Ain With 2 Dots Above

U+075E

Letter

DUAL

AIN

0

ݞ Ain With Inverted 3 Dots Above

U+075F

Letter

DUAL

AIN

0

ݟ Ain With Vertical 2 Dots Above

U+0760

Letter

DUAL

FEH

0

ݠ Dotless Feh With 2 Dots Below

U+0761

Letter

DUAL

FEH

0

ݡ Dotless Feh With Inverted 3 Dots Below

U+0762

Letter

DUAL

GAF

0

ݢ Keheh With Dot Above

U+0763

Letter

DUAL

GAF

0

ݣ Keheh With 3 Dots Above

U+0764

Letter

DUAL

GAF

0

ݤ Keheh With Inverted 3 Dots Below

U+0765

Letter

DUAL

MEEM

0

ݥ Meem With Dot Above

U+0766

Letter

DUAL

MEEM

0

ݦ Meem With Dot Below

U+0767

Letter

DUAL

NOON

0

ݧ Noon With 2 Dots Below

U+0768

Letter

DUAL

NOON

0

ݨ Noon With Tah Above

U+0769

Letter

DUAL

NOON

0

ݩ Noon With V Above

U+076A

Letter

DUAL

LAM

0

ݪ Lam With Bar

U+076B

Letter

RIGHT

REH

0

ݫ Reh With Vertical 2 Dots Above

U+076C

Letter

RIGHT

REH

0

ݬ Reh With Hamza Above

U+076D

Letter

DUAL

SEEN

0

ݭ Seen With Vertical 2 Dots Above

U+076E

Letter

DUAL

HAH

0

ݮ Hah With Tah Below

U+076F

Letter

DUAL

HAH

0

ݯ Hah With Tah And 2 Dots Below

U+0770

Letter

DUAL

SEEN

0

ݰ Seen With 2 Dots And Tah Above

U+0771

Letter

RIGHT

REH

0

ݱ Reh With 2 Dots And Tah Above

U+0772

Letter

DUAL

HAH

0

ݲ Hah With Tah Above

U+0773

Letter

RIGHT

ALEF

0

ݳ Alef With Digit Two Above

U+0774

Letter

RIGHT

ALEF

0

ݴ Alef With Digit Three Above

U+0775

Letter

DUAL

FARSI_YEH

0

ݵ Farsi Yeh With Digit Two Above

U+0776

Letter

DUAL

FARSI_YEH

0

ݶ Farsi Yeh With Digit Three Above

U+0777

Letter

DUAL

YEH

0

ݷ Dotless Yeh With Digit Four Below

U+0778

Letter

RIGHT

WAW

0

ݸ Waw With Digit Two Above

U+0779

Letter

RIGHT

WAW

0

ݹ Waw With Digit Three Above

U+077A

Letter

DUAL

BURUSHASKI_YEH_BARREE

0

ݺ Burushaski Yeh Barree With Digit Two Above

U+077B

Letter

DUAL

BURUSHASKI_YEH_BARREE

0

ݻ Burushaski Yeh Barree With Digit Three Above

U+077C

Letter

DUAL

HAH

0

ݼ Hah With Digit Four Below

U+077D

Letter

DUAL

SEEN

0

ݽ Seen With Digit Four Above

U+077E

Letter

DUAL

SEEN

0

ݾ Seen With Inverted V Above

U+077F

Letter

DUAL

KAF

0

ݿ Kaf With 2 Dots Above

Arabic Extended-A character table

Table 4 Arabic Extended-A block table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+08A0

Letter

DUAL

BEH

0

ࢠ Dotless Beh With V Below

U+08A1

Letter

DUAL

BEH

0

ࢡ Beh With Hamza Above

U+08A2

Letter

DUAL

HAH

0

ࢢ Hah With Dot Below And 2 Dots Above

U+08A3

Letter

DUAL

TAH

0

ࢣ Tah With 2 Dots Above

U+08A4

Letter

DUAL

FEH

0

ࢤ Dotless Feh With Dot Below And 3 Dots Above

U+08A5

Letter

DUAL

QAF

0

ࢥ Qaf With Dot Below

U+08A6

Letter

DUAL

LAM

0

ࢦ Lam With Double Bar

U+08A7

Letter

DUAL

MEEM

0

ࢧ Meem With 3 Dots Above

U+08A8

Letter

DUAL

YEH

0

ࢨ Yeh With Hamza Above

U+08A9

Letter

DUAL

YEH

0

ࢩ Yeh With Dot Above

U+08AA

Letter

RIGHT

REH

0

ࢪ Reh With Loop

U+08AB

Letter

RIGHT

WAW

0

ࢫ Waw With Dot Within

U+08AC

Letter

RIGHT

ROHINGYA_YEH

0

ࢬ Rohingya Yeh

U+08AD

Letter

NON_JOINING

null

0

ࢭ Low Alef

U+08AE

Letter

RIGHT

DAL

0

ࢮ Dal With 3 Dots Below

U+08AF

Letter

DUAL

SAD

0

ࢯ Sad With 3 Dots Below

U+08B0

Letter

DUAL

GAF

0

ࢰ Keheh With Stroke Below

U+08B1

Letter

RIGHT

STRAIGHT_WAW

0

ࢱ Straight Waw

U+08B2

Letter

RIGHT

REH

0

ࢲ Reh With Dot And Inverted V Above

U+08B3

Letter

DUAL

AIN

0

ࢳ Ain With 3 Dots Below

U+08B4

Letter

DUAL

KAF

0

ࢴ Kaf With Dot Below

U+08B5

Letter

DUAL

QAF

0

ࢵ Qaf With Dot Below

U+08B6

Letter

DUAL

BEH

0

ࢶ Beh With Meem Above

U+08B7

Letter

DUAL

BEH

0

ࢷ Dotless Beh With 3 Dots Below And Meem Above

U+08B8

Letter

DUAL

BEH

0

ࢸ Dotless Beh With Teh Above

U+08B9

Letter

RIGHT

REH

0

ࢹ Reh With Noon Above

U+08BA

Letter

DUAL

YEH

0

ࢺ Yeh With Noon Above

U+08BB

Letter

DUAL

AFRICAN_FEH

0

ࢻ African Feh

U+08BC

Letter

DUAL

AFRICAN_QAF

0

ࢼ African Qaf

U+08BD

Letter

DUAL

AFRICAN_NOON

0

ࢽ African Noon

U+08BE

Letter

DUAL

BEH

0

ࢾ Peh With Small V

U+08BF

Letter

DUAL

BEH

0

ࢿ Teh With Small V

U+08C0

Letter

DUAL

BEH

0

ࣀ Tteh With Small V

U+08C1

Letter

DUAL

HAH

0

ࣁ Tcheh With Small V

U+08C2

Letter

DUAL

GAF

0

ࣂ Keheh With Small V

U+08C3

Letter

DUAL

AIN

0

ࣃ Ghain With 3 Dots Above

U+08C4

Letter

DUAL

AFRICAN_QAF

0

ࣄ African Qaf With 3 Dots Above

U+08C5

Letter

DUAL

HAH

0

ࣅ Jeem With 3 Dots Above

U+08C6

Letter

DUAL

HAH

0

ࣆ Jeem With 3 Dots Below

U+08C7

Letter

DUAL

LAM

0

ࣇ Lam With Small Arabic Tah Above

U+08C8

Letter

DUAL

GAF

0

ࣈ Graf

U+08C9

Letter modifier

TRANSPARENT

null

0

ࣉ Small Farsi Yeh

U+08CA

Mark [Mn]

TRANSPARENT

null

230_MCM

࣊ Small High Farsi Yeh

U+08CB

Mark [Mn]

TRANSPARENT

null

230_MCM

࣋ Small High Yeh Barree With Two Dots Below

U+08CC

Mark [Mn]

TRANSPARENT

null

230

࣌ Small High Word Sah

U+08CD

Mark [Mn]

TRANSPARENT

null

230_MCM

࣍ Small High Zah

U+08CE

Mark [Mn]

TRANSPARENT

null

230_MCM

࣎ Large Round Dot Above

U+08CF

Mark [Mn]

TRANSPARENT

null

220_MCM

࣏ Large Round Dot Below

U+08D0

Mark [Mn]

TRANSPARENT

null

220

࣐ Sukun Below

U+08D1

Mark [Mn]

TRANSPARENT

null

220

࣑ Large Circle Below

U+08D2

Mark [Mn]

TRANSPARENT

null

220

࣒ Large Round Dot Inside Circle Below

U+08D3

Mark [Mn]

TRANSPARENT

null

220_MCM

࣓ Small Low Waw

U+08D4

Mark [Mn]

TRANSPARENT

null

230

ࣔ Small High Word Ar-Rub

U+08D5

Mark [Mn]

TRANSPARENT

null

230

ࣕ Small High Sad

U+08D6

Mark [Mn]

TRANSPARENT

null

230

ࣖ Small High Ain

U+08D7

Mark [Mn]

TRANSPARENT

null

230

ࣗ Small High Qaf

U+08D8

Mark [Mn]

TRANSPARENT

null

230

ࣘ Small High Noon With Kasra

U+08D9

Mark [Mn]

TRANSPARENT

null

230

ࣙ Small Low Noon With Kasra

U+08DA

Mark [Mn]

TRANSPARENT

null

230

ࣚ Small High Word Ath-Thalatha

U+08DB

Mark [Mn]

TRANSPARENT

null

230

ࣛ Small High Word As-Sajda

U+08DC

Mark [Mn]

TRANSPARENT

null

230

ࣜ Small High Word An-Nisf

U+08DD

Mark [Mn]

TRANSPARENT

null

230

ࣝ Small High Word Sakta

U+08DE

Mark [Mn]

TRANSPARENT

null

230

ࣞ Small High Word Qif

U+08DF

Mark [Mn]

TRANSPARENT

null

230

ࣟ Small High Word Waqfa

U+08E0

Mark [Mn]

TRANSPARENT

null

230

࣠ Small High Footnote Marker

U+08E1

Mark [Mn]

TRANSPARENT

null

230

࣡ Small High Sign Safha

U+08E2

Other

NON_JOINING

null

0

࣢ Disputed End Of Ayah

U+08E3

Mark [Mn]

TRANSPARENT

null

220

ࣣ Turned Damma Below

U+08E4

Mark [Mn]

TRANSPARENT

null

230

ࣤ Curly Fatha

U+08E5

Mark [Mn]

TRANSPARENT

null

230

ࣥ Curly Damma

U+08E6

Mark [Mn]

TRANSPARENT

null

220

ࣦ Curly Kasra

U+08E7

Mark [Mn]

TRANSPARENT

null

230

ࣧ Curly Fathatan

U+08E8

Mark [Mn]

TRANSPARENT

null

230

ࣨ Curly Dammatan

U+08E9

Mark [Mn]

TRANSPARENT

null

220

ࣩ Curly Kasratan

U+08EA

Mark [Mn]

TRANSPARENT

null

230

࣪ Tone One Dot Above

U+08EB

Mark [Mn]

TRANSPARENT

null

230

࣫ Tone Two Dots aAove

U+08EC

Mark [Mn]

TRANSPARENT

null

230

࣬ Tone Loop Above

U+08ED

Mark [Mn]

TRANSPARENT

null

220

࣭ Tone One Dot Below

U+08EE

Mark [Mn]

TRANSPARENT

null

220

࣮ Tone Two Dots Below

U+08EF

Mark [Mn]

TRANSPARENT

null

220

࣯ Tone Loop Below

U+08F0

Mark [Mn]

TRANSPARENT

null

27

ࣰ Open Fathatan

U+08F1

Mark [Mn]

TRANSPARENT

null

28

ࣱ Open Dammatan

U+08F2

Mark [Mn]

TRANSPARENT

null

29

ࣲ Open Kasratan

U+08F3

Mark [Mn]

TRANSPARENT

null

230_MCM

ࣳ Small High Waw

U+08F4

Mark [Mn]

TRANSPARENT

null

230

ࣴ Fatha With Ring

U+08F5

Mark [Mn]

TRANSPARENT

null

230

ࣵ Fatha With Dot Above

U+08F6

Mark [Mn]

TRANSPARENT

null

220

ࣶ Kasra With Dot Below

U+08F7

Mark [Mn]

TRANSPARENT

null

230

ࣷ Left Arrowhead Above

U+08F8

Mark [Mn]

TRANSPARENT

null

230

ࣸ Right Arrowhead Above

U+08F9

Mark [Mn]

TRANSPARENT

null

220

ࣹ Left Arrowhead Below

U+08FA

Mark [Mn]

TRANSPARENT

null

220

ࣺ Right Arrowhead Below

U+08FB

Mark [Mn]

TRANSPARENT

null

230

ࣻ Double Right Arrowhead Above

U+08FC

Mark [Mn]

TRANSPARENT

null

230

ࣼ Double Right Arrowhead Above With Dot

U+08FD

Mark [Mn]

TRANSPARENT

null

230

ࣽ Right Arrowhead Above With Dot

U+08FE

Mark [Mn]

TRANSPARENT

null

230

ࣾ Damma With Dot

U+08FF

Mark [Mn]

TRANSPARENT

null

230

ࣿ Mark Sideways Noon Ghunna

Arabic Extended-B character table

Table 5 Arabic Extended-B block table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+0870

Letter

RIGHT

ALEF

0

ࡰ Alef With Attached Fatha

U+0871

Letter

RIGHT

ALEF

0

ࡱ Alef With Attached Top Right Fatha

U+0872

Letter

RIGHT

ALEF

0

ࡲ Alef With Right Middle Stroke

U+0873

Letter

RIGHT

ALEF

0

ࡳ Alef With Left Middle Stroke

U+0874

Letter

RIGHT

ALEF

0

ࡴ Alef With Attached Kasra

U+0875

Letter

RIGHT

ALEF

0

ࡵ Alef With Attached Bottom Right Kasra

U+0876

Letter

RIGHT

ALEF

0

ࡶ Alef With Attached Round Dot Above

U+0877

Letter

RIGHT

ALEF

0

ࡷ Alef With Attached Right Round Dot

U+0878

Letter

RIGHT

ALEF

0

ࡸ Alef With Attached Left Round Dot

U+0879

Letter

RIGHT

ALEF

0

ࡹ Alef With Attached Round Dot Below

U+087A

Letter

RIGHT

ALEF

0

ࡺ Alef With Dot Above

U+087B

Letter

RIGHT

ALEF

0

ࡻ Alef With Attached Top Right Fatha And Dot Above

U+087C

Letter

RIGHT

ALEF

0

ࡼ Alef With Right Middle Stroke And Dot Above

U+087D

Letter

RIGHT

ALEF

0

ࡽ Alef With Attached Bottom Right Kasra And Dot Above

U+087E

Letter

RIGHT

ALEF

0

ࡾ Alef With Attached Top Right Fatha And Left Ring

U+087F

Letter

RIGHT

ALEF

0

ࡿ Alef With Right Middle Stroke And Left Ring

U+0880

Letter

RIGHT

ALEF

0

ࢀ Alef With Attached Bottom Right Kasra And Left Ring

U+0881

Letter

RIGHT

ALEF

0

ࢁ Alef With Attached Right Hamza

U+0882

Letter

RIGHT

ALEF

0

ࢂ Alef With Attached Left Hamza

U+0883

Letter modifier

JOIN_CAUSING

null

0

ࢃ Tatweel With Overstruck Hamza

U+0884

Letter modifier

JOIN_CAUSING

null

0

ࢄ Tatweel With Overstruck Waw

U+0885

Letter modifier

JOIN_CAUSING

null

0

ࢅ Tatweel With Two Dots Below

U+0886

Letter

DUAL

THIN_YEH

0

ࢆ Thin Yeh

U+0887

Letter

NON_JOINING

null

0

ࢇ Baseline Round Dot

U+0888

Symbol

NON_JOINING

null

0

࢈ Raised Round Dot

U+0889

Letter

DUAL

NOON

0

ࢉ Noon With Inverted Small V

U+088A

Letter

DUAL

HAH

0

ࢊ Hah With Inverted Small V Below

U+088B

Letter

DUAL

TAH

0

ࢋ Tah With Dot Below

U+088C

Letter

DUAL

TAH

0

ࢌ Tah With Three Dots Below

U+088D

Letter

DUAL

GAF

0

ࢍ Keheh With Two Dots Vertically Below

U+088E

Letter

RIGHT

VERTICAL_TAIL

0

ࢎ Vertical Tail

U+088F

unassigned

U+0890

Symbol

NON_JOINING

null

0

࢐ Pound Mark Above

U+0891

Symbol

NON_JOINING

null

0

࢑ Piastre Mark Above

U+0892

unassigned

U+0893

unassigned

U+0894

unassigned

U+0895

unassigned

U+0896

unassigned

U+0897

Mark [Mn]

TRANSPARENT

null

230

ࢗ Pepet

U+0898

Mark [Mn]

TRANSPARENT

null

230

࢘ Small High Word Al-Juz

U+0899

Mark [Mn]

TRANSPARENT

null

220

࢙ Small Low Word Ishmaam

U+089A

Mark [Mn]

TRANSPARENT

null

220

࢚ Small Low Word Imaala

U+089B

Mark [Mn]

TRANSPARENT

null

220

࢛ Small Low Word Tasheel

U+089C

Mark [Mn]

TRANSPARENT

null

230

࢜ Madda Waajib

U+089D

Mark [Mn]

TRANSPARENT

null

230

࢝ Superscript Alef Mokhassas

U+089E

Mark [Mn]

TRANSPARENT

null

230

࢞ Doubled Madda

U+089F

Mark [Mn]

TRANSPARENT

null

230

࢟ Half Madda Over Madda

Arabic Extended-C character table

Table 6 Arabic Extended-C block table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+10EC0

unassigned

U+10EC1

unassigned

U+10EC2

Letter

RIGHT

DAL

0

𐻂 Dal With Two Dots Vertically Below

U+10EC3

Letter

DUAL

TAH

0

𐻂 Tah With Two Dots Vertically Below

U+10EC4

Letter

DUAL

KAF

0

𐻂 Kaf With Two Dots Vertically Below

U+10EC5

unassigned

U+10EC6

unassigned

U+10EC7

unassigned

U+10EC8

unassigned

U+10EC9

unassigned

U+10ECA

unassigned

U+10ECB

unassigned

U+10ECC

unassigned

U+10ECD

unassigned

U+10ECE

unassigned

U+10ECF

unassigned

U+10ED0

unassigned

U+10ED1

unassigned

U+10ED2

unassigned

U+10ED3

unassigned

U+10ED4

unassigned

U+10ED5

unassigned

U+10ED6

unassigned

U+10ED7

unassigned

U+10ED8

unassigned

U+10ED9

unassigned

U+10EDA

unassigned

U+10EDB

unassigned

U+10EDC

unassigned

U+10EDD

unassigned

U+10EDE

unassigned

U+10EDF

unassigned

U+10EE0

unassigned

U+10EE1

unassigned

U+10EE2

unassigned

U+10EE3

unassigned

U+10EE4

unassigned

U+10EE5

unassigned

U+10EE6

unassigned

U+10EE7

unassigned

U+10EE8

unassigned

U+10EE9

unassigned

U+10EEA

unassigned

U+10EEB

unassigned

U+10EEC

unassigned

U+10EED

unassigned

U+10EEE

unassigned

U+10EEF

unassigned

U+10EF0

unassigned

U+10EF1

unassigned

U+10EF2

unassigned

U+10EF3

unassigned

U+10EF4

unassigned

U+10EF5

unassigned

U+10EF6

unassigned

U+10EF7

unassigned

U+10EF8

unassigned

U+10EF9

unassigned

U+10EFA

unassigned

U+10EFB

unassigned

U+10EFC

Mark [Mn]

TRANSPARENT

null

0

𐻼 Combining Alef Overlay

U+10EFD

Mark [Mn]

TRANSPARENT

null

220

𐻽 Small Low Word Sakta

U+10EFE

Mark [Mn]

TRANSPARENT

null

220

𐻾 Small Low Word Qasr

U+10EFF

Mark [Mn]

TRANSPARENT

null

220

𐻿 Small Low Word Madda

Rumi Numeral Symbols character table

Table 7 Rumi Numeral Symbols block table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+10E60

Number

NON_JOINING

null

0

𐹠 Digit One

U+10E61

Number

NON_JOINING

null

0

𐹡 Digit Two

U+10E62

Number

NON_JOINING

null

0

𐹢 Digit Three

U+10E63

Number

NON_JOINING

null

0

𐹣 Digit Four

U+10E64

Number

NON_JOINING

null

0

𐹤 Digit Five

U+10E65

Number

NON_JOINING

null

0

𐹥 Digit Six

U+10E66

Number

NON_JOINING

null

0

𐹦 Digit Seven

U+10E67

Number

NON_JOINING

null

0

𐹧 Digit Eight

U+10E68

Number

NON_JOINING

null

0

𐹨 Digit Nine

U+10E69

Number

NON_JOINING

null

0

𐹩 Number Ten

U+10E6A

Number

NON_JOINING

null

0

𐹪 Number Twenty

U+10E6B

Number

NON_JOINING

null

0

𐹫 Number Thirty

U+10E6C

Number

NON_JOINING

null

0

𐹬 Number Forty

U+10E6D

Number

NON_JOINING

null

0

𐹭 Number Fifty

U+10E6E

Number

NON_JOINING

null

0

𐹮 Number Sixty

U+10E6F

Number

NON_JOINING

null

0

𐹯 Number Seventy

U+10E70

Number

NON_JOINING

null

0

𐹰 Number Eighty

U+10E71

Number

NON_JOINING

null

0

𐹱 Number Ninety

U+10E72

Number

NON_JOINING

null

0

𐹲 Number One Hundred

U+10E73

Number

NON_JOINING

null

0

𐹳 Number Two Hundred

U+10E74

Number

NON_JOINING

null

0

𐹴 Number Three Hundred

U+10E75

Number

NON_JOINING

null

0

𐹵 Number Four Hundred

U+10E76

Number

NON_JOINING

null

0

𐹶 Number Five Hundred

U+10E77

Number

NON_JOINING

null

0

𐹷 Number Six Hundred

U+10E78

Number

NON_JOINING

null

0

𐹸 Number Seven Hundred

U+10E79

Number

NON_JOINING

null

0

𐹹 Number Eight Hundred

U+10E7A

Number

NON_JOINING

null

0

𐹺 Number Nine Hundred

U+10E7B

Number

NON_JOINING

null

0

𐹻 Fraction One Half

U+10E7C

Number

NON_JOINING

null

0

𐹼 Fraction One Quarter

U+10E7D

Number

NON_JOINING

null

0

𐹽 Fraction One Third

U+10E7E

Number

NON_JOINING

null

0

𐹾 Fraction Two Thirds

U+10E7F

unassigned

Miscellaneous character table

Other important characters that may be encountered when shaping runs of Arabic text include the dotted-circle placeholder (U+25CC), the combining grapheme joiner (U+034F), the zero-width joiner (U+200D) and zero-width non-joiner (U+200C), the left-to-right text marker (U+200E) and right-to-left text marker (U+200F), and the no-break space (U+00A0).

The dotted-circle placeholder is frequently used when displaying a combining mark in isolation. Real-world text syllables may also use other characters, such as hyphens or dashes, in a similar placeholder fashion; shaping engines should cope with this situation gracefully.

Table 8 Miscellaneous character table

Codepoint

Unicode category

Joining type

Joining group

Mark class

Glyph

U+00A0

Separator

NON_JOINING

null

0

  No-break space

U+034F

Other

NON_JOINING

null

0

͏ Combining grapheme joiner

U+200C

Other

NON_JOINING

null

0

‌ Zero-width non-joiner

U+200D

Other

JOIN_CAUSING

null

0

‍ Zero-width joiner

U+200E

Other

NON_JOINING

null

0

‎ Left-to-Right marker

U+200F

Other

NON_JOINING

null

0

‏ Right-to-Left marker

U+2010

Punctuation

NON_JOINING

null

0

‐ Hyphen

U+2011

Punctuation

NON_JOINING

null

0

‑ No-break hyphen

U+2012

Punctuation

NON_JOINING

null

0

‒ Figure dash

U+2013

Punctuation

NON_JOINING

null

0

– En dash

U+2014

Punctuation

NON_JOINING

null

0

— Em dash

U+25CC

Symbol

NON_JOINING

null

0

◌ Dotted circle

The combining grapheme joiner (CGJ) is primarily used to alter the order in which adjacent marks are positioned during the mark-reordering stage, in order to adhere to the needs of a non-default language orthography.

The zero-width joiner (ZWJ) is primarily used to force the usage of the cursive connecting form of a letter even when the context of the adjoining letters would not trigger the connecting form.

For example, to show the initial form of a letter in isolation (such as for displaying it in a table of forms), the sequence “Letter,ZWJ” would be used. To show the medial form of a letter in isolation, the sequence “ZWJ,Letter,ZWJ” would be used.

The right-to-left mark (RLM) and left-to-right mark (LRM) are used by the Unicode bidirectionality algorithm (BiDi) to indicate the points in a text run at which the writing direction changes.

The no-break space is primarily used to display those codepoints that are defined as non-spacing (such as vowel or diacritical marks and “Hamza”) in an isolated context, as an alternative to displaying them superimposed on the dotted-circle placeholder.