Arabic Unicode — Script Block U+0600–U+06FF
Arabic is the world's 5th most spoken language with over 420 million native speakers, and one of the six official languages of the United Nations. In Unicode, Arabic script is encoded in the Arabic block spanning U+0600 to U+06FF — a range of 256 code points covering the 28 core letters, vowel marks (fathah, kasrah, dammah), and Arabic numerals (٠١٢٣٤٥٦٧٨٩).
Arabic is a right-to-left (RTL) script — text flows from right to left. In Unicode
and HTML, this is handled by the Unicode Bidirectional Algorithm (UBA) and the HTML
dir="rtl" attribute. All major platforms — Instagram, WhatsApp, Twitter/X, Google Docs,
and Microsoft Word — support Arabic Unicode text natively.
Additional Arabic character blocks include: Arabic Supplement (U+0750–U+077F) for extended letters, Arabic Extended-A (U+08A0–U+08FF) for additional diacritics, and Arabic Presentation Forms (U+FB50–U+FDFF) for ligature forms used in some rendering contexts.