Georgian Script
Georgian has three distinct historical scripts — Mkhedruli, Asomtavruli, and Nuskhuri — all encoded in Unicode, with modern Georgian using Mkhedruli for everyday text and the others reserved for ecclesiastical purposes. This guide tells the story of Georgian script, explores its Unicode blocks, and explains how the three scripts relate to each other.
Georgian script is one of only 14 scripts in the world that are used by a single country's official language, making it inseparable from Georgian national identity. What makes Georgian truly remarkable among the world's writing systems is that it has three distinct script forms — Asomtavruli, Nuskhuri, and Mkhedruli — all encoded separately in Unicode. This guide explores the history, structure, and Unicode encoding of the Georgian writing system, along with practical advice for developers and designers.
A Three-Script Heritage
Georgian is unique in having three historical script forms that developed sequentially over 1,600 years. All three are still used today in specific contexts:
| Script | Georgian Name | Period | Unicode Block | Primary Use Today |
|---|---|---|---|---|
| Asomtavruli | ასომთავრული | ~5th century CE | Georgian Supplement (U+2D00–U+2D2F) | Church inscriptions, decorative |
| Nuskhuri | ნუსხური | ~9th century CE | Georgian Supplement (U+2D00–U+2D2F) | Religious manuscripts |
| Mkhedruli | მხედრული | ~10th century CE | Georgian (U+10A0–U+10FF) | Modern everyday use |
In 2016, UNESCO inscribed the three Georgian scripts on the Representative List of the Intangible Cultural Heritage of Humanity, recognizing their coexistence as a living cultural phenomenon.
Asomtavruli (Capital Letters)
Asomtavruli (from aso "letter" + mtavari "head/capital") is the oldest Georgian script. Its earliest known inscriptions date to the 5th century CE, found in churches in Bethlehem and Bolnisi (Georgia). The script has a monumental, rounded appearance similar to Greek uncials.
Asomtavruli was the primary script for all purposes until roughly the 9th century. Today it appears in church inscriptions, decorative contexts, and as capital letters in the Khutsuri ecclesiastical pair.
Nuskhuri (Minuscule Letters)
Nuskhuri (from nuskha "copy/inventory") developed as a faster, more cursive form for manuscript copying. It is smaller and more angular than Asomtavruli. Together, Asomtavruli (uppercase) and Nuskhuri (lowercase) form the Khutsuri ("priestly") bicameral pair, still used in Georgian Orthodox Church publications.
Mkhedruli (Military/Secular Script)
Mkhedruli (from mkhedari "warrior/horseman") became the dominant secular script by the 11th century and is the script used for modern Georgian. Uniquely, Mkhedruli is unicameral — it has no uppercase/lowercase distinction in traditional usage. All 38 letters have a single form.
However, in 2019, the Georgian government and Unicode Consortium introduced Mtavruli (Georgian capitals) to support an uppercase/lowercase distinction for Mkhedruli. This enables modern typographic conventions like title case.
The Georgian Alphabet
38 Modern Letters
Modern Georgian (Mkhedruli) uses 38 letters — 33 from the original set plus 5 additional letters. However, 5 of the original letters are now obsolete, leaving 33 active letters in contemporary use:
| Category | Count | Description |
|---|---|---|
| Vowels | 5 | ა ე ი ო უ |
| Consonants | 28 | Active consonants in modern Georgian |
| Obsolete | 5 | Letters no longer used (ჱ ჲ ჳ ჴ ჵ) |
Georgian writing is almost perfectly phonemic — each letter represents exactly one sound, and each sound is represented by exactly one letter. There are no silent letters, no digraphs, and no ambiguous readings. This makes Georgian one of the most phonetically transparent writing systems in the world.
Notable Phonological Features
Georgian has several sounds that are rare or unique among the world's languages, each with its own letter:
| Letter | Sound | IPA | Description |
|---|---|---|---|
| ყ | ejective k | /qʼ/ | Uvular ejective |
| წ | ejective ts | /tsʼ/ | Dental ejective affricate |
| ჭ | ejective ch | /tʃʼ/ | Postalveolar ejective |
| ძ | dz | /dz/ | Voiced affricate |
| ხ | kh | /x/ | Voiceless velar fricative |
The ejective consonants (produced with a glottalic airstream) are a hallmark of Kartvelian languages and required dedicated letters in Mashtots's original design — borrowed scripts like Greek or Latin had no way to represent them.
Georgian in Unicode
Main Georgian Block (U+10A0–U+10FF)
| Range | Content | Count |
|---|---|---|
| U+10A0–U+10C5 | Mkhedruli uppercase (Mtavruli) | 38 |
| U+10C7 | Mtavruli letter YN | 1 |
| U+10CD | Mtavruli letter AEN | 1 |
| U+10D0–U+10FA | Mkhedruli lowercase | 43 |
| U+10FB | Georgian paragraph separator ჻ | 1 |
| U+10FC | Georgian letter modifier (Nar) | 1 |
Georgian Supplement (U+2D00–U+2D2F)
This block contains Nuskhuri (Khutsuri lowercase) letters:
| Range | Content | Count |
|---|---|---|
| U+2D00–U+2D25 | Nuskhuri letters | 38 |
| U+2D27 | Nuskhuri letter YN | 1 |
| U+2D2D | Nuskhuri letter AEN | 1 |
Georgian Extended (U+1C90–U+1CBF)
Added in Unicode 11.0 (2018), this block contains Mtavruli uppercase letters for Mkhedruli, completing the bicameral pair:
| Range | Content | Count |
|---|---|---|
| U+1C90–U+1CBA | Mtavruli capital letters | 43 |
| U+1CBD–U+1CBF | Mtavruli AEN, hard sign, labial sign | 3 |
Case Mapping
Before Unicode 11.0, Mkhedruli was unicameral — toUpperCase() on Georgian text was a
no-op. With Mtavruli, case mapping now works:
# Modern Python with Unicode 11.0+ support
text = "საქართველო" # "Georgia" in Georgian
print(text.upper()) # ᲡᲐᲥᲐᲠᲗᲕᲔᲚᲝ (Mtavruli)
print(text.lower()) # საქართველო (Mkhedruli)
print(text.title()) # Საქართველო
// JavaScript (modern engines)
const text = "საქართველო";
console.log(text.toUpperCase()); // ᲡᲐᲥᲐᲠᲗᲕᲔᲚᲝ
console.log(text.toLowerCase()); // საქართველო
Note that older fonts and rendering engines may not support Mtavruli. Fallback rendering typically displays the lowercase (Mkhedruli) form.
Georgian Typography and Digital Challenges
No Spaces Between Words (Historical)
Old Georgian texts (Asomtavruli) were written in scriptio continua — without word spaces. Modern Georgian uses spaces between words, following the convention adopted during the medieval period. However, word boundaries in historical texts require specialized knowledge to identify.
Line Breaking
Georgian follows relatively simple line-breaking rules. Words can be broken at syllable boundaries, and Georgian syllable structure (typically CV or CVC) makes hyphenation straightforward. The Unicode Line Breaking Algorithm classifies Georgian letters as class AL (Alphabetic), so standard algorithms produce acceptable results.
Font Recommendations
| Font | Type | Platform |
|---|---|---|
| Noto Sans Georgian | Sans-serif | Cross-platform |
| Noto Serif Georgian | Serif | Cross-platform |
| BPG fonts (various) | Various | Windows/Linux |
| DejaVu Sans | Sans-serif | Linux |
| Sylfaen | Serif | Windows |
Collation
Georgian alphabetical order has been stable since the script's creation. The Unicode DUCET handles Georgian sorting correctly. The five obsolete letters sort in their historical positions within the alphabet.
Detecting Georgian Script in Code
import unicodedata
def is_georgian(ch):
name = unicodedata.name(ch, "")
return "GEORGIAN" in name
def script_of(ch):
cp = ord(ch)
if 0x10A0 <= cp <= 0x10FF:
return "Mkhedruli/Mtavruli"
elif 0x2D00 <= cp <= 0x2D2F:
return "Nuskhuri"
elif 0x1C90 <= cp <= 0x1CBF:
return "Mtavruli (Extended)"
return "Not Georgian"
# Check text
for ch in "საქართველო":
print(f"{ch}: U+{ord(ch):04X} — {script_of(ch)}")
// JavaScript: Georgian detection with Unicode property escapes
const georgianRegex = /\p{Script=Georgian}/u;
console.log(georgianRegex.test("თ")); // true
console.log(georgianRegex.test("A")); // false
Key Takeaways
- Georgian has three script forms — Asomtavruli (5th c.), Nuskhuri (9th c.), and Mkhedruli (10th c.) — all encoded separately in Unicode and recognized as UNESCO Intangible Cultural Heritage.
- Modern Georgian uses 33 active letters from the Mkhedruli script, which was historically unicameral (no case distinction) until Mtavruli capitals were added in Unicode 11.0 (2018).
- Georgian is perfectly phonemic — each letter maps to exactly one sound with no digraphs, silent letters, or ambiguity, making it one of the world's most transparent writing systems.
- Georgian occupies three Unicode blocks: Georgian (U+10A0–U+10FF), Georgian Supplement (U+2D00–U+2D2F), and Georgian Extended (U+1C90–U+1CBF).
- The addition of Mtavruli (Georgian Extended block) in 2018 enabled
toUpperCase()andtoLowerCase()to work on Georgian text — older systems may not support this. - Georgian ejective consonants (ყ, წ, ჭ) are typologically rare sounds that required dedicated letters and could not be represented in borrowed scripts like Greek or Latin.
Thêm trong Script Stories
Arabic is the third most widely used writing system in the world, …
Devanagari is an abugida script used to write Hindi, Sanskrit, Marathi, and …
Greek is one of the oldest alphabetic writing systems and gave Unicode …
Cyrillic is used to write Russian, Ukrainian, Bulgarian, Serbian, and over 50 …
Hebrew is an abjad script written right-to-left, used for Biblical Hebrew, Modern …
Thai is an abugida script with no spaces between words, complex vowel …
Japanese is unique in using three scripts simultaneously — Hiragana, Katakana, and …
Hangul was invented in 1443 by King Sejong as a scientific alphabet …
Bengali is an abugida script with over 300 million speakers, used for …
Tamil is one of the oldest living writing systems, with a literary …
The Armenian alphabet was created in 405 AD by the monk Mesrop …
The Ethiopic script (Ge'ez) is an abugida used to write Amharic, Tigrinya, …
Unicode encodes dozens of historic and extinct scripts — from Cuneiform and …
There are hundreds of writing systems in use around the world today, …