Letterlike Symbols
The Unicode Letterlike Symbols block contains mathematical and technical symbols derived from letters, such as ℃ ℉ ℕ ℤ ℝ ℂ and the Planck constant ℏ. This guide explains each letterlike symbol, its origin, and its use in mathematics, science, and typography.
The Letterlike Symbols block (U+2100--U+214F) contains 80 characters that look like letters but function as independent symbols. These are not mere font variants of ordinary letters -- they are distinct Unicode characters with specific meanings in mathematics, science, commerce, and typography. From the trademark sign ™ to the Planck constant symbol, from the Celsius indicator ℃ to the set of real numbers symbol, this guide catalogs every letterlike symbol in Unicode and explains when and why to use them instead of regular letters.
What Makes a Symbol "Letterlike"?
A letterlike symbol is a character that:
- Derives visually from one or more letters (Latin, Greek, or Hebrew)
- Has a specialized meaning distinct from the letter(s) it resembles
- Cannot be decomposed into its component letters without losing that meaning
For example, ™ (U+2122, TRADE MARK SIGN) looks like a superscript "TM" but is a single character with a specific legal and commercial meaning. The symbol ℃ (U+2103, DEGREE CELSIUS) looks like a degree sign followed by a capital C, but it is encoded as one character because it represents a specific unit of measurement.
Complete Letterlike Symbols Reference
Commercial and Legal Symbols
| Character | Code Point | Name | Origin Letters | Common Use |
|---|---|---|---|---|
| ™ | U+2122 | TRADE MARK SIGN | TM | Unregistered trademark |
| ℠ | U+2120 | SERVICE MARK | SM | Service mark |
| ℗ | U+2117 | SOUND RECORDING COPYRIGHT | P (in circle) | Phonogram copyright |
| ℞ | U+211E | PRESCRIPTION TAKE | Rx | Medical prescriptions |
| ℀ | U+2100 | ACCOUNT OF | a/c | Accounting |
| ℁ | U+2101 | ADDRESSED TO THE SUBJECT | a/s | Correspondence |
| ℅ | U+2105 | CARE OF | c/o | Postal addressing |
| ℆ | U+2106 | CADA UNA | c/u | Spanish "each one" |
The trademark ™ and service mark ℠ are among the most commonly used letterlike symbols. Note that the registered trademark sign ® (U+00AE) lives in the Latin-1 Supplement block, not in Letterlike Symbols, because it was inherited from ISO 8859-1.
Temperature and Measurement
| Character | Code Point | Name | Equivalent |
|---|---|---|---|
| ℃ | U+2103 | DEGREE CELSIUS | ° + C |
| ℉ | U+2104 | DEGREE FAHRENHEIT | ° + F |
These precomposed temperature symbols exist for compatibility with East Asian character sets (particularly CJK standards) where they were encoded as single characters. The Unicode Consortium recommends using the decomposed form (U+00B0 DEGREE SIGN + the letter) in new text, but the precomposed forms remain valid and widely used.
# Both representations are correct
celsius_precomposed = "\u2103" # ℃ (single character)
celsius_decomposed = "\u00B0C" # °C (degree sign + C)
# They look identical but are different strings
print(celsius_precomposed == celsius_decomposed) # False
print(len(celsius_precomposed)) # 1
print(len(celsius_decomposed)) # 2
Number and Reference Symbols
| Character | Code Point | Name | Origin | Use |
|---|---|---|---|---|
| № | U+2116 | NUMERO SIGN | N + o | Item numbering (European) |
| ℓ | U+2113 | SCRIPT SMALL L | l | Liter (typography) |
| ℔ | U+2114 | L B BAR SYMBOL | lb | Pound (weight) |
The numero sign № is standard in Russian, German, and many European languages as an abbreviation for "number." It is commonly used where English would write "No." or "#".
Mathematical Number Set Symbols
| Character | Code Point | Name | Represents | Definition |
|---|---|---|---|---|
| ℕ | U+2115 | DOUBLE-STRUCK CAPITAL N | Natural numbers | {0, 1, 2, 3, ...} or |
| ℤ | U+2124 | DOUBLE-STRUCK CAPITAL Z | Integers | {..., -2, -1, 0, 1, 2, ...} |
| ℚ | U+211A | DOUBLE-STRUCK CAPITAL Q | Rationals | p/q where p, q integers |
| ℝ | U+211D | DOUBLE-STRUCK CAPITAL R | Real numbers | Complete ordered field |
| ℂ | U+2102 | DOUBLE-STRUCK CAPITAL C | Complex numbers | a + bi |
| ℙ | U+2119 | DOUBLE-STRUCK CAPITAL P | Primes | {2, 3, 5, 7, 11, ...} |
| ℍ | U+210D | DOUBLE-STRUCK CAPITAL H | Quaternions | Hamilton's extension of complex numbers |
These "double-struck" (also called "blackboard bold") characters are among the most important letterlike symbols in mathematics. In handwriting, mathematicians draw a double vertical stroke through capital letters to distinguish them from ordinary variables. The Unicode characters encode this convention as distinct code points.
Physics Constants and Symbols
| Character | Code Point | Name | Represents |
|---|---|---|---|
| ℏ | U+210F | PLANCK CONSTANT OVER TWO PI | h-bar (reduced Planck constant) |
| ℇ | U+2107 | EULER CONSTANT | Euler's number (sometimes) |
| ℎ | U+210E | PLANCK CONSTANT | Planck constant |
| Å | U+212B | ANGSTROM SIGN | Unit of length (10^-10 m) |
| K | U+212A | KELVIN SIGN | Temperature unit |
The Planck constant ℎ (U+210E) and reduced Planck constant ℏ (U+210F) are fundamental in quantum mechanics. Note that the angstrom sign Å (U+212B) is canonically equivalent to the Latin capital letter A with ring above (U+00C5) -- they are treated as the same character under Unicode normalization.
Script Letter Symbols
| Character | Code Point | Name | Common Use |
|---|---|---|---|
| ℊ | U+210A | SCRIPT SMALL G | Mathematics |
| ℋ | U+210B | SCRIPT CAPITAL H | Hamiltonian, Hilbert space |
| ℌ | U+210C | BLACK-LETTER CAPITAL H | Mathematical contexts |
| ℐ | U+2110 | SCRIPT CAPITAL I | Mathematics |
| ℑ | U+2111 | BLACK-LETTER CAPITAL I | Imaginary part of complex number |
| ℒ | U+2112 | SCRIPT CAPITAL L | Laplace transform, Lagrangian |
| ℛ | U+211B | SCRIPT CAPITAL R | Mathematics |
| ℜ | U+211C | BLACK-LETTER CAPITAL R | Real part of complex number |
| ℬ | U+212C | SCRIPT CAPITAL B | Bernoulli number |
| ℯ | U+212F | SCRIPT SMALL E | Euler's number (e) |
| ℰ | U+2130 | SCRIPT CAPITAL E | Expectation, energy |
| ℱ | U+2131 | SCRIPT CAPITAL F | Fourier transform |
| ℳ | U+2133 | SCRIPT CAPITAL M | M-theory, manifold |
| ℴ | U+2134 | SCRIPT SMALL O | Mathematics |
These script and black-letter (Fraktur) variants carry specific mathematical meanings. The black-letter I (ℑ) for imaginary part and black-letter R (ℜ) for real part of a complex number are standard notation in engineering and physics.
Information and Telephony
| Character | Code Point | Name | Use |
|---|---|---|---|
| ℡ | U+2121 | TELEPHONE SIGN | Contact information |
| ℻ | U+213B | FACSIMILE SIGN | Fax numbers |
| ℹ | U+2139 | INFORMATION SOURCE | Information points, help |
The information symbol ℹ (U+2139) often renders as an emoji on modern platforms when followed by U+FE0F (variation selector).
Turned and Rotated Letters
| Character | Code Point | Name | Use |
|---|---|---|---|
| ℊ | U+210A | SCRIPT SMALL G | Mathematical notation |
| ℸ | U+2138 | DALET SYMBOL | Mathematical cardinal (from Hebrew) |
| ℷ | U+2137 | GIMEL SYMBOL | Mathematical cardinal |
| ℶ | U+2136 | BET SYMBOL | Mathematical cardinal |
| ℵ | U+2135 | ALEF SYMBOL | Aleph -- cardinality of infinite sets |
The Hebrew-derived symbols ℵ (aleph), ℶ (bet), ℷ (gimel), and ℸ (dalet) are used in set theory to denote transfinite cardinal numbers. ℵ₀ (aleph-null) represents the cardinality of the natural numbers -- the smallest infinity.
Letterlike Symbols vs. Regular Letters
A common source of confusion: should you use ℃ or just type degree-C? Should you use ℕ or bold N?
| Scenario | Recommendation |
|---|---|
| Academic math paper (LaTeX) | Use LaTeX commands: \mathbb{R}, \hbar |
| Plain text (email, chat) | Letterlike symbols ensure correct rendering |
| HTML / web content | Either works; letterlike symbols are more semantic |
| East Asian CJK contexts | Precomposed forms (℃, №) are conventional |
| Programming variable names | Use regular letters (Unicode identifiers vary by language) |
Normalization Caveat
Some letterlike symbols have canonical decompositions under Unicode normalization:
import unicodedata
# Angstrom sign decomposes to A + combining ring above
angstrom = "\u212B"
normalized = unicodedata.normalize("NFC", angstrom)
print(f"Original: U+{ord(angstrom):04X}")
print(f"NFC: U+{ord(normalized):04X}")
# Original: U+212B
# NFC: U+00C5 (Latin Capital Letter A with Ring Above)
This means that after NFC normalization, the dedicated Angstrom sign becomes indistinguishable from the Swedish letter. Similarly, the Kelvin sign (U+212A) normalizes to ordinary K (U+004B). Applications that perform string comparison should always normalize first.
Using Letterlike Symbols in Code
HTML Entities
<p>Temperature: 20℃ (68℄)</p>
<p>Product™ is a registered brand</p>
<p>Invoice № 2024-001</p>
<p>ℕ ⊂ ℤ ⊂ ℚ ⊂ ℝ ⊂ ℂ</p>
Python
# Math number sets
SETS = {
"naturals": "\u2115", # N
"integers": "\u2124", # Z
"rationals": "\u211A", # Q
"reals": "\u211D", # R
"complex": "\u2102", # C
}
# Display the number set hierarchy
chain = " \u2282 ".join(SETS.values())
print(chain)
# ℕ ⊂ ℤ ⊂ ℚ ⊂ ℝ ⊂ ℂ
JavaScript
// Temperature conversion display
function formatTemp(celsius) {
const f = celsius * 9 / 5 + 32;
return `${celsius}\u2103 = ${f.toFixed(1)}\u2104`;
}
console.log(formatTemp(100));
// 100℃ = 212.0℉
Font Support
| Symbol Group | Support Level | Notes |
|---|---|---|
| ™ ℠ ℗ | Excellent | In virtually all fonts |
| ℃ ℉ | Excellent | CJK fonts especially |
| № ℓ | Good | Most system fonts |
| ℕ ℤ ℚ ℝ ℂ | Good | Math-capable fonts |
| ℏ ℎ | Good | Scientific fonts |
| Script letters (ℋ ℒ ℱ) | Moderate | May require math fonts |
| Hebrew math (ℵ ℶ ℷ ℸ) | Moderate | Math or symbol fonts |
For web projects needing guaranteed rendering of mathematical letterlike symbols, include a math-capable web font like STIX Two Math or Latin Modern Math.
The Letterlike Symbols block is a compact but essential collection of characters that bridge the gap between ordinary letters and specialized notation. These 80 characters serve mathematicians, scientists, lawyers, typographers, and everyday users who need precise symbolic representation that a plain letter simply cannot provide.
Thêm trong Symbol Reference
Unicode contains hundreds of arrow symbols spanning simple directional arrows, double arrows, …
Unicode provides multiple check mark and tick symbols ranging from the classic …
Unicode includes a rich collection of star shapes — from the simple …
Unicode contains dozens of heart symbols including the classic ♥, black and …
Unicode's Currency Symbols block and surrounding areas contain dedicated characters for over …
Unicode has dedicated blocks for mathematical operators, arrows, letterlike symbols, and alphanumeric …
Beyond the ASCII parentheses and square brackets, Unicode includes angle brackets, curly …
Unicode offers a wide variety of bullet point characters beyond the standard …
Unicode's Box Drawing block contains 128 characters for drawing lines, corners, intersections, …
Unicode includes musical note symbols such as ♩♪♫♬ in the Miscellaneous Symbols …
Unicode includes precomposed fraction characters for common fractions like ½ ¼ ¾ …
Unicode provides precomposed superscript and subscript digits and letters — such as …
Unicode contains dozens of circle symbols including filled circles, outlined circles, circles …
Unicode includes filled squares, outlined squares, small squares, medium squares, dashed squares, …
Unicode provides a comprehensive set of triangle symbols in all orientations — …
Unicode includes filled and outline diamond shapes, lozenge characters, and playing card …
Unicode provides various cross and X mark characters including the heavy ballot …
The hyphen-minus on your keyboard is just one of Unicode's many dash …
Unicode defines typographic quotation marks — curly quotes — for dozens of …
Unicode includes dedicated characters for the copyright symbol ©, registered trademark ®, …
The degree symbol ° (U+00B0) and dedicated Celsius ℃ and Fahrenheit ℉ …
Unicode's Enclosed Alphanumerics block provides circled numbers ①②③, parenthesized numbers ⑴⑵⑶, and …
Unicode includes a Number Forms block with precomposed Roman numeral characters such …
Greek letters like α β γ δ π Σ Ω are widely …
The Unicode Dingbats block (U+2700–U+27BF) contains 192 decorative symbols originally from the …
Unicode includes a Playing Cards block with characters for all 52 standard …
Unicode provides characters for all six chess piece types in both white …
Unicode's Miscellaneous Symbols block includes the 12 zodiac signs ♈♉♊♋♌♍♎♏♐♑♒♓, planetary symbols, …
Unicode's Braille Patterns block (U+2800–U+28FF) encodes all 256 possible combinations of the …
Unicode's Geometric Shapes block contains 96 characters covering circles, squares, triangles, diamonds, …
Unicode's Miscellaneous Technical block contains symbols from computing, electronics, and engineering, including …
Diacritics are accent marks and other marks that attach to letters to …
Unicode defines dozens of invisible characters beyond the ordinary space, including zero-width …
Unicode includes warning and hazard symbols such as the universal caution ⚠ …
Unicode's Miscellaneous Symbols block includes sun ☀, cloud ☁, rain ☂, snow …
Unicode includes symbols for many of the world's major religions including the …
Unicode includes the traditional male ♂ and female ♀ symbols from astronomy, …
Apple's macOS uses Unicode characters for keyboard modifier keys such as ⌘ …
Unicode symbols like ▶ ◀ ► ★ ✦ ⚡ ✈ and hundreds …