What is การแสดงอีโมจิ?

การเรนเดอร์อักขระด้วย glyph emoji แบบสี มักใช้ Variation Selector 16 (U+FE0F) อักขระบางตัวมีค่าเริ่มต้นเป็นการแสดงผล emoji ส่วนอื่นๆ เป็นการแสดงผลข้อความ

What is ละเว้นได้โดยค่าเริ่มต้น?

อักขระที่ไม่มีผลที่มองเห็นได้และสามารถละเว้นได้โดยกระบวนการที่ไม่รองรับ รวมถึง variation selectors อักขระ zero-width และแท็กภาษา

เว็บและ HTML

ตัวเลือกรูปแบบ

อักขระ (U+FE00–U+FE0F, U+E0100–U+E01EF) ที่เลือกรูปแบบ glyph เฉพาะ VS15 (U+FE0E) = การแสดงผลแบบข้อความ VS16 (U+FE0F) = การแสดงผลแบบ emoji

2024-02-05 · Updated 2025-01-06

What Are Variation Selectors?

Variation selectors are invisible Unicode characters appended immediately after a base character to select among alternate glyph presentations. The base character and its variation selector together form a variation sequence that is defined in the Unicode Standard or in a registered Ideographic Variation Database (IVD).

Unicode allocates 256 variation selectors across several ranges:

VS1–VS16 (U+FE00–U+FE0F): General use, including emoji/text presentation toggle.
VS17–VS256 (U+E0100–U+E01EF): Primarily for CJK ideographic variants.

The most commonly encountered variation selectors are VS15 (U+FE0E, text presentation) and VS16 (U+FE0F, emoji presentation).

Emoji and Text Variation Sequences

VS16 and VS15 form the fundamental mechanism for controlling whether a character renders as a colorful emoji or a monochrome text symbol:

U+2665         ♥  (BLACK HEART SUIT — text or emoji, platform-dependent)
U+2665 U+FE0F  ♥️  (emoji presentation: colorful heart)
U+2665 U+FE0E  ♥  (text presentation: monochrome suit)

U+26A0         ⚠  (WARNING SIGN)
U+26A0 U+FE0F  ⚠️  (emoji warning)
U+26A0 U+FE0E  ⚠  (text warning)

CJK Ideographic Variation Sequences

For Chinese, Japanese, and Korean characters, variation selectors select specific glyph forms from registered IVD collections. Different glyph forms for the same character matter in contexts like personal names, legal documents, and historical texts where the precise stroke form has legal or cultural significance.

U+8FBA         辺  (base character)
U+8FBA U+E0100 辺  (Adobe-Japan1 variant glyph 1)
U+8FBA U+E0101 辺  (Adobe-Japan1 variant glyph 2)

These sequences are registered in the IVD (Ideographic Variation Database) maintained by Unicode.

Working with Variation Selectors in Code

# Variation selectors are invisible — string length can surprise you
base  = "\u2665"           # ♥
vs16  = "\u2665\uFE0F"    # ♥️ (emoji)
vs15  = "\u2665\uFE0E"    # ♥ (text)

len(base)   # 1
len(vs16)   # 2
len(vs15)   # 2

base == vs16  # False — different sequences

# Check for variation selector
def has_variation_selector(s):
    return any(
        "\uFE00" <= c <= "\uFE0F" or "\U000E0100" <= c <= "\U000E01EF"
        for c in s
    )

has_variation_selector("♥️")  # True
has_variation_selector("♥")   # False (no VS)

# Strip all variation selectors
def strip_variation_selectors(s):
    return "".join(
        c for c in s
        if not ("\uFE00" <= c <= "\uFE0F" or "\U000E0100" <= c <= "\U000E01EF")
    )

strip_variation_selectors("⚠️")  # "⚠"

// JavaScript — variation selectors are separate code units
const warning = "\u26A0\uFE0F";  // ⚠️
warning.length;       // 2
[...warning];         // ["⚠", "️"]

// Unicode property escapes can match VS range
const vsPattern = /[\uFE00-\uFE0F]/u;
vsPattern.test(warning[1]);  // true

In HTML and CSS

<!-- Explicit emoji presentation in HTML -->
<span>&#x26A0;&#xFE0F;</span>  <!-- ⚠️ emoji -->
<span>&#x26A0;&#xFE0E;</span>  <!-- ⚠ text -->

/* VS16 in CSS content property */
.warning::before {
  content: "\26A0\FE0F";   /* ⚠️ emoji */
}
.warning-text::before {
  content: "\26A0\FE0E";   /* ⚠ text */
}

Why They Are Invisible

Variation selectors have no visible rendering of their own. They are categorized as format characters (Cf) and non-spacing marks. Text rendering engines consume them silently; databases and search engines may or may not normalize them away, which affects string matching.

Quick Facts

Property	Value
VS1–VS16 range	U+FE00–U+FE0F
VS17–VS256 range	U+E0100–U+E01EF
VS15 (U+FE0E)	Forces text (monochrome) presentation
VS16 (U+FE0F)	Forces emoji (colorful) presentation
Rendering	Invisible — applied to preceding base character
Unicode category	Mn (Non-Spacing Mark) or Cf (Format)
IVD	Ideographic Variation Database for CJK glyph variants

คำศัพท์ที่เกี่ยวข้อง

การแสดงอีโมจิ ละเว้นได้โดยค่าเริ่มต้น

เพิ่มเติมใน เว็บและ HTML

CSS Text Direction

CSS properties (direction, writing-mode, unicode-bidi) controlling text layout direction. Works with Unicode …

HTML เอนทิตี

การแทนค่าอักขระในรูปแบบข้อความใน HTML สามรูปแบบ: ชื่อ (&), ทศนิยม (&), เลขฐานสิบหก (&) จำเป็นสำหรับอักขระที่ขัดแย้งกับไวยากรณ์ HTML

Internationalized Domain Name (IDN)

ชื่อโดเมนที่มีอักขระ Unicode ที่ไม่ใช่ ASCII เก็บไว้ภายในเป็น Punycode (xn--...) แต่แสดงเป็น Unicode ให้ผู้ใช้เห็น ความกังวลด้านความปลอดภัย: การโจมตีแบบ …

JavaScript Intl API

ECMAScript Internationalization API providing locale-aware string comparison (Collator), number formatting (NumberFormat), date …

Punycode

การเข้ารหัสที่เข้ากันได้กับ ASCII สำหรับชื่อโดเมน Unicode แปลงป้ายกำกับที่ถูก internationalize เป็นสตริง ASCII ที่มีคำนำหน้า xn-- münchen.de → …

Unicode in CSS

CSS supports Unicode via escape sequences (\2713 for ✓), the content property …

การอ้างอิงอักขระ XML

เวอร์ชัน XML ของการอ้างอิงอักขระเชิงตัวเลข: ✓ หรือ ✓ XML มีเพียง 5 entity ที่มีชื่อ (& …

การอ้างอิงอักขระที่มีชื่อ

การอ้างอิงอักขระเชิงตัวเลข

การเข้ารหัสเปอร์เซ็นต์ (URL encoding)

การเข้ารหัสอักขระที่ไม่ใช่ ASCII และอักขระที่สงวนไว้ใน URL โดยแทนที่แต่ละไบต์ด้วย %XX ใช้ UTF-8 ก่อน แล้วเข้ารหัส percent แต่ละไบต์: …

← กลับไปยังอภิธานศัพท์