Unicode 標準

Unicode コンソーシアム

Unicode標準を開発・維持する非営利団体。Apple・Google・Microsoft・Metaなど多くの企業が会員です。

2021-05-10 · Updated 2024-06-07

What is the Unicode Consortium?

The Unicode Consortium is the non-profit organization responsible for developing, maintaining, and publishing the Unicode Standard — the universal character encoding used by virtually all modern software. Founded in 1991 in San Jose, California, the Consortium brings together major technology companies, academic institutions, and individual experts to ensure that every writing system in the world is representable in digital text.

Without the Unicode Consortium, every software vendor would maintain its own incompatible encoding, and moving text between systems would require constant conversion. The Consortium eliminated that chaos by creating and stewarding a single, open, royalty-free standard.

Membership and Governance

The Consortium operates on a tiered membership model. Full members (typically large technology corporations) have voting rights on the Unicode Standard itself. Associate members participate in working groups. Supporting members fund the work without voting rights.

Notable full members include: - Apple, Google, Microsoft, IBM, Adobe, SAP - Emojipedia (now Zedge), Netflix, Meta - Government bodies and national standards organizations

The Board of Directors governs the Consortium. Technical decisions are made by the Unicode Technical Committee (UTC), which meets quarterly to review character encoding proposals, emoji submissions, and algorithm changes. Meetings are open for public observation via teleconference.

What the Consortium Produces

Output	Description
Unicode Standard	The core specification (published as a book and online)
Unicode Character Database (UCD)	Machine-readable property files for all characters
Common Locale Data Repository (CLDR)	Locale data: date formats, currency names, pluralization rules
Unicode Collation Algorithm (UCA)	Specification for language-aware text sorting
Unicode Technical Standards (UTS)	Extensions to the Standard (e.g., UTS#18: Regular Expressions)
Unicode Technical Reports (UTR)	Informative documents on Unicode-related topics
Emoji specifications	Official list, names, properties, and ordering of emoji

The Emoji Process

Emoji are now one of the most visible outputs of the Consortium. Submitting a new emoji requires a formal proposal following documented criteria: uniqueness, anticipated usage, visual distinctiveness, and compatibility. The UTC evaluates proposals at its quarterly meetings; accepted emoji typically appear in a Unicode release 18–24 months after initial acceptance.

Relationship to ISO

The Unicode Consortium co-maintains the character repertoire with ISO/IEC JTC 1/SC 2, which publishes ISO/IEC 10646. The two standards are synchronized at the code point level — every character in Unicode has the same code point in ISO 10646, and vice versa. The standards diverge only in their supplemental specifications (Unicode adds algorithms and properties that 10646 does not).

Contributing to Unicode

Anyone can contribute:

Propose a character: Submit a formal proposal following the Unicode proposal template
Report errors: File bugs in the Unicode issue tracker
Participate in working groups: The Consortium has groups for emoji, CLDR, security, and more
Attend UTC meetings: Public observers are welcome

Quick Facts

Property	Value
Founded	January 3, 1991
Headquarters	Mountain View, California, USA
Legal status	501(c)(3) non-profit corporation
Key output	Unicode Standard
Meeting frequency	UTC meets quarterly
Co-standard	ISO/IEC 10646
Website	unicode.org
Current president	Mark Davis (co-founder)

Unicode 標準のその他の用語

CJK（漢字・かな・ハングル）

中国語・日本語・韓国語 — Unicodeにおける統合漢字ブロックと関連スクリプトをまとめた総称。CJK統合漢字は20,992文字以上を含みます。

Han Unification

The process of mapping Chinese, Japanese, and Korean ideographs that share a …

Hangul Jamo

The individual consonant and vowel components (jamo) of the Korean Hangul writing …

ISO 10646 / 万国文字集合

Unicodeと同期している国際標準（ISO/IEC 10646）で、同じ文字目録とコードポイントを定義しますが、Unicodeの追加アルゴリズムやプロパティは含みません。

Unicode

あらゆる文字システムのすべての文字に固有の番号（コードポイント）を割り当てる普遍的文字エンコーディング規格。バージョン16.0には154,998個の割り当て済み文字が含まれます。

Unicode Standard Annex (UAX)

Normative or informative documents that are integral parts of the Unicode Standard. …

Unicode Technical Report (UTR)

Informational documents published by the Unicode Consortium covering specific topics like security …

Unicode スカラー値

サロゲートコードポイント（U+D800〜U+DFFF）を除くすべてのコードポイント。実際の文字を表すことができる有効な値の集合で、合計1,112,064個です。

Unicode バージョン

新しい文字・文字体系・機能を追加するUnicode標準の主要リリース。現在のバージョンはUnicode 16.0（2025年9月）です。

Unicode 安定性ポリシー

一度割り当てられた文字のコードポイントと名前は絶対に変更されないことを保証するポリシー。プロパティは改訂される場合がありますが、割り当ては永続的です。

← 用語集へ