Unicode Input Method
Any method of entering a character by its Unicode code point: hex input (Mac), U+XXXX entry via Ctrl+Shift+U (Linux), and Alt+X (Windows applications).
What is a Unicode Input Method?
A Unicode input method is any technique, tool, or system that allows a user to enter a Unicode character that is not directly accessible from their physical keyboard. Because Unicode 16.0 defines 154,998 characters across 168 scripts while keyboards typically have 100–120 keys, the gap between "characters you can type" and "characters that exist" is enormous. Unicode input methods bridge that gap.
The major categories of Unicode input methods are: IME (Input Method Editor), dead keys, Compose key sequences, OS character pickers, hex code entry, copy-paste from reference tools, and entity codes in markup languages.
The Landscape of Unicode Input Methods
| Method | Best For | Platform | Skill Level |
|---|---|---|---|
| IME | CJK characters (50K+ ideographs) | All | Intermediate |
| Dead keys | Latin diacritics (accented characters) | All (built into layouts) | Beginner |
| Compose key | Broad special characters via mnemonics | Linux, Windows (WinCompose) | Intermediate |
| Option key (macOS) | Diacritics and common symbols | macOS | Beginner |
| Alt codes (Windows) | Code-page characters | Windows | Intermediate |
| Hex input | Any Unicode character by code point | All (method varies) | Advanced |
| Character picker/map | Discovery and occasional use | All (OS built-in) | Beginner |
| Copy-paste from reference | Rare/unusual characters | Any browser | Beginner |
| HTML/CSS entities | Web content development | Code editors | Intermediate |
| Emoji keyboard | Emoji and pictographs | Mobile/desktop | Beginner |
Choosing the Right Method
For diacritics (accented Latin): Use your keyboard layout's dead keys (European keyboards) or Option key combos (macOS). These are optimized for exactly this purpose and require minimal setup.
For CJK characters: Use a dedicated IME. There is no shortcut — Japanese, Chinese, and Korean input requires a proper phonetic or structural IME.
For occasional symbols (copyright, trademark, arrows, mathematical): Use the OS character picker (Win+. on Windows, Ctrl+Cmd+Space on macOS) or memorize a handful of keyboard shortcuts for your most-used characters.
For technical/programming work: Learn hex input methods. Being able to type U+2014 and convert it to — is faster than reaching for a mouse.
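As a rough illustration of what hex input does conceptually, the sketch below parses a code point written as U+XXXX and returns the corresponding character. The function name `from_code_point_notation` is hypothetical and only serves this example; it is not how any particular OS implements hex entry.

```python
def from_code_point_notation(text: str) -> str:
    """Convert 'U+2014' (or bare hex such as '2014') to its character."""
    digits = text.strip()
    # Drop an optional 'U+' / 'u+' prefix
    if digits[:2].upper() == "U+":
        digits = digits[2:]
    code_point = int(digits, 16)
    # chr() raises ValueError for values outside 0..0x10FFFF
    return chr(code_point)


# U+2014 is the em dash used in the paragraph above
print(from_code_point_notation("U+2014") == "\u2014")  # True
```

The same conversion applies to code points beyond the Basic Multilingual Plane, for example `from_code_point_notation("1F600")`.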
For web/HTML development: Use named HTML entities (&mdash;, &hellip;, &nbsp;) for clarity, or numeric hex entities (&#x2014;) for any character.
HTML and CSS Entity Methods
<!-- Named entities (most readable) -->
&mdash; <!-- — em dash -->
&ndash; <!-- – en dash -->
&hellip; <!-- … ellipsis -->
&nbsp; <!-- non-breaking space -->
&copy; <!-- © copyright -->
&reg; <!-- ® registered -->
&trade; <!-- ™ trademark -->
<!-- Decimal numeric entities -->
&#8212; <!-- — em dash -->
&#8230; <!-- … ellipsis -->
<!-- Hexadecimal numeric entities (preferred in modern HTML) -->
&#x2014; <!-- — em dash -->
&#x2026; <!-- … ellipsis -->
&#x1F600; <!-- 😀 emoji -->
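/* CSS unicode escapes */
/* The three declarations are alternatives: within one rule, only the last content value applies. */
.icon::before {
  content: "\2022"; /* • bullet */
  content: "\2192"; /* → right arrow */
  content: "\1F600"; /* 😀 emoji */
}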
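All of these entity and escape forms are derived from the character's code point. The following Python sketch generates them for a given character and checks the HTML forms with the standard library's `html.unescape`. The helper names are hypothetical, chosen for this example only.

```python
import html


def html_hex_entity(ch: str) -> str:
    """Hexadecimal numeric character reference, e.g. &#x2014;"""
    return f"&#x{ord(ch):X};"


def html_decimal_entity(ch: str) -> str:
    """Decimal numeric character reference, e.g. &#8212;"""
    return f"&#{ord(ch)};"


def css_escape(ch: str) -> str:
    """CSS escape: a backslash followed by the hex code point."""
    return f"\\{ord(ch):X}"


em_dash = "\u2014"
print(html_hex_entity(em_dash))      # &#x2014;
print(html_decimal_entity(em_dash))  # &#8212;
print(css_escape("\U0001F600"))      # \1F600

# Named and numeric references decode to the same character
assert html.unescape("&mdash;") == em_dash
assert html.unescape(html_hex_entity(em_dash)) == em_dash
```

One caveat for the CSS form: when the escape is immediately followed by another hex digit in the stylesheet, a single space after the escape is needed to terminate it.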
Cross-Platform Summary
macOS:
Option + E, then E = é (dead key)
Ctrl + Cmd + Space = Character Viewer (picker)
Option + - = – (en dash)
Shift + Option + - = — (em dash)
Windows:
Alt + 0169 = © (alt code)
Win + . = Emoji & Symbols (picker)
2014 then Alt+X = — (hex input; Word, WordPad, and other RichEdit-based apps)
Linux:
Compose + e + ' = é
Ctrl+Shift+U then 2026 Enter = … (GTK hex input)
IBus/Fcitx for CJK
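The Windows alt code in the summary above can be modelled in a few lines. This sketch assumes a Western-locale system, where an Alt code typed with a leading zero is interpreted as a byte in Windows code page 1252; the function name `alt_code_to_char` is hypothetical.

```python
def alt_code_to_char(code: int) -> str:
    """Approximate Alt+0NNN by decoding NNN as a Windows-1252 byte."""
    if not 0 <= code <= 255:
        raise ValueError("leading-zero Alt codes cover 0-255 only")
    # A few byte values (e.g. 0x81, 0x8D) are undefined in Python's cp1252
    # codec and raise UnicodeDecodeError here.
    return bytes([code]).decode("cp1252")


print(alt_code_to_char(169))  # ©
print(alt_code_to_char(176))  # °
```

This also shows why alt codes cannot reach most of Unicode: the input space is a single byte, so anything outside the code page needs hex input or a picker instead.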
Quick Facts
| Property | Value |
|---|---|
| Total Unicode 16.0 characters | 154,998 assigned code points |
| Keyboard keys (typical) | ~100–120 |
| Gap | ~154,878 characters not directly typeable |
| IME platform frameworks | TSF (Windows), Input Method Kit (macOS), IBus/Fcitx (Linux) |
| Best general reference | Unicode Character Database (UCD) at unicode.org |
| HTML entity for em dash | &mdash; or &#x2014; |
| Python escape for any character by name | "\N{EM DASH}" (named character escape) |
| Most used input method worldwide | Mobile IME/emoji keyboard |
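For the Python entry in the table, a short sketch using only the standard library's `unicodedata` module shows how the named escape relates to the code point and to runtime name lookups:

```python
import unicodedata

em_dash = "\N{EM DASH}"  # named character escape, resolved at compile time

# The same character written as a 4-digit hex escape
assert em_dash == "\u2014"

# Look up a character's official name, and go from name back to character
print(unicodedata.name(em_dash))  # EM DASH
ellipsis_char = unicodedata.lookup("HORIZONTAL ELLIPSIS")
print(f"U+{ord(ellipsis_char):04X}")  # U+2026
```

`unicodedata.lookup` is the runtime counterpart of `\N{...}` and is useful when the character name is only known as a string. The set of names available depends on the Unicode version bundled with the Python interpreter.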
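Related Terms
More terms in Input Methods
A method of entering a Unicode code point directly by typing its hexadecimal value. Mac (with the Unicode Hex Input keyboard layout enabled): hold Option, type the hex digits, then release. Windows: type the hex digits in Word/WordPad, then press Alt+X.
A Windows input method that enters a character by its code page number using Alt plus digits on the numeric keypad (Alt+0169 → ©, Alt+0176 → °). Limited to the characters in code page 1252.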
A system-level tool for browsing and inserting Unicode characters. macOS Character Viewer …
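A software component that converts input key sequences into characters through phonetic or structural matching, allowing complex scripts (such as Chinese, Japanese, and Korean) to be typed on a standard keyboard.
A key that starts a multi-key composition sequence (usually Right Alt or a custom mapping). A Linux/Unix feature: Compose + a + e → æ. Configurable through XCompose.
A key that produces no output immediately and instead modifies the next keystroke. Used for entering diacritics: pressing ` and then e yields è. Common on European keyboard layouts.
A UI component (native or web-based) for visually browsing and selecting characters. Mobile emoji pickers are the most common example.
A GUI utility for browsing and inserting Unicode characters. Windows: charmap.exe. Mac: Character Viewer (Control+Command+Space). Linux: gucharmap.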
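A dead key holds an accent until the next keystroke arrives and then emits the combined character. Real keyboard layouts implement this with lookup tables; the sketch below swaps in a different technique, Unicode NFC normalization, to approximate the effect. The `DEAD_KEYS` mapping and the function name are hypothetical.

```python
import unicodedata

# Hypothetical mapping from a dead key to the matching combining mark
DEAD_KEYS = {
    "`": "\u0300",  # combining grave accent
    "'": "\u0301",  # combining acute accent
    "^": "\u0302",  # combining circumflex accent
    '"': "\u0308",  # combining diaeresis
}


def apply_dead_key(dead_key: str, base: str) -> str:
    """Combine a pending dead key with the next typed letter."""
    mark = DEAD_KEYS[dead_key]
    # NFC composes base + mark into one precomposed character when one exists;
    # otherwise the result stays as base followed by the combining mark.
    return unicodedata.normalize("NFC", base + mark)


print(apply_dead_key("`", "e"))  # è
print(apply_dead_key("'", "e"))  # é
```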