Latin-1 Supplement

Latin Extended-A

Latin Extended-B

IPA Extensions

Spacing Modifier Letters

Combining Diacritical Marks

Greek and Coptic

Cyrillic

Cyrillic Supplement

Armenian

Hebrew

Arabic

Syriac

Arabic Supplement

Thaana

NKo

Devanagari

Bengali

Gurmukhi

Gujarati

Oriya

Tamil

Telugu

Kannada

Malayalam

Sinhala

Thai

Lao

Tibetan

Myanmar

Georgian

Hangul Jamo

Ethiopic

Ethiopic Supplement

Cherokee

Unified Canadian Aboriginal Syllabics

Ogham

Runic

Tagalog

Hanunoo

Buhid

Tagbanwa

Khmer

Mongolian

Limbu

Tai Le

New Tai Lue

Khmer Symbols

Buginese

Balinese

Phonetic Extensions

Phonetic Extensions Supplement

Combining Diacritical Marks Supplement

Latin Extended Additional

Greek Extended

General Punctuation

Superscripts and Subscripts

Currency Symbols

Combining Diacritical Marks for Symbols

Letterlike Symbols

Number Forms

Arrows

Mathematical Operators

Miscellaneous Technical

Control Pictures

Optical Character Recognition

Enclosed Alphanumerics

Box Drawing

Block Elements

Geometric Shapes

Miscellaneous Symbols

Dingbats

Miscellaneous Mathematical Symbols-A

Supplemental Arrows-A

Braille Patterns

Supplemental Arrows-B

Miscellaneous Mathematical Symbols-B

Supplemental Mathematical Operators

Miscellaneous Symbols and Arrows

Glagolitic

Latin Extended-C

Coptic

Georgian Supplement

Tifinagh

Ethiopic Extended

Supplemental Punctuation

CJK Radicals Supplement

Kangxi Radicals

Ideographic Description Characters

CJK Symbols and Punctuation

Hiragana

Katakana

Bopomofo

Hangul Compatibility Jamo

Kanbun

Bopomofo Extended

CJK Strokes

Katakana Phonetic Extensions

Enclosed CJK Letters and Months

CJK Compatibility

CJK Unified Ideographs Extension A

Yijing Hexagram Symbols

CJK Unified Ideographs

Yi Syllables

Yi Radicals

Modifier Tone Letters

Latin Extended-D

Syloti Nagri

Phags-pa

Hangul Syllables

High Surrogates

High Private Use Surrogates

Low Surrogates

Private Use Area

CJK Compatibility Ideographs

Alphabetic Presentation Forms

Arabic Presentation Forms-A

Variation Selectors

Vertical Forms

Combining Half Marks

CJK Compatibility Forms

Small Form Variants

Arabic Presentation Forms-B

Halfwidth and Fullwidth Forms

Specials

Linear B Syllabary

Linear B Ideograms

Aegean Numbers

Ancient Greek Numbers

Old Italic

Gothic

Ugaritic

Old Persian

Deseret

Shavian

Osmanya

Cypriot Syllabary

Phoenician

Kharoshthi

Cuneiform

Cuneiform Numbers and Punctuation

Byzantine Musical Symbols

Musical Symbols

Ancient Greek Musical Notation

Tai Xuan Jing Symbols

Counting Rod Numerals

Mathematical Alphanumeric Symbols

CJK Unified Ideographs Extension B

CJK Compatibility Ideographs Supplement

Tags

Variation Selectors Supplement

Supplementary Private Use Area-A

Supplementary Private Use Area-B

You can use this form to ask for information on Unicode 5.0.0 characters in a number of different ways. Most frequently, you'll probably want to enter a regular expression, such as smil.*face, editor. You can also enter numeric code points, using the prefix 0x for hexadecimal, 0b for binary, 0 for octal and a number with leading zeros removed for decimal. So, the code point 1578 (ARABIC LETTER TEH) could be entered as 1578, 0x62A, 03052 or 0b11000101010. Or you can enter a range of hex characters like 0x384-0x3FF.

The large characters on the right look much better on some systems than others - ideally you want Truetype fonts with good Unicode coverage installed. (e.g. from the Unicode Free Font Guide or you could download Arial Unicode MS from Microsoft.) If you know the Magic Word then instead of relying on your fonts, you'll get a bitmap from the official code pages, but there are obvious legal problems with that, so it's not the default.