Unicode character information

Basic Latin
Latin-1 Supplement
Latin Extended-A
Latin Extended-B
IPA Extensions
Spacing Modifier Letters
Combining Diacritical Marks
Greek and Coptic
Cyrillic
Cyrillic Supplement
Armenian
Hebrew
Arabic
Syriac
Arabic Supplement
Thaana
NKo
Devanagari
Bengali
Gurmukhi
Gujarati
Oriya
Tamil
Telugu
Kannada
Malayalam
Sinhala
Thai
Lao
Tibetan
Myanmar
Georgian
Hangul Jamo
Ethiopic
Ethiopic Supplement
Cherokee
Unified Canadian Aboriginal Syllabics
Ogham
Runic
Tagalog
Hanunoo
Buhid
Tagbanwa
Khmer
Mongolian
Limbu
Tai Le
New Tai Lue
Khmer Symbols
Buginese
Balinese
Phonetic Extensions
Phonetic Extensions Supplement
Combining Diacritical Marks Supplement
Latin Extended Additional
Greek Extended
General Punctuation
Superscripts and Subscripts
Currency Symbols
Combining Diacritical Marks for Symbols
Letterlike Symbols
Number Forms
Arrows
Mathematical Operators
Miscellaneous Technical
Control Pictures
Optical Character Recognition
Enclosed Alphanumerics
Box Drawing
Block Elements
Geometric Shapes
Miscellaneous Symbols
Dingbats
Miscellaneous Mathematical Symbols-A
Supplemental Arrows-A
Braille Patterns
Supplemental Arrows-B
Miscellaneous Mathematical Symbols-B
Supplemental Mathematical Operators
Miscellaneous Symbols and Arrows
Glagolitic
Latin Extended-C
Coptic
Georgian Supplement
Tifinagh
Ethiopic Extended
Supplemental Punctuation
CJK Radicals Supplement
Kangxi Radicals
Ideographic Description Characters
CJK Symbols and Punctuation
Hiragana
Katakana
Bopomofo
Hangul Compatibility Jamo
Kanbun
Bopomofo Extended
CJK Strokes
Katakana Phonetic Extensions
Enclosed CJK Letters and Months
CJK Compatibility
CJK Unified Ideographs Extension A
Yijing Hexagram Symbols
CJK Unified Ideographs
Yi Syllables
Yi Radicals
Modifier Tone Letters
Latin Extended-D
Syloti Nagri
Phags-pa
Hangul Syllables
High Surrogates
High Private Use Surrogates
Low Surrogates
Private Use Area
CJK Compatibility Ideographs
Alphabetic Presentation Forms
Arabic Presentation Forms-A
Variation Selectors
Vertical Forms
Combining Half Marks
CJK Compatibility Forms
Small Form Variants
Arabic Presentation Forms-B
Halfwidth and Fullwidth Forms
Specials
Linear B Syllabary
Linear B Ideograms
Aegean Numbers
Ancient Greek Numbers
Old Italic
Gothic
Ugaritic
Old Persian
Deseret
Shavian
Osmanya
Cypriot Syllabary
Phoenician
Kharoshthi
Cuneiform
Cuneiform Numbers and Punctuation
Byzantine Musical Symbols
Musical Symbols
Ancient Greek Musical Notation
Tai Xuan Jing Symbols
Counting Rod Numerals
Mathematical Alphanumeric Symbols
CJK Unified Ideographs Extension B
CJK Compatibility Ideographs Supplement
Tags
Variation Selectors Supplement
Supplementary Private Use Area-A
Supplementary Private Use Area-B

Search for: (e.g. phone, 0x2615)

You can use this form to ask for information on Unicode 5.0.0 characters in a number of different ways. Most frequently, you'll probably want to enter a regular expression that is matched against character names, such as smil.*face or editor. You can also enter numeric code points, using the prefix 0x for hexadecimal, 0b for binary, 0 for octal, and no prefix (and no leading zeros) for decimal. So the code point 1578 (ARABIC LETTER TEH) could be entered as 1578, 0x62A, 03052 or 0b11000101010. Or you can enter a range of code points in hexadecimal, like 0x384-0x3FF.
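To make the input rules above concrete, here is a minimal Python sketch of how such a query could be interpreted. It is not the code behind this form. The function names (parse_code_point, parse_query, lookup) are hypothetical, and it makes several assumptions: regular expressions are matched case-insensitively against character names, ranges are inclusive, and names come from Python's unicodedata module, which follows a newer Unicode version than 5.0.0.

```python
import re
import unicodedata

# One number in any of the accepted notations: 0x.. hex, 0b.. binary,
# leading-0 octal, or plain decimal without leading zeros.
NUMBER = r"(?:0x[0-9a-f]+|0b[01]+|0[0-7]+|[1-9][0-9]*|0)"
QUERY = re.compile(rf"\s*({NUMBER})(?:\s*-\s*({NUMBER}))?\s*", re.IGNORECASE)


def parse_code_point(text):
    """Convert one number in 0x/0b/0/decimal notation to an integer."""
    t = text.strip().lower()
    if t.startswith("0x"):
        return int(t[2:], 16)
    if t.startswith("0b"):
        return int(t[2:], 2)
    if len(t) > 1 and t.startswith("0"):
        return int(t[1:], 8)
    return int(t, 10)


def parse_query(query):
    """Classify a query as a code point range or a regular expression."""
    m = QUERY.fullmatch(query)
    if m is None:
        # Assumption: anything that is not a number or range is a regex
        # matched case-insensitively against character names.
        return ("regex", re.compile(query, re.IGNORECASE))
    first = parse_code_point(m.group(1))
    last = parse_code_point(m.group(2)) if m.group(2) else first
    return ("range", (first, last))


def lookup(query, limit=0x110000):
    """Return (code point, name) pairs for the named characters that match."""
    kind, value = parse_query(query)
    if kind == "range":
        first, last = value
        candidates = range(first, last + 1)  # assumption: inclusive range
        keep = lambda name: True
    else:
        candidates = range(limit)
        keep = value.search
    results = []
    for cp in candidates:
        name = unicodedata.name(chr(cp), "")  # "" for unnamed code points
        if name and keep(name):
            results.append((cp, name))
    return results
```

With this sketch, lookup("1578"), lookup("0x62A"), lookup("03052") and lookup("0b11000101010") all resolve to the same single character, and lookup("0x384-0x3FF") walks the range one code point at a time. Range endpoints above 0x10FFFF are not handled here.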

The large characters on the right look much better on some systems than on others. Ideally you want TrueType fonts with good Unicode coverage installed, for example from the Unicode Free Font Guide, or you could download Arial Unicode MS from Microsoft. If you know the Magic Word then, instead of relying on your fonts, you'll get a bitmap from the official code pages, but there are obvious legal problems with that, so it's not the default.