Documentation for What Unicode character is this ?


Unicode Support

Supports all 149,813 named characters defined in Unicode 15.1 (released September 2023).


Format of Output

For each character, its hexadecimal code point value is prefixed U+, followed by the official character name, with any abbreviation in square brackets. For a very few characters with incorrect names, the formal character name alias listed in NameAliases.txt is noted in parentheses following the character name. Where available, informative aliases listed in the Unicode code charts are appended in braces.

For characters which do not have a defined character name (i.e. control characters, private use characters, unpaired surrogates, and noncharacters), the code point label is given in angle brackets in place of the name (e.g. <control-0009> for the tab character). For control characters, the corresponding name given in ISO/IEC 6429:1992 is given in parentheses.


Tangut Ideographs

For Tangut ideographs, the algorithmic character name is followed in parentheses by the identifying source reference for the character in TangutSources.txt. Where available, the simple (and sometimes misleading or incomplete) English meaning of the character given in the English index to Lǐ Fànwén's 李範文 Tangut-Chinese Dictionary 夏漢字典 (rev. ed., 2008) is appended in braces.


Khitan Small Script Characters

For Khitan Small Script characters, the reading (in lowercase) and/or the meaning (in capitals) is appended in braces if known. This information is based on (with some modifications): Wu Yingzhe [Wú Yīngzhé 吴英喆] and Juha Janhunen, New Materials on the Khitan Small Script: A Critical Edition of Xiao Dilu and Yelü Xiangwen (Folkestone: Global Oriental, 2010).


Egyptian Hieroglyphs

For Egyptian hieroglyphs, a description of the glyph is appended in braces. Most of the glyph descriptions are derived (with some minor modifications) from the sign-list in Alan Gardiner's Egyptian Grammar (3rd ed., 1957). The descriptions for characters not included in Gardiner's sign-list are gratefully adapted from Mark-Jan Nederhof's The 1071 hieroglyphs from Unicode 5.2.


Unified Ideographs

For CJK unified ideographs, only the algorithmic character name is currently given, but additional information may be provided in the future.


CJK Compatibility Ideographs

For CJK compatibility ideographs, the algorithmic character name is followed by the variation sequence mapping for the corresponding CJK unified ideograph.


URL parameters

The following URL parameters are supported:


NB for the decode parameter only, any codes or notes parameters must be included before the decode parameter, e.g. https://www.babelstone.co.uk/​Unicode/​whatisit.html?​codes=no​&notes=no​&decode=&#x1F926;.



What Unicode character is this ? | BabelMap Online | Unicode Slide Show | Unicode Text Styler