BabelStone Fonts


BabelStone Han

Description

BabelStone Han v. 13.0.13 is a free Unicode CJK font with over 45,700 Han characters (hanzi, kanji, hanja), and over 56,500 Unicode characters in total. It is a Song/Ming style (宋体/明體) font, with glyphs modelled on the official character forms used in the People's Republic of China. It covers both traditional Chinese characters and simplified Chinese characters, and includes many rare or archaic characters that are not found in most CJK fonts, as well as more than a thousand Sawndip characters used for writing Zhuang. It also includes over 5,000 unencoded Han characters in the Private Use Area (also available in the separate BabelStone Han PUA font). The current version of BabelStone Han supports Unicode 13.0*, which was released on 10 March 2020.

* A plain text file listing all IDS sequences for CJK unified ideographs in Unicode version 13.0 is available as IDS.TXT.

BabelStone Han was created by merging the "AR PL Mingti2L Big5" font (13,068 CJK Unified Ideographs) and "AR PL SungtiL GB" font (6,763 CJK Unified Ideographs), which were both released by Arphic Technology (文鼎科技) in 2001 under the original Arphic Public License [see below]. The merged font, comprising 14,907 CJK Unified Ideographs, was then converted to Unicode mappings, and has since been expanded to cover a wide range of traditional and simplified characters in the CJK and CJK Extension A, B, C, D, E, F and G blocks, as well as many general Unicode characters. The original CJK glyphs have been modified where appropriate to match the G-source glyphs given in the Unicode code charts.

This font is under continuous development, as tens of thousands of additional CJK ideographs are scheduled for inclusion in Unicode over the next few years. I intend to add as many of these new characters to BabelStone Han as I can, but I will never be able to include anywhere near all the CJK ideographs that have been and will be added to Unicode, so I am concentrating on G-source additions (i.e. characters used in China). However, if there are any characters that you would like to have added to BabelStone Han urgently, just let me know (by Twitter or email) and I will try to fast-track their inclusion in the font. Please also let me know if you notice any glyph errors.


Summary of Coverage of CJK Unified Ideographs in BabelStone Han
Unicode Block | Total Coverage (Characters) | Total Coverage (Percentage) | G-Source Coverage (Characters) | G-Source Coverage (Percentage)
CJK Unified Ideographs | 20,989 / 20,989 | 100% | 20,930 / 20,930 | 100%
CJK Unified Ideographs Extension A | 3,752 / 6,592 | 56.9% | 3,506 / 6,192 | 56.6%
CJK Unified Ideographs Extension B | 10,435 / 42,718 | 24.4% | 8,075 / 30,525 | 26.5%
CJK Unified Ideographs Extension C | 2,049 / 4,149 | 49.4% | 1,119 / 1,119 | 100%
CJK Unified Ideographs Extension D | 222 / 222 | 100% | 76 / 76 | 100%
CJK Unified Ideographs Extension E | 3,126 / 5,762 | 54.3% | 2,296 / 2,815 | 81.6%
CJK Unified Ideographs Extension F | 2,499 / 7,473 | 33.4% | 1,304 / 1,304 | 100%
CJK Unified Ideographs Extension G | 2,661 / 4,939 | 53.9% | 703 / 2,082 | 33.8%
CJK Compatibility Ideographs | 12 / 12 | 100% | 0 / 0 | N/A
Total | 45,745 / 92,856 | 49.3% | 38,009 / 65,043 | 58.4%
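
The percentages in the summary follow directly from the character counts. As a quick sanity check, the Python sketch below recomputes the total-coverage column from those counts. The `coverage` dictionary and the `percent` helper are illustrative names only and are not part of any BabelStone tooling.

```python
# Total-coverage counts copied from the summary table: (covered, encoded).
coverage = {
    "CJK Unified Ideographs": (20989, 20989),
    "CJK Unified Ideographs Extension A": (3752, 6592),
    "CJK Unified Ideographs Extension B": (10435, 42718),
    "CJK Unified Ideographs Extension C": (2049, 4149),
    "CJK Unified Ideographs Extension D": (222, 222),
    "CJK Unified Ideographs Extension E": (3126, 5762),
    "CJK Unified Ideographs Extension F": (2499, 7473),
    "CJK Unified Ideographs Extension G": (2661, 4939),
    "CJK Compatibility Ideographs": (12, 12),
}


def percent(covered, total):
    """Return coverage as a percentage rounded to one decimal place."""
    return round(100 * covered / total, 1)


# Sum both columns to reproduce the "Total" row.
total_covered = sum(covered for covered, _ in coverage.values())
total_encoded = sum(total for _, total in coverage.values())

for block, (covered, total) in coverage.items():
    print(f"{block}: {covered} / {total} = {percent(covered, total)}%")
print(f"Total: {total_covered} / {total_encoded} = "
      f"{percent(total_covered, total_encoded)}%")
```

The same helper can be applied to the G-source counts, taking care to skip the CJK Compatibility Ideographs row, whose G-source total is zero.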


License

BabelStone Han is released under the original Arphic Public License, and may be used and distributed freely for personal or commercial purposes. The license also allows you to modify the font in any way you like (e.g. add/remove/modify glyphs or change the character mappings) without seeking my permission, as long as the modified font retains the Arphic Public License. If you do intend to distribute a modified version of BabelStone Han, please ensure that the font name (and all relevant fields in the name table) is changed to something different, so as not to cause confusion between my font and your modified font. In particular, please do not use "BabelStone" in the font name.

In 2010 Arphic Technology released two new public fonts, "AR PLMingU20 Light" (文鼎PL明體U20-L) and "AR PLBaosong2GBK Light" (文鼎PL報宋2GBK), both under a new Arphic Public License. The old license (which BabelStone Han uses) explicitly allows distributors of the original or modified font to make a financial gain ("If you wish, you can charge for this service"), whereas the terms of the new license explicitly restrict distribution of the original or modified font to "only for non-profit purpose". The two new fonts both have complete coverage of the 20,902 original characters in the main CJK block, and it would have saved me hundreds of hours of work to use one of these fonts as the basis for my font, but I prefer to use the old fonts with the original, unrestricted license.



Samples

Traditional Chinese Text


Simplified Chinese Text



Character Repertoire

BabelStone Han includes a total of 55,273 Unicode characters (50,307 named Unicode characters and 4,966 Private Use Area characters), and has a total of 56,341 glyphs. The font includes coverage for a wide range of Han ideographs and Unicode symbols as listed in the table below. The font is a dual-width font, with a mixture of full-width characters and half-width characters, as indicated in the column headed "Character Width" (a few symbol characters have a width that is a multiple of the half-width character width).


Unicode Block | Range | Coverage | Character Width | Notes
Basic Latin | 00000..0007F | 97 out of 128 characters | Half-width
Latin-1 Supplement | 00080..000FF | 91 out of 128 characters | Half-width
Latin Extended-A | 00100..0017F | 36 out of 128 characters | Half-width
Latin Extended-B | 00180..0024F | 23 out of 208 characters | Half-width
IPA Extensions | 00250..002AF | 25 out of 96 characters | Full-width
Spacing Modifier Letters | 002B0..002FF | 30 out of 80 characters | Half-width
Combining Diacritical Marks | 00300..0036F | 18 out of 112 characters | Full-width
Tibetan | 00F00..00FFF | 15 out of 211 characters | Full-width | Religious and astrological symbols
Mongolian | 01800..018AF | 3 out of 157 characters | Full-width | Lozenge-shaped punctuation marks
Combining Diacritical Marks Extended | 01AB0..01AFF | 1 out of 17 characters | Full-width | Combining wavy line below for use with Han ideographs
Phonetic Extensions | 01D00..01D7F | 6 out of 128 characters | Half-width
Latin Extended Additional | 01E00..01EFF | 30 out of 256 characters | Half-width
General Punctuation | 02000..0206F | 111 out of 111 characters | Mixed
Superscripts and Subscripts | 02070..0209F | 42 out of 42 characters | Half-width
Currency Symbols | 020A0..020CF | 5 out of 32 characters | Full-width
Combining Diacritical Marks for Symbols | 020D0..020FF | 33 out of 33 characters | Full-width
Letterlike Symbols | 02100..0214F | 26 out of 80 characters | Full-width
Number Forms | 02150..0218F | 53 out of 60 characters | Full-width
Arrows | 02190..021FF | 53 out of 112 characters | Full-width
Mathematical Operators | 02200..022FF | 47 out of 256 characters | Full-width
Miscellaneous Technical | 02300..023FF | 31 out of 256 characters | Full-width
Enclosed Alphanumerics | 02460..024FF | 160 out of 160 characters | Full-width
Box Drawing | 02500..0257F | 128 out of 128 characters | Full-width
Block Elements | 02580..0259F | 32 out of 32 characters | Full-width
Geometric Shapes | 025A0..025FF | 96 out of 96 characters | Full-width
Miscellaneous Symbols | 02600..026FF | 61 out of 256 characters | Full-width
Dingbats | 02700..027BF | 15 out of 192 characters | Full-width
Supplemental Arrows-A | 027F0..027FF | 2 out of 16 characters | Full-width
Braille Patterns | 02800..028FF | 256 out of 256 characters | Full-width
Supplemental Arrows-B | 02900..0297F | 16 out of 128 characters | Full-width
Miscellaneous Mathematical Symbols-B | 02980..029FF | 10 out of 128 characters | Full-width
Miscellaneous Symbols and Arrows | 02B00..02BFF | 79 out of 253 characters | Full-width
Latin Extended-C | 02C60..02C7F | 1 out of 32 characters | Half-width
Supplemental Punctuation | 02E00..02E7F | 7 out of 83 characters | Full-width
CJK Radicals Supplement | 02E80..02EFF | 115 out of 115 characters | Full-width
Kangxi Radicals | 02F00..02FDF | 214 out of 214 characters | Full-width
Ideographic Description Characters | 02FF0..02FFF | 12 out of 12 characters | Full-width
CJK Symbols and Punctuation | 03000..0303F | 55 out of 64 characters | Full-width
Hiragana | 03040..0309F | 93 out of 93 characters | Full-width | Decomposed (NFD) characters and named sequences also supported
Katakana | 030A0..030FF | 96 out of 96 characters | Full-width | Decomposed (NFD) characters and named sequences also supported
Bopomofo | 03100..0312F | 43 out of 43 characters | Full-width | Poor quality glyphs
Kanbun | 03190..0319F | 16 out of 16 characters | Full-width
Bopomofo Extended | 031A0..031BF | 32 out of 32 characters | Full-width | Poor quality glyphs
CJK Strokes | 031C0..031EF | 36 out of 36 characters | Full-width
Katakana Phonetic Extensions | 031F0..031FF | 16 out of 16 characters | Full-width
Enclosed CJK Letters and Months | 03200..032FF | 192 out of 255 characters | Full-width
CJK Compatibility | 03300..033FF | 73 out of 256 characters | Full-width
CJK Unified Ideographs Extension A | 03400..04DBF | 3,752 out of 6,592 characters | Full-width
Yijing Hexagram Symbols | 04DC0..04DFF | 64 out of 64 characters | Full-width
CJK Unified Ideographs | 04E00..09FFF | 20,989 out of 20,989 characters | Full-width
Latin Extended-D | 0A720..0A7FF | 2 out of 180 characters | Half-width
Private Use Area | 0E000..0F8FF | 4,966 out of 6,400 characters | Full-width | E000..E01D: Song and Yuan dynasty flute notation symbols. E020..E04B: Tang and Song dynasty lute notation symbols. E050..E06F: Miscellaneous symbols. E080..F8DF: Unencoded CJK ideographs and ideograph components. F8E0..F8FF: Unencoded kana letters and small-form ideographs. NB PUA mappings may vary from version to version of BabelStone Han, and characters will be removed from the PUA as and when they are encoded. The current PUA allocation of BabelStone Han is detailed here. The CJK characters in the PUA of this font are also available in the BabelStone Han PUA font, which also includes some unencoded characters which may not be in the latest released version of the BabelStone Han font.
CJK Compatibility Ideographs | 0F900..0FAFF | 472 out of 472 characters | Full-width | Glyphs are mostly the same as the corresponding unified ideograph (glyphs will be corrected where necessary in a future version of this font).
Variation Selectors | 0FE00..0FE0F | 16 out of 16 characters | Full-width
Vertical Forms | 0FE10..0FE1F | 10 out of 10 characters | Full-width
CJK Compatibility Forms | 0FE30..0FE4F | 32 out of 32 characters | Full-width
Small Form Variants | 0FE50..0FE6F | 26 out of 26 characters | Full-width
Halfwidth and Fullwidth Forms | 0FF00..0FFEF | 173 out of 225 characters | Full-width
Specials | 0FFF0..0FFFF | 5 out of 5 characters | Full-width
Ideographic Symbols and Punctuation | 16FE0..16FFF | 4 out of 7 characters | Full-width
Kana Supplement | 1B000..1B0FF | 256 out of 256 characters | Full-width | Archaic Hiragana and Katakana letters added in Unicode 6.0. Hentaigana letters added in Unicode 10.0. Glyphs auto-traced from the Unicode code charts.
Kana Extended-A | 1B100..1B12F | 31 out of 31 characters | Full-width | Hentaigana letters added in Unicode 10.0. Glyphs auto-traced from the Unicode code charts.
Small Kana Extension | 1B130..1B16F | 7 out of 7 characters | Full-width
Mayan Numerals | 1D2E0..1D2FF | 20 out of 20 characters | Full-width
Tai Xuan Jing Symbols | 1D300..1D35F | 87 out of 87 characters | Full-width
Counting Rod Numerals | 1D360..1D37F | 25 out of 25 characters | Full-width
Mahjong Tiles | 1F000..1F02F | 44 out of 44 characters | Full-width | Glyphs auto-traced from the Unicode code charts.
Domino Tiles | 1F030..1F09F | 100 out of 100 characters | Full-width
Playing Cards | 1F0A0..1F0FF | 82 out of 82 characters | Full-width | Glyphs auto-traced from the Unicode code charts.
Enclosed Alphanumeric Supplement | 1F100..1F1FF | 200 out of 200 characters | Full-width
Enclosed Ideographic Supplement | 1F200..1F2FF | 64 out of 64 characters | Full-width
Miscellaneous Symbols And Pictographs | 1F300..1F5FF | 70 out of 768 characters | Full-width
Emoticons | 1F600..1F64F | 80 out of 80 characters | Full-width
Ornamental Dingbats | 1F650..1F67F | 48 out of 48 characters | Full-width | Glyphs auto-traced from the Unicode code charts.
Transport and Map Symbols | 1F680..1F6FF | 6 out of 114 characters | Full-width
Geometric Shapes Extended | 1F780..1F7FF | 101 out of 101 characters | Full-width
Supplemental Symbols and Pictographs | 1F900..1F9FF | 42 out of 254 characters | Full-width
Chess Symbols | 1FA00..1FA6F | 14 out of 98 characters | Full-width | Xiangqi game symbols
Symbols and Pictographs Extended-A | 1FA70..1FAFF | 2 out of 57 characters | Full-width
Symbols for Legacy Computing | 1FB00..1FBFF | 10 out of 212 characters | Full-width | Segmented digits for 0..9
CJK Unified Ideographs Extension B | 20000..2A6DF | 10,435 out of 42,718 characters | Full-width
CJK Unified Ideographs Extension C | 2A700..2B73F | 2,049 out of 4,149 characters | Full-width | Added in Unicode 5.2
CJK Unified Ideographs Extension D | 2B740..2B81F | 222 out of 222 characters | Full-width | "Urgent need characters", added in Unicode 6.0
CJK Unified Ideographs Extension E | 2B820..2CEAF | 3,126 out of 5,762 characters | Full-width | Added in Unicode 8.0
CJK Unified Ideographs Extension F | 2CEB0..2EBEF | 2,499 out of 7,473 characters | Full-width | Added in Unicode 10.0
CJK Compatibility Ideographs Supplement | 2F800..2FA1F | 542 out of 542 characters | Full-width | Glyphs are mostly the same as the corresponding unified ideograph (glyphs will be corrected where necessary in a future version of this font).
CJK Unified Ideographs Extension G | 30000..3134F | 2,661 out of 4,939 characters | Full-width | Added in Unicode 13.0
Tags | E0000..E007F | 97 out of 97 characters | Full-width
Variation Selectors Supplement | E0100..E01EF | 240 out of 240 characters | Full-width
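
The Private Use Area sub-ranges in the Notes column can be expressed as a small lookup table. The Python sketch below is only an illustration of the allocation described for this version of the font: `PUA_RANGES` and `describe_pua` are hypothetical names, and because PUA mappings may change between versions, the boundaries should be treated as a snapshot rather than a stable interface.

```python
# PUA sub-ranges as listed in the Notes column of the Private Use Area row.
# These are a snapshot; the font's PUA mappings may vary between versions.
PUA_RANGES = [
    (0xE000, 0xE01D, "Song and Yuan dynasty flute notation symbols"),
    (0xE020, 0xE04B, "Tang and Song dynasty lute notation symbols"),
    (0xE050, 0xE06F, "Miscellaneous symbols"),
    (0xE080, 0xF8DF, "Unencoded CJK ideographs and ideograph components"),
    (0xF8E0, 0xF8FF, "Unencoded kana letters and small-form ideographs"),
]


def describe_pua(char):
    """Return the listed description for a PUA character, or None.

    None is returned both for characters outside the PUA and for PUA
    code points that fall in the gaps between the listed sub-ranges.
    """
    code_point = ord(char)
    for first, last, label in PUA_RANGES:
        if first <= code_point <= last:
            return label
    return None


print(describe_pua("\uE080"))  # a code point in the unencoded CJK range
print(describe_pua("A"))       # not a PUA character
```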


Glyph Forms

The glyph forms in BabelStone Han are in principle based on the G-source glyph forms defined by the Standardization Administration of China that are given in the Unicode code charts.


If there is no G-source glyph for a particular character included in BabelStone Han, then I use a glyph form that is consistent with G-source glyph conventions, even if it does not exactly match the glyph form in the Unicode code chart. For the basic CJK block there are an unknown number of characters that do not have the correct G-source glyph form, but the number of incorrect glyphs is being reduced with each new version of BabelStone Han that I release. Please inform me of any incorrect glyph forms that you notice.

In a few cases I deliberately do not follow the G-source glyph form given in the Unicode code charts. Occasionally the glyph form given in an authoritative PRC dictionary such as Hànyǔ Dà Zìdiǎn 漢語大字典 or Xiàndài Hànyǔ Cídiǎn 现代汉语词典 seems preferable to the code chart glyph form, and that has been used instead.

In some cases the G-source glyphs are not consistent, and I have used the expected G-source glyph form rather than the actual G-source glyph form. For example, the radical for characters sourced to GZJW (Yīnzhōu Jīnwén Jíchéng Yǐndé 殷周金文集成引得) in Exts. C and E is written differently to the standard form of the radical, and in these cases I have used the standard radical form.

In cases where I have deliberately not used the G-source code chart glyph, the G-source glyph is included in the font as an unmapped glyph which can be accessed as an ideographic variation sequence (see below).



OpenType Features

BabelStone Han supports the following OpenType features.


Script | Language | Features | Explanation
Han <hani> | Default <dflt> | Glyph Composition Decomposition <ccmp> | Support for a limited set of combining sequences of CJK unified ideographs and the Vietnamese reading marks encoded at U+16FF0 and U+16FF1 (test page)
Han <hani> | Default <dflt> | Contextual Alternates <calt> | Automatically rotates Suzhou numerals 1-3 when adjacent to each other (〡〢〣).
Latin <latn> | Default <dflt> | Glyph Composition Decomposition <ccmp> | Roman numeral ligatures for Ⅲ‍Ⅰ (Ⅲ + ZWJ + Ⅰ) and Ⅷ‍Ⅰ (Ⅷ + ZWJ + Ⅰ). Sequences of two, three or four consecutive instances of U+1D377 TALLY MARK ONE 𝍷 should be ligatured into a single character space (𝍷𝍷, 𝍷𝍷𝍷, 𝍷𝍷𝍷𝍷).
Hiragana and Katakana <kana> | Default <dflt> | Glyph Composition Decomposition <ccmp> | Used to compose decomposed Hiragana, Katakana and Hentaigana characters, including extended katakana sequences used for Ainu.
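
To illustrate what "decomposed" kana means for the <ccmp> feature, the Python sketch below uses the standard `unicodedata` module to decompose a precomposed hiragana letter into its base letter and combining mark. It also shows an Ainu katakana sequence that has no precomposed code point, so it always reaches the font as two characters. The sample characters are chosen for illustration and are not taken from the font's documentation.

```python
import unicodedata


def code_points(text):
    """Format each character of text as a U+XXXX label."""
    return [f"U+{ord(ch):04X}" for ch in text]


# A precomposed hiragana letter and its canonical decomposition (NFD).
precomposed = "\u304C"  # HIRAGANA LETTER GA
decomposed = unicodedata.normalize("NFD", precomposed)
print(code_points(precomposed), "->", code_points(decomposed))

# Recomposition (NFC) restores the single precomposed code point.
assert unicodedata.normalize("NFC", decomposed) == precomposed

# An Ainu sequence: small katakana HU followed by the combining
# semi-voiced sound mark. No precomposed character exists, so NFC
# leaves it as two code points and the font must compose the glyph.
ainu_p = "\u31F7\u309A"
print(code_points(unicodedata.normalize("NFC", ainu_p)))
```

In the first case a renderer may receive either form depending on how the text was normalized; in the second case only the two-character form exists, so composition at the glyph level is the only way to display it as one letter.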


Standardized Variation Sequences

BabelStone Han supports 1,002 standardized variation sequences for 902 CJK Unified Ideographs corresponding to CJK Compatibility Ideographs (see StandardizedVariants.txt "CJK compatibility ideographs"). It also supports 16 standardized variation sequences for eight East Asian punctuation marks (see StandardizedVariants.txt "East Asian punctuation positional variants").
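
The reason standardized variation sequences matter for CJK Compatibility Ideographs is that Unicode normalization replaces a compatibility ideograph with its unified ideograph, losing the glyph distinction, whereas a unified ideograph followed by a variation selector survives normalization. The Python sketch below demonstrates this with U+F900, using only the standard `unicodedata` module. The pairing of U+F900 with the sequence <U+8C48, U+FE00> is stated from memory of StandardizedVariants.txt and should be checked against that file.

```python
import unicodedata

# A CJK compatibility ideograph and the unified ideograph it normalizes to.
compat = "\uF900"  # CJK COMPATIBILITY IDEOGRAPH-F900
unified = unicodedata.normalize("NFC", compat)
print(f"U+{ord(compat):04X} normalizes to U+{ord(unified):04X}")

# The corresponding standardized variation sequence (assumed pairing,
# see the note above): unified ideograph followed by VS1 (U+FE00).
svs = unified + "\uFE00"

# Unlike the compatibility ideograph, the sequence is stable under
# both composed and decomposed normalization forms.
for form in ("NFC", "NFD"):
    assert unicodedata.normalize(form, svs) == svs
print("Sequence code points:", [f"U+{ord(ch):04X}" for ch in svs])
```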



Ideographic Variation Sequences

BabelStone Han supports a provisional set of ideographic variation sequences (IVS) for 721 characters. These are listed on this page. These are not yet registered in the Ideographic Variation Database (IVD), but it is intended to register a BabelStone IVS collection in the IVD at a future date. The IVS sequences currently in the font are not stable, and some may be modified or removed in response to user feedback and expert review.
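
An ideographic variation sequence is a base ideograph followed by a selector from the Variation Selectors Supplement (E0100..E01EF), whereas the standardized sequences described earlier use selectors from the Variation Selectors block (FE00..FE0F). The Python sketch below scans a string and reports each base character that is followed by a selector, along with the block the selector comes from. The function names and the sample string are hypothetical, and the sample sequences are not claimed to be ones the font supports.

```python
def selector_kind(char):
    """Classify a character as a variation selector, or return None."""
    code_point = ord(char)
    if 0xFE00 <= code_point <= 0xFE0F:
        return "Variation Selectors"
    if 0xE0100 <= code_point <= 0xE01EF:
        return "Variation Selectors Supplement"
    return None


def variation_sequences(text):
    """List (base, selector, kind) for each base followed by a selector."""
    found = []
    for base, following in zip(text, text[1:]):
        kind = selector_kind(following)
        # Skip a selector that directly follows another selector.
        if kind is not None and selector_kind(base) is None:
            found.append((base, following, kind))
    return found


# Hypothetical sample: one sequence of each kind, then a plain letter.
sample = "\u8C48\uFE00\u8FBB\U000E0100x"
for base, selector, kind in variation_sequences(sample):
    print(f"U+{ord(base):04X} + U+{ord(selector):04X} ({kind})")
```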



Version History

See BabelStone Han Version History.



Download

BabelStone Han is free to download and use for personal or commercial purposes. For use on your computer or mobile device please download and install the TTF file (or the ZIP containing the TTF file). The WOFF and WOFF2 files are for web hosting only, and are not required by most users.


Font | Version | Download | Notes
BabelStone Han | 13.0.13 (2021-03-15) | BabelStoneHan.zip [17.4 MB]; BabelStoneHan.ttf [42.6 MB]; BabelStoneHan.woff [16.7 MB]; BabelStoneHan.woff2 [6.0 MB]
BabelStone Han PUA | 1.255 (2021-03-15) | BabelStoneHanPUA.ttf [4.00 MB] | Only includes the PUA CJK characters in the range U+E080 through U+F8FF.
BabelStone Han Erjian 1 | 1.1.3 (2020-01-28) | BabelStoneErjian1.ttf [6.58 MB] | Table 1 2nd stage simplified glyph forms mapped to the corresponding standard simplified form of the character.
BabelStone Han Erjian 2 | 1.1.5 (2020-01-27) | BabelStoneErjian2.ttf [6.38 MB] | Table 1 and Table 2 2nd stage simplified glyph forms mapped to the corresponding standard simplified form of the character.

If you find this font useful, please consider making a small donation to help support the continued development of BabelStone fonts by clicking on the PayPal link below:

paypal.me/babelstone





