Unicode typefaces
Encyclopedia
A Unicode font is a computer font
Computer font
A computer font is an electronic data file containing a set of glyphs, characters, or symbols such as dingbats. Although the term font first referred to a set of metal type sorts in one style and size, since the 1990s it is generally used to refer to a scalable set of digital shapes that may be...

 that contains a wide range of characters
Character (computing)
In computer and machine-based telecommunications terminology, a character is a unit of information that roughly corresponds to a grapheme, grapheme-like unit, or symbol, such as in an alphabet or syllabary in the written form of a natural language....

, letters
Letter (alphabet)
A letter is a grapheme in an alphabetic system of writing, such as the Greek alphabet and its descendants. Letters compose phonemes and each phoneme represents a phone in the spoken form of the language....

, digit
Digit (unit)
The digit or finger is an ancient and obsolete non-SI unit of measurement of length. It was originally based on the breadth of a human finger...

s, glyphs, symbols, ideograms, logograms, etc., which are collectively mapped into the standard Universal Character Set
Universal Character Set
The Universal Character Set , defined by the International Standard ISO/IEC 10646, Information technology — Universal multiple-octet coded character set , is a standard set of characters upon which many character encodings are based...

, derived from many different languages and scripts
Writing system
A writing system is a symbolic system used to represent elements or statements expressible in language.-General properties:Writing systems are distinguished from other possible symbolic communication systems in that the reader must usually understand something of the associated spoken language to...

 from around the world. Unlike most conventional computer fonts, which are specific to a particular language or legacy character set and contain only a small subset of the UCS characters, these fonts attempt to include many thousands of possible glyphs, so that they can be used as a single typeface across multi-lingual documents.

Background

The Unicode
Unicode
Unicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems...

  standard does not specify or create the font (typeface
Typeface
In typography, a typeface is the artistic representation or interpretation of characters; it is the way the type looks. Each type is designed and there are thousands of different typefaces in existence, with new ones being developed constantly....

), a collection of graphical shapes called glyphs, itself. Rather, it defines the abstract characters as a specific number (known as a codepoint) and also defines the required changes of shape depending on the context the glyph is used in (e.g., Combining character
Combining character
In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks ....

s, precomposed character
Precomposed character
A precomposed character is a Unicode entity that can be defined as a combination of two or more other characters. A precomposed character may typically represent a letter with a diacritical mark, such as é...

s and letter
Letter (alphabet)
A letter is a grapheme in an alphabetic system of writing, such as the Greek alphabet and its descendants. Letters compose phonemes and each phoneme represents a phone in the spoken form of the language....

-diacritic
Diacritic
A diacritic is a glyph added to a letter, or basic glyph. The term derives from the Greek διακριτικός . Diacritic is both an adjective and a noun, whereas diacritical is only an adjective. Some diacritical marks, such as the acute and grave are often called accents...

 combinations). The choice of font, which governs how the abstract UCS characters are converted into a bitmap or vector output that can be viewed on a screen or printed, is left up to the user. If a font is chosen which does not contain a glyph for a codepoint used in the document, typically a question mark ("?"), a box, or some other Substitute character
Substitute character
A substitute character is a control character that is used in the place of a character that is recognized to be invalid or in error or that cannot be represented on a given device....

 is displayed.

Computer font
Computer font
A computer font is an electronic data file containing a set of glyphs, characters, or symbols such as dingbats. Although the term font first referred to a set of metal type sorts in one style and size, since the 1990s it is generally used to refer to a scalable set of digital shapes that may be...

s use various techniques to display characters or glyphs. A Bitmap font contains a grid of dots known as pixel
Pixel
In digital imaging, a pixel, or pel, is a single point in a raster image, or the smallest addressable screen element in a display device; it is the smallest unit of picture that can be represented or controlled....

s forming an image of each glyph in each face and size. Outline fonts (also known as Vector fonts) use drawing instructions or mathematical formulæ to describe each glyph. Stroke fonts use a series of specified lines (for the glyph's border) and additional information to define the profile, or size and shape of the line in a specific face and size, which together describe the appearance of the glyph.

Many fonts have kerning
Kerning
In typography, kerning is the process of adjusting the spacing between characters in a proportional font, usually to achieve a visually pleasing result. Kerning is the adjustment of the space between individual letter forms vs. tracking which is the uniform adjustment of spacing applied over a...

 pairs which implements better spacing in between the characters. Fonts also includes embedded special orthographic
Orthographic
Orthographic may refer to:* Orthographic projection** Orthographic projection ** Orthographic projection * Orthography...

rules to output certain combinations of letterforms (an alternative symbols for the same letter) be combined into special ligature forms
Ligature (typography)
In writing and typography, a ligature occurs where two or more graphemes are joined as a single glyph. Ligatures usually replace consecutive characters sharing common components and are part of a more general class of glyphs called "contextual forms", where the specific shape of a letter depends on...

 (mixed characters). Operating System
Operating system
An operating system is a set of programs that manage computer hardware resources and provide common services for application software. The operating system is the most important type of system software in a computer system...

 or Web-Browser (aka, User Agent
User agent
In computing, a user agent is a client application implementing a network protocol used in communications within a client–server distributed computing system...

) or both, which uses a font to display text on the screen or print media, can be programmed to use those embedded rules, or use external script-shaping technologies (also known as Rendering Technology or Smartfont Engine), and they can also be programmed to use either a large unicode font, or use multiple different fonts for different characters or languages.

No single "Unicode font" includes all the characters defined in the present revision of ISO 10646 (Unicode) standard, as it is continually adding more & more languages and characters. As a result, font developers and foundries are also incorporating those new characters in newer version or revision of a font, and correcting their previous errors if there were any.

The UCS has over 1.1 million code points, but only the first 65,536 (the Plane 0: Basic Multilingual Plane, or BMP) had entered into common use before 2000. (See the Mapping of Unicode characters
Mapping of Unicode characters
Unicode’s Universal Character Set has a potential capacity to support over 1 million characters. Each UCS character is mapped to a code point which is an integer between 0 and 1,114,111 used to represent each character within the internal logic of text processing software .As of Unicode 5.2.0,...

 article for more information on other planes, including Plane 1: SMP, Plane 2: SIP, Plane 14: SSP, Plane 15 and 16: reserved for PUA.)

The first Unicode fonts (with very large character set, and supporting many Unicode blocks) were Lucida Sans Unicode
Lucida Sans Unicode
In digital typography, Lucida Sans Unicode OpenType font from the design studio of Bigelow & Holmes is designed to support the most commonly used characters defined in version 2.0 of the Unicode standard...

 (released March 1993), Unihan font
Unihan font
Unihan font was developed by Ross Paterson in 1993.Unihan font had two variations, 16x16 and 24x24 pixel fonts. These covered most of theCJK Auxiliary and UniHan portions of Unicode. Font files were in HBF format...

 (1993), and Everson Mono
Everson Mono
Everson Mono is a monospaced humanist sans serif Unicode font whose development by Michael Everson began in 1995. At first, Everson Mono was a collection of 8-bit fonts containing glyphs for tables in ISO/IEC 10646; at that time, it was not easy to edit cmaps to have true Unicode indices, and there...

 (1995).

Issues

There are typographical ambiguities in Unicode, so that some of the unified Han
Han unification
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the so-called CJK languages into a single set of unified characters. Han characters are a common feature of written Chinese , Japanese , Korean , and—at least historically—other...

 characters (seen in Chinese, Japanese, and Korean) will be typographically different in different regions. For example, Unicode point U+9AA8 (骨) is typographically different between simplified Chinese and traditional Chinese. This has implications for the idea that a single typeface can satisfy the needs of all locales.
The design of Unicode ensures that such differences do not create semantic ambiguity, but the use of incorrect forms is often considered inappropriate to native readers of East Asian languages.

Application of Unicode fonts

Despite all the issues, Unicode
Unicode
Unicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems...

 is now the base character set for many new standards and protocols, and is built into the architecture of operating systems (Microsoft Windows
Microsoft Windows
Microsoft Windows is a series of operating systems produced by Microsoft.Microsoft introduced an operating environment named Windows on November 20, 1985 as an add-on to MS-DOS in response to the growing interest in graphical user interfaces . Microsoft Windows came to dominate the world's personal...

, Apple
Apple Computer
Apple Inc. is an American multinational corporation that designs and markets consumer electronics, computer software, and personal computers. The company's best-known hardware products include the Macintosh line of computers, the iPod, the iPhone and the iPad...

 Mac OS X
Mac OS X
Mac OS X is a series of Unix-based operating systems and graphical user interfaces developed, marketed, and sold by Apple Inc. Since 2002, has been included with all new Macintosh computer systems...

, and many versions of Unix
Unix
Unix is a multitasking, multi-user computer operating system originally developed in 1969 by a group of AT&T employees at Bell Labs, including Ken Thompson, Dennis Ritchie, Brian Kernighan, Douglas McIlroy, and Joe Ossanna...

), programming languages (Ada
Ada (programming language)
Ada is a structured, statically typed, imperative, wide-spectrum, and object-oriented high-level computer programming language, extended from Pascal and other languages...

, Perl
Perl
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Perl was originally developed by Larry Wall in 1987 as a general-purpose Unix scripting language to make report processing easier. Since then, it has undergone many changes and revisions and become widely popular...

, Python, Java
Java (programming language)
Java is a programming language originally developed by James Gosling at Sun Microsystems and released in 1995 as a core component of Sun Microsystems' Java platform. The language derives much of its syntax from C and C++ but has a simpler object model and fewer low-level facilities...

, Common LISP
Common Lisp
Common Lisp, commonly abbreviated CL, is a dialect of the Lisp programming language, published in ANSI standard document ANSI INCITS 226-1994 , . From the ANSI Common Lisp standard the Common Lisp HyperSpec has been derived for use with web browsers...

, APL
APL programming language
APL is an interactive array-oriented language and integrated development environment, which is available from a number of commercial and noncommercial vendors and for most computer platforms. It is based on a mathematical notation developed by Kenneth E...

), and libraries (IBM International Components for Unicode
International Components for Unicode
International Components for Unicode is an open source project of mature C/C++ and Java libraries for Unicode support, software internationalization and software globalization. ICU is widely portable to many operating systems and environments. It gives applications the same results on all...

 (ICU) along with the Pango
Pango
Pango is an LGPL licensed open source computing library used by software developers for laying out and rendering text in high quality, emphasising support for multilingual text...

, Graphite
Graphite (SIL)
Graphite is a programmable Unicode-compliant smart-font technology and rendering system developed by SIL International. It is free software, distributed under the terms of the GNU Lesser General Public License and the Common Public License....

, Scribe
Qt (toolkit)
Qt is a cross-platform application framework that is widely used for developing application software with a graphical user interface , and also used for developing non-GUI programs such as command-line tools and consoles for servers...

, Uniscribe
Uniscribe
Uniscribe is the Microsoft Windows set of services for rendering Unicode-encoded text, especially complex text layout. They are implemented in the DLL USP10.DLL. USP10.dll became available to the public with Windows 2000 and Internet Explorer 5.0...

, and ATSUI
Apple Type Services for Unicode Imaging
The Apple Type Services for Unicode Imaging is the set of services for rendering Unicode-encoded text starting with Mac OS 8.5 and in Mac OS X.It replaced the WorldScript engine for legacy encodings....

 rendering engines), font formats (TrueType
TrueType
TrueType is an outline font standard originally developed by Apple Computer in the late 1980s as a competitor to Adobe's Type 1 fonts used in PostScript...

 and OpenType
OpenType
OpenType is a format for scalable computer fonts. It was built on its predecessor TrueType, retaining TrueType's basic structure and adding many intricate data structures for prescribing typographic behavior...

) and so on. Many other standards are also getting upgraded to Unicode compliance, day by day.

Utility software

Utility software can be used to see exactly which characters are included inside a font file:
  • Character Map
    Character Map
    Character Map is a utility included with Microsoft Windows operating systems and is used to view the characters in any installed font, to check what keyboard input is used to enter those characters, and to copy characters to the clipboard in lieu of typing them. The tool is usually useful for...

     applet included with Windows
  • Font Book
    Font Book
    Apple's Font Book is a font manager that comes with Mac OS X since version 10.3 .The program enables users to:* Preview and install font files, avoiding multiple instances of the same font....

     application included with Mac OS X
  • BabelMap application, for Windows
  • FontForge
    FontForge
    FontForge is a typeface editor program developed by George Williams. FontForge is free software and is distributed under the BSD license. FontForge is available for several operating systems and is localized in several languages.- Features :Fontforge supports many font formats, including...

     application (Font Creator), for Linux, Unix, Windows

List of Unicode fonts

Of the many Unicode fonts available, the few ones listed below are the most commonly used by a majority of users around the world on mainstream computing platforms
Platform (computing)
A computing platform includes some sort of hardware architecture and a software framework , where the combination allows software, particularly application software, to run...

. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.
List of Unicode Fonts
Font Char(s) Glyphs Kernpairs
(Standard)
Point (Range) : Hinting/ Smoothing Behavior Version & Date Filename,
Size
Font Family Font Weight,
style
Font
type
Serif
style
License Creator / Author (Copyright) Notes
Arial
Arial
Arial, sometimes marketed or displayed in software as Arial MT, is a sans-serif typeface and set of computer fonts. Fonts from the Arial family are packaged with Microsoft Windows, some other Microsoft software applications, Apple Mac OS X and many PostScript 3 computer printers...

3,415 3,415 909 v5.06
2009-01-20
arial.ttf
(756 KB)
Arial Medium (Normal), Regular OTF+TTO Normal Sans Proprietary (2008) The Monotype Corporation.
Arial Unicode MS
Arial Unicode MS
In digital typography, the TrueType font Arial Unicode MS is an extended version of the font Arial. Compared to Arial, it includes higher line height, omits kerning pairs and adds enough glyphs to cover a large subset of Unicode 2.1—thus supporting most Microsoft code pages, but also requiring much...

38,917 50,377 0 v1.01 sfnt rev 1
2002-11-18
ARIALUNI.TTF
(22.1 MB)
Arial Unicode MS Medium (Normal), Regular OTF+TTO Normal Sans Proprietary Agfa Monotype Corporation.
Monotype Corporation.
Bitstream Cyberbit
Bitstream Cyberbit
Bitstream Cyberbit is a commercial Unicode font designed by Bitstream Inc. It is freeware for non-commercial uses. It was historically one of the first widely available fonts with support for a large proportion of the Unicode repertoire....

32,961 29,934 935 v2.0 beta
1998-03-22
Cyberbit.ttf
(12.7 MB)
Bitstream Cyberbit Medium (Normal), Roman TTF  Cove Freeware Bitstream Inc.
BitstreamCyberCJK 30,275 28,686 935 v2.0 beta
1998-03-17
Cyberbit.ttf
(12.4 MB)
Bitstream CyberCJK Medium (Normal), Book, Roman TTF  Cove Freeware Bitstream Inc.
Cardo 2,879 2,882 216 v0.099 sfnt rev 0.098999
2010-05-23
Cardo99s.ttf
(706 KB)
Cardo Medium (Normal), Book, Regular TTF  Cove Freeware. SIL Open Font License v1.1. David J. Perry
Caslon Roman
Caslon Roman
Caslon Roman is a serif style Caslon family TrueType Unicode font, developed by George Williams. It is available free, under BSD-like license or SIL Open Font License.-Character Coverage:...

3,683 3,686 0 v001.000
2003-10-23

(3.70 MB)
Caslon Roman TTF    BSD-like license George Williams
Code2000
Code2000
Code2000 is a pan-Unicode digital font, which includes characters and symbols from a very large range of writing systems. As of the current final version 1.171 released in 2008, Code2000 is designed and implemented by James Kass to include as much of the Unicode 5.2 standard as practical , and to...

53,068 63,546 239 v1.171 sfnt rev 1.1709
2008-06-12
CODE2000.TTF
(7.98 MB)
Code2000 Medium (Normal), Book, Regular TTF  Any Shareware
(unrestricted)
James Kass
Charis SIL
Charis SIL
Charis SIL is a glyphic serif typeface developed by SIL International. It is based on Bitstream Charter, one of the first fonts designed for laser printers...

2,172 4,661 0 v4.106 sfnt rev 4.106
2009-03-18
CharisSILR.ttf
(1.57 MB)
Charis SIL Medium (Normal), Regular TTF  Any OFL
SIL Open Font License
The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement...

 
SIL International
Chryſanþi Unicode (Chrysanthi Unicode) 4,818 4,383 0 v3.1
2001-07-15
chrysuni.ttf
(737 KB)
Chrysanthi Unicode Medium (Normal), Regular TTF  Any Freeware Dolgthrasir Mioethdrauci.
Every Witch Way.
ClearlyU
ClearlyU
ClearlyU is a bitmap unicode font developed by Mark Leisher.It contains 9,538 glyphs in v1.9. Includes characters from following unicode ranges: Basic Latin, Latin-1 Supplement, Latin Extended-B, IPA Extensions, Spacing Modifier Letters, Combining Diacritical Marks, Greek, Cyrillic, Armenian,...

9,538
0 v1.9
2002-11-22
cu12.bdf
(1.13 MB)
ClearlyU Medium Freeware Mark Leisher.
DejaVu Sans
DejaVu fonts
The DejaVu fonts are modifications of the Bitstream Vera fonts designed for greater coverage of Unicode, as well as providing more styles. The Bitstream Vera family was limited mainly to the characters in the Basic Latin and Latin-1 Supplement portions of Unicode but was released with a license...

5,467 5,762 2,637 v2.32 sfnt rev 2.31999
2010-08-22
DejaVuSans.ttf
(667 KB)
DejaVu Sans Medium (Normal), Book OTF+TTO Normal Sans Bitstream Vera
Bitstream Vera
Vera is a group typeface with a liberal license.It was designed by Jim Lyles from Bitstream, and it is closely based on Bitstream's Prima, for which Lyles was also responsible. It is a TrueType font with full hinting instructions, which improve its rendering quality on low-resolution devices such...

 license and public domain for additions
Bitstream, Inc.
Tavmjong Bah.
public domain.
Doulos SIL
Doulos SIL
Doulos SIL is a serif typeface developed by SIL International, very similar to Times or Times New Roman. Unlike Times New Roman, Doulos only has a single face, Regular...

2,172 4,661 0 v4.106
2009-03-18
DoulosSILR.ttf
(1.64 MB)
Doulos SIL Medium (Normal), Book, Regular TTF  Any OFL
SIL Open Font License
The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement...

 
SIL International
Everson Mono
Everson Mono
Everson Mono is a monospaced humanist sans serif Unicode font whose development by Michael Everson began in 1995. At first, Everson Mono was a collection of 8-bit fonts containing glyphs for tables in ISO/IEC 10646; at that time, it was not easy to edit cmaps to have true Unicode indices, and there...

6,391 6,399 0 v5.203 sfnt rev v5.203
2010-07-03
Everson Mono.ttf
(1.91 MB)
Everson Mono Medium (Normal), Regular TTF  Any Shareware
(unrestricted)
Michael Everson
FreeSerif 7,203 8,995 36,302 v1.548 sfnt rev 1.54799
2010-09-19
FreeSerif.ttf
(1.60 MB)
FreeSerif Medium (Normal) TTF  Cove GPL  Free Software Foundation.
Gentium Regular
Gentium
Gentium is a Unicode serif typeface designed by Victor Gaultney. Gentium fonts are free and open source software, and are released under the SIL Open Font License , which permits modification and redistribution. Gentium has wide support for languages using the Latin, Greek, and Cyrillic alphabets,...

1,469 1,699 2,857 v1.02; 2005; sfnt rev 1.02
2005-11-20
GenR102.TTF
(354 KB)
Gentium Book, Regular TTF  Any OFL
SIL Open Font License
The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement...

 
J. Victor Gaultney. SIL International
Gentium Plus 2,513 5,586 0 v1.502 sfnt rev 1.502
2010-8-19
GentiumPlus-R.ttf
(1.54 MB)
Gentium Plus Medium (Normal), Book, Regular TTF  Any OFL
SIL Open Font License
The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement...

 
SIL International
GNU Unifont
GNU Unifont
The GNU Unifont by Roman Czyborra is a free bitmap font that covers the Unicode Basic Multilingual Plane , using an intermediate bitmapped font format....

(Unifont)
63,446 63,449 0 v1.00 sfnt rev 1 (5.1.20080914)
2008-09-07

(15.5 MB)
unifont Medium (Normal) Bitmap, TTF  Any GPL  Roman Czyborra
HAN NOM A,
HAN NOM B
32,328
45,020
34,147
45,175
0
0
v2.0; 2005 sfnt rev 2
2005-02-05

(20.3 MB)

(32.2 MB)
HAN NOM A
HAN NOM B
Medium (Normal), Book, Regular TTF  No Fit Freeware Chan Nguyen, Tv Vien Chieu, To Minh Tam.
Junicode
Junicode
Junicode is a free old-style serif typeface developed by Peter S. Baker of the University of Virginia. The design is based on a 17th century typeface used in Oxford, England....

3,022 3,096 v0.6.17 sfnt rev 0.599991
2009-07-11

(752 KB)
Junicode Medium (Normal), Book, Regular TTF  Any GPL  Peter S. Baker
Linux Libertine
Linux Libertine
Linux Libertine is a digital typeface created by the Libertine Open Fonts Project, which aims to create free and open alternatives to Proprietary software typefaces such Times Roman...

2,549 2,552 0 v4.7.5 sfnt rev 4.7
2010-06-15

(893 KB)
Linux Libertine Medium (Normal), Book, Regular OTF+TTO Any GPL, OFL
SIL Open Font License
The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement...

 
Philipp H. Poll
Lucida Grande
Lucida Grande
Lucida Grande is a humanist sans-serif typeface. It is a member of the Lucida family of typefaces designed by Charles Bigelow and Kris Holmes. It has been used throughout Mac OS X user interface since 1999, as well as in Safari for Windows up to the browser's version 3.2.3 released on May 12,...

2,245 2,826 0 v5.0d8e1 sfnt rev 1.2
2003-03-07
Lucida Grande.ttf
(1.07 MB)
Lucida Grande Medium (Normal), Regular OTF Normal Sans Proprietary Bigelow & Holmes Inc
Lucida Sans Unicode
Lucida Sans Unicode
In digital typography, Lucida Sans Unicode OpenType font from the design studio of Bigelow & Holmes is designed to support the most commonly used characters defined in version 2.0 of the Unicode standard...

1,765 1,776 0 v2.00
1998-08-12
l_10646.ttf
(316 KB)
Lucida Sans Medium (Normal), Regular OTF+TTO Normal Sans Proprietary Bigelow & Holmes Inc
Microsoft JhengHei 28,958 29,220 583 v6.02 sfnt rev 6.02
2009-02-25
msjh.ttf
(20.6 MB)
Microsoft JhengHei Medium (Normal), Book, Regular OTF+TTO Normal Sans Proprietary Microsoft Corporation
Microsoft Sans Serif 2,788 3,077 0 v5.02 sfnt rev 5.02
2009-02-12
micross.ttf
(637 KB)
Microsoft Sans Serif Medium (Normal), Book, Regular OTF+TTO Normal Sans Proprietary Microsoft Corporation
New Gulim
New Gulim
New Gulim is a sans-serif type unicode font designed especially for the Korean-language script, designed by HanYang System Co., Limited . It is an expanded version of Hanyang Gulrim ....

46,567 49,284 0 v3.10 sfnt rev 1
2002-10-29
nGulim.ttf
(24.5 MB)
New Gulim Medium (Normal), Regular TTF  Normal Sans. Obtuse Cove Proprietary HanYang System Co.,LTD.
Quivira 7,882 7,398 0 v3.3 sfnt rev 3.3
2010-03-28
Quivira.ttf
(936 KB)
Quivira Medium (Normal), Book, Regular TTF  Cove Freeware GrinningBit.
STIX 3,290 3,292 1,016 v1.0.0 sfnt rev 1.0
2010-04-29
STIXGeneral.otf
(404 KB)
STIXGeneral Medium (Normal), Book, Regular OTF Any Freeware
Sun-ExtA,
Sun-ExtB
50,112
47,564
50,016
47,660
0
0
(A) v5.4 sfnt rev 5
2009-01-03
(B) v5.4 sfnt rev 5.39999
2009-02-02
Sun-ExtA.ttf
(21.9 MB)
Sun-ExtB.ttf
(16.8 MB)
Sun-ExtA
Sun-ExtB
Medium (Normal), Book, Regular TTF  No Fit Both Freeware.
Proprietary
okuc.
Beijing ZhongYi Electronics Co.
Tahoma
Tahoma (typeface)
Tahoma is a humanist sans-serif typeface designed by Matthew Carter for the Microsoft Corporation in 1994 with initial distribution along with Verdana for Windows 95....

1,912 3,412 674 v5.06
2009-02-12
tahoma.ttf
(681 KB)
Tahoma Medium (Normal), Regular OTF+TTO Normal Sans Proprietary Microsoft Corporation
Times New Roman 2,790 3,414 867 v5.05
2009-02-06
times.ttf
(816 KB)
Times New Roman Medium (Normal), Regular OTF+TTO Cove Proprietary; The Monotype Corporation.
TITUS Cyberbit Basic 9,209 10,044 0 v3.0 (2000) sfnt rev 4
2005-10-08
TITUSCBZ.TTF
(1.91 MB)
TITUS Cyberbit Basic Medium (Normal), Regular TTF  Cove Freeware Bitstream Inc / TITUS project.
WenQuanYi
WenQuanYi
WenQuanYi is an open-source project of Chinese computer fonts licensed under GNU General Public License.-General:WenQuanYi project was started by Qianqian Fang , a Chinese biomedical imaging researcher at the Massachusetts General Hospital, in October, 2004.Now the fonts of WenQuanYi project are...

 Bitmap Song
41,295 154,997 0 1.0.0-RC1 WenQuanYi Bitmap Song Regular Multi-strike Bitmap Font Song(Serif) Style for Chinese GPL
WenQuanYi Zen Hei 42,285 43,643 0 v0.9.45 sfnt rev 0.89994
2010-03-11
wqy-zenhei.ttc
(16.0 MB)
WenQuanYi Zen Hei, WenQuanYi Zen Hei Mono and WenQuanYi Zen Hei Sharp Medium, Regular TTC  Hei(Sans) Style for Chinese GPL Qianqian Fang and WenQuanYi Board of Trustees.
WenQuanYi Micro Hei 34,707 48,755 0 0.2.0-beta WenQuanYi Micro Hei and WenQuanYi Micro Hei Mono Regular, Light TTC  Hei(Sans) Style for Chinese GPL
Y.OzFontN 21,957 57,621 0 v13.00 sfnt rev 5 Pen-Ji
2010-08-24
YOzRN.TTC
(13.5 MB)
YOzFontN Regular TTC  Any Freeware Y.OzVox
XITS 3,290 3,292 1,016 v1.0.10
2011-04-25
xits-regular.otf
(247 KB)
XITS Medium (Normal), Book, Regular OTF Any OFL
OFL
OFL may stand for :* Off-line* Ontario Federation of Labour, a Canadian trade union federation.* Open Font Licence by SIL International* Overfishing level* Overflow...

Font Char(s) Glyphs Kernpairs
(Standard)
Point (Range) : Hinting/ Smoothing Behavior Version & Date Filename,
Size
Font Family Font weight,
style
Font
type
Serif
style
License Creator / Author (Copyright) Notes

Note
OTF+TTO: OpenType
OpenType
OpenType is a format for scalable computer fonts. It was built on its predecessor TrueType, retaining TrueType's basic structure and adding many intricate data structures for prescribing typographic behavior...

 font with TrueType
TrueType
TrueType is an outline font standard originally developed by Apple Computer in the late 1980s as a competitor to Adobe's Type 1 fonts used in PostScript...

 outlines.
OpenType fonts sometimes don't contain a one-by-one Kernpair table but a kern-by-classes table where groups of similar characters are seen as one kern group. For instance, V and W have nearly the same left and right geometry. So “0” doesn't mean that no kerning is supported!

Comparison of fonts

Number of characters included by the above version of fonts, for different Unicode blocks (or, ranges), are listed below. Basic Latin (128: 0000–007F) means that in the range called 'Basic Latin', there are 128 assigned codes, numbered 0 to 7F
Hexadecimal
In mathematics and computer science, hexadecimal is a positional numeral system with a radix, or base, of 16. It uses sixteen distinct symbols, most often the symbols 0–9 to represent values zero to nine, and A, B, C, D, E, F to represent values ten to fifteen...

. The cells then show the number of those codes which are covered by each font.
Cells shaded green indicate complete coverage.
Cells shaded blue are not complete, but are the most complete of the fonts listed.
Empty cells indicate that no character exists in that block.

0000-077F

Unicode Fonts
  BMP (Plane 0)
Font 

Range 

Non-Unicode Glyphs 703 23 1 295 8 4 76 3 589 263 443 624 704 866 954
Unassigned Code Points (0000-11FFFF) 261 2 1 10 2 1 7 1 1 2 1 1 32 67 1 37 118
Total Glyphs in BMP (60699: 0000–FFFF)
C0 Control Character (0000–001F) 31 1 1 1 31 1 1 31 31 32
Basic Latin
Latin alphabet
The Latin alphabet, also called the Roman alphabet, is the most recognized alphabet used in the world today. It evolved from a western variety of the Greek alphabet called the Cumaean alphabet, which was adopted and modified by the Etruscans who ruled early Rome...

 (95: 0020–007E)
Delete Character (007F-007F) 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
C1 Control Character (0080-009F) 32 32 32
Latin-1 Supplement (96: 00A0–00FF) 94 95 95 94 94 94 95
Latin Extended-A (128: 0100–017F) 127 8 29
Latin Extended-B (208: 0180–024F) 148 52 178 188 188 178 188 182 194 183 119 1 7 183 8 28
IPA Extensions
International Phonetic Alphabet
The International Phonetic Alphabet "The acronym 'IPA' strictly refers [...] to the 'International Phonetic Association'. But it is now such a common practice to use the acronym also to refer to the alphabet itself that resistance seems pedantic...

 (96: 0250–02AF)
89 94 94 94 94 89 55
Spacing Modifier Letters (80: 02B0–02FF) 57 79 63 75 63 62 63 75 75 63 57 12 10 11 14
Combining Diacritical Marks
Diacritic
A diacritic is a glyph added to a letter, or basic glyph. The term derives from the Greek διακριτικός . Diacritic is both an adjective and a noun, whereas diacritical is only an adjective. Some diacritical marks, such as the acute and grave are often called accents...

 (112: 0300–036F)
72 82 103 82 82 93 103 107 109 111 106 68 1 109 110 106 32
Greek and Coptic
Greek alphabet
The Greek alphabet is the script that has been used to write the Greek language since at least 730 BC . The alphabet in its classical and modern form consists of 24 letters ordered in sequence from alpha to omega...

 (134: 0370–03FF)
126 105 124 110 17 76 110 17 120 128 120 80 110 106 91 117 126 73 127 123 126 117 72 76
Cyrillic
Cyrillic alphabet
The Cyrillic script or azbuka is an alphabetic writing system developed in the First Bulgarian Empire during the 10th century AD at the Preslav Literary School...

 (256: 0400–04FF)
254 226 253 2 238 214 238 244 214 247 214 230 244 153 254 94 253 254 254 246 92 66
Cyrillic Supplement (38: 0500–052F) 24 16 36 34 16 36 34 36 34 16 24 20 24 24 15
Armenian
Armenian alphabet
The Armenian alphabet is an alphabet that has been used to write the Armenian language since the year 405 or 406. It was devised by Saint Mesrop Mashtots, an Armenian linguist and ecclesiastical leader, and contained originally 36 letters. Two more letters, օ and ֆ, were added in the Middle Ages...

 (86: 0530–058F)
85 85 84
Hebrew
Hebrew alphabet
The Hebrew alphabet , known variously by scholars as the Jewish script, square script, block script, or more historically, the Assyrian script, is used in the writing of the Hebrew language, as well as other Jewish languages, most notably Yiddish, Ladino, and Judeo-Arabic. There have been two...

 (87: 0590–05FF)
82 46 86 82 86 60 82 54 53 82 51 83
Arabic
Arabic alphabet
The Arabic alphabet or Arabic abjad is the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually stand for consonants, it is classified as an abjad.-Consonants:The Arabic alphabet has...

 (250: 0600–06FF)
225 194 60 10 185 69 201 161 67 232 225 225 225 183
Syriac
Syriac alphabet
The Syriac alphabet is a writing system primarily used to write the Syriac language from around the 2nd century BC . It is one of the Semitic abjads directly descending from the Aramaic alphabet and shares similarities with the Phoenician, Hebrew, Arabic, and the traditional Mongolian alphabets.-...

 (77: 0700–074F)
50
Arabic Supplement (48: 0750–077F) 30 30 30 30

Range

Font  

Range 

0780-139F


Range

Font  

Range 
Thaana (50: 0780–07BF) 49 49
N'ko
N'Ko
N'Ko is both a script devised by Solomana Kante in 1949 as a writing system for the Mande languages of West Africa, and the name of the literary language itself written in the script. The term N'Ko means 'I say' in all Manding languages....

 (59: 07C0–07FA)
54
Samaritan (46: 0800–082D)
Aramaic (29: 0820–083F)
Samaritan, Punctuation (15: 0830–083E)
Pahlavi (64: 0840–087F)
Tifinagh (48: 0880–08AF)
Mandaic (48: 08D0–08FF)
Devanagari
Devanagari
Devanagari |deva]]" and "nāgarī" ), also called Nagari , is an abugida alphabet of India and Nepal...

 (117: 0900–097F)
104 112 104 103 105 106
Bengali
Bengali script
The Bengali alphabet is the writing system for the Bengali language. The script with variations is used for Assamese and is basis for Meitei, Bishnupriya Manipuri, Kokborok, Garo and Mundari alphabets. All these languages are spoken in the eastern region of South Asia. Historically, the script has...

 (92: 0980–09FF)
89 91 89 91 90
Gurmukhi
Gurmukhi script
Gurmukhi is the most common script used for writing the Punjabi language. An abugida derived from the Laṇḍā script and ultimately descended from Brahmi, Gurmukhi was standardized by the second Sikh guru, Guru Angad Dev Ji, in the 16th century. The whole of the Sri Guru Granth Sahib Ji's 1430...

 (79: 0A00–0A7F)
75 77
Gujarati
Gujarati script
The Gujarati script , which like all Nāgarī writing systems is strictly speaking an abugida rather than an alphabet, is used to write the Gujarati and Kutchi languages...

 (83: 0A80–0AFF)
78 78 78
Oriya
Oriya script
The Oriya script or Utkala Lipi or Utkalakshara is used to write the Oriya language, and can be used for several other Indian languages, for example, Sanskrit.- History :...

 (84: 0B00–0B7F)
79 79
Tamil
Tamil script
The Tamil script is a script that is used to write the Tamil language as well as other minority languages such as Badaga, Irulas, and Paniya...

 (72: 0B80–0BFF)
61 71 70
Telugu
Telugu script
Telugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language, a language found in the South-Central Indian state of Andhra Pradesh as well as several other neighboring states. The Telugu script is derived from the Bhattiprolu script...

 (93: 0C00–0C7F)
80 42 80
Kannada
Kannada script
The Kannada script is an alphasyllabary of the Brahmic family, used primarily to write the Kannada language, one of the Dravidian languages of southern India and also Sanskrit in the past. The Telugu script is derived from Old Kannada, and resembles Kannada script...

 (86: 0C80–0CFF)
80
Malayalam
Malayalam script
The Malayalam script is a Brahmic script used commonly to write the Malayalam language—which is the principal language of the Indian state of Kerala, spoken by 36 million people in the world. Like many other Indic scripts, it is an abugida, or a writing system that is partially “alphabetic” and...

 (95: 0D00–0D7F)
78 78
Sinhala (80: 0D80–0DFF)
Thai
Thai alphabet
Thai script , is used to write the Thai language and other, minority, languages in Thailand. It has forty-four consonants , fifteen vowel symbols that combine into at least twenty-eight vowel forms, and four tone marks ....

 (87: 0E00–0E7F)
86 1
Lao
Lao alphabet
The Lao alphabet, Aksone Lao , is the main script used to write the Lao language and other minority languages in Laos. It is ultimately of Indic origin, the alphabet includes 27 consonants , 7 consonantal ligatures , 33 vowels , and 4 tone marks...

 (65: 0E80–0EFF)
Tibetan
Tibetan script
The Tibetan alphabet is an abugida of Indic origin used to write the Tibetan language as well as the Dzongkha language, Denzongkha, Ladakhi language and sometimes the Balti language. The printed form of the alphabet is called uchen script while the hand-written cursive form used in everyday...

 (205: 0F00–0FFF)
168 168 55 193
Burmese
Burmese alphabet
The Burmese script is an abugida in the Brahmic family used for writing Burmese. Furthermore, various other scripts share some aspect and letters of the Burmese script, though they should not be considered strictly Burmese, including Mon, Shan, S'gaw Karen, Eastern and Western Pwo Karen and Geba...

 (Mayanmar) (160: 1000–109F)
156 78
Georgian
Georgian alphabet
The Georgian alphabet is the writing system used to write the Georgian language and other Kartvelian languages , and occasionally other languages of the Caucasus such as Ossetic and Abkhaz during the 1940s...

 (83: 10A0–10FF)
78 1 78 1 78
Hangul
Hangul
Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean...

 Jamo, Choseong (96: 1100-115F)
91 91 91 95 47
Hangul
Hangul
Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean...

 Jamo, Jungseong (72: 1160-11A7)
67 67 71 37
Hangul
Hangul
Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean...

 Jamo, Jongseong (88: 11A8-11FF)
82 82 82 82 62
Ethiopic (Ge'ez/Ethiopic) (356: 1200–137F) 345 345 345 346
Ethiopic Supplement (26: 1380–139F)

Range

Font  

Range 

13A0-1DBF


Range

Font  

Range 
Cherokee
Cherokee language
Cherokee is an Iroquoian language spoken by the Cherokee people which uses a unique syllabary writing system. It is the only Southern Iroquoian language that remains spoken. Cherokee is a polysynthetic language.-North American etymology:...

 (85: 13A0–13FF)
85
Unified Canadian Aboriginal Syllabics (669: 1400–167F) 29 630 433 659 29
Ogham
Ogham
Ogham is an Early Medieval alphabet used primarily to write the Old Irish language, and occasionally the Brythonic language. Ogham is sometimes called the "Celtic Tree Alphabet", based on a High Medieval Bríatharogam tradition ascribing names of trees to the individual letters.There are roughly...

 (29: 1680–169F)
Runic (81: 16A0–16FF) 1
Tagalog
Baybayin
Baybayin , is a pre-Spanish Philippine writing system. It is a member of the Brahmic family and is recorded as being in use in the 16th century...

 (Baybayin) (20: 1700–171F)
Hanunoo (23: 1720–173F) 2 2
Buhid
Buhid script
Buhid, is an indigenous Brahmic script of the Philippines, closely related to Baybayin, and is used today by the Mangyans to write their language, Buhid.- Unicode :Buhid script was added to the Unicode Standard in March, 2002 with the release of version 3.2....

 (20: 1740–175F)
Tagbanwa
Tagbanwa
Tagbanwa, also known as Apurahuano, is one of the indigenous writing systems of the Philippines. The Tagbanwa language, an Austronesian language, with about 8, 000 speakers in the central and northern regions of Palawan, is dying out as the younger generations of Tagbanua are learning Cuyonon...

 (18: 1760–177F)
Khmer (114: 1780–17FF) 103 1
Mongolian
Mongolian alphabet
Many alphabets have been devised for the Mongolian language over the centuries, and from a variety of scripts. The oldest, called simply the Mongolian script, has been the predominant script during most of Mongolian history, and is still in active use today in the Inner Mongolia region of China...

 (156: 1800–18AF)
155 155
Limbu
Limbu script
The Limbu script is used to write the Limbu language. The Limbu script is an abugida derived from the Tibetan script.-History:According to traditional histories, the Limbu script was first invented in the late 9th century by King Sirijonga Haang, then fell out of use, to be reintroduced in the 18th...

 (66: 1900–194F)
Tai Le
Tai Le script
Tai Le is the name of Tai Nüa script, the script used for the Tai Nüa language, given by Microsoft.-Unicode:Tai Le script was added to the Unicode Standard in April, 2003 with the release of version 4.0....

 (35: 1950–197F)
Tai Lue
Tai Lü language
Tai Lü is a language spoken by about 670,000 people in South East Asia. This includes 250,000 people in China, 200,000 in Burma, 134,000 in Thailand, and 5,000 in Vietnam...

 (New Tai Lu) (83: 1980–19DF)
Khmer Symbols
Khmer script
The Khmer script is an alphasyllabary script used to write the Khmer language . It is also used to write Pali among the Buddhist liturgy of Cambodia and Thailand....

 (32: 19E0–19FF)
Buginese
Buginese language
Buginese is the language spoken by about four million people mainly in the southern part of Sulawesi, Indonesia.-History:The word Buginese derives from the word Bahasa Bugis in Malay. In Buginese, it is called while the Bugis people are called...

 (30: 1A00–1A1F)
Tai Tham (127: 1A20–1AAD)
Balinese (121: 1B00–1B7F)
Sundanese (55: 1B80–1BBF)
Lepcha (74: 1C00–1C4F)
Ol Chiki (48: 1C50–1C7F)
Phonetic Extensions (128: 1D00–1D7F) 17 109 106 109 110 109 22 108 108
Phonetic Extensions Supplement (64: 1D80–1DBF) 38 3

Range

Font  

Range 

1DC0-257F


Range

Font  

Range 
Combining Diacritical Marks Supplement (42: 1DC0–1DFF) 13 2 13 5 6 5 5 26 13 13 13 13
Latin Extended Additional (256: 1E00–1EFF) 247 246 96 88 246 246 246 252 246 8 247 8 246 247 247 247 8
Greek Extended (233: 1F00–1FFF) 232 4
General Punctuation
Punctuation
Punctuation marks are symbols that indicate the structure and organization of written language, as well as intonation and pauses to be observed when reading aloud.In written English, punctuation is vital to disambiguate the meaning of sentences...

 (107: 2000–206F)
54 63 96 65 69 106 74 56 77 74 101 74 77 59 85 67 27 45 25 97 44 54 62 20 91
Superscripts and Subscripts
Unicode subscripts and superscripts
Unicode has subscripted and superscripted versions of a number of characters including a full set of arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.The World Wide Web...

 (34: 2070–209F)
6 28 9 28 29 29 28 29 29 28 1 6 6 6 6 29 6 34
Currency Symbols (25: 20A0-20CF) 22 13 6 16 22 22 15 16 23 22 24 22 2 9 18 12 17 22 4 22 22 22 12 2 18
Combining Diacritical Marks for Symbols (33: 20D0–20FF) 1 18 20 28 5 20 7 5 5 1 2 1 27 1 1 27
Letterlike Symbols
Letterlike Symbols
Letterlike Symbols are graphemes which are constructed mainly from the glyphs of one or more letters.In Unicode, Letterlike Symbols are placed in the block U+2100–214F, as in the following table.-See also:*Mapping of Unicode characters...

 (80: 2100–214F)
9 57 75 13 59 5 59 59 75 5 71 5 10 31 32 57 4 8 10 77 8 8 10 9 75
Number Forms
Number Forms
Number Forms are Unicode characters which have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and roman numerals. They are placed in the Unicode codepoint range 0x2150 through 0x218F , except for three fractions in ISO-8859-1...

 (58: 2150–218F)
7 48 4 49 54 54 49 49 55 54 54 50 50 49 4 10 5 26 50 5 7 28 30 53
Arrows
Arrow (symbol)
An arrow is a graphical symbol such as → or ←, used to point or indicate direction, being in its simplest form a line segment with a triangle affixed to one end, and in more complex forms a representation of an actual arrow...

 (112: 2190–21FF)
7 91 14 100 19 92 100 19 19 34 20 91 8 13 7 20 12
Mathematical Operators
Unicode Mathematical Operators
Unicode ranges mathematical operators and symbols in multiple blocks.* Mathematical Operators * Miscellaneous Mathematical Symbols-A * Miscellaneous Mathematical Symbols-B...

 (256: 2200–22FF)
16 242 246 24 242 21 242 242 21 21 16 82 18 242 26 13 43 14 16 52 51
Miscellaneous Technical Symbols
Miscellaneous Technical (Unicode)
Miscellaneous Technical is the name of a a Unicode block ranging from U+2300 to U+23FF, which contains various common symbols which are related to and used in the various technical, programming language and academic professions....

 (233: 2300–23FF)
4 123 36 57 228 27 4 154 65 27 219 105 27 6 14 14 10 1 5 211 4 7 1 209
Control Pictures (39: 2400–243F) 37 1 2 1 36 1 1 1 37 1 4
Optical Character Recognition
Optical character recognition
Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize a record-keeping...

 (11: 2440–245F)
10
Enclosed Alphanumerics (160: 2460–24FF) 139 73 139 10 10 114 1 82 112 139
Box Drawing
Box drawing characters
Box drawing characters, also known as line drawing characters, or pseudographics, are widely used in text user interfaces to draw various frames and boxes...

 (128: 2500–257F)
40 115 1 116 53 97 40 99 117

Range

Font  

Range 

2580-2DFF


Range

Font  

Range 
Block Elements (32: 2580–259F) 8 22 20 22 22 21 18 8 8 10 20
Geometric Shapes
Unicode Geometric Shapes
Geometric Shapes is a Unicode block of 96 symbols at codepoint range U+25A0-25FF.-U+25A0-U+25CF:-U+25D0-U+25FF:-Font coverage:Only two font sets—Code2000 and the DejaVu family—include coverage for each of the glyphs in the Geometric Shapes range, Unifont also contains all the glyphs...

 (96: 25A0–25FF)
16 80 92 8 88 2 95 88 2 2 1 26 18 79 16 7 34 7 16 38 29
Miscellaneous Symbols
Miscellaneous Symbols
The Miscellaneous Symbols Unicode block contains various glyphs representing things from a variety of categories: Astrological, Astronomical, Chess, Dice, Ideological symbols, Musical notation, Political symbols, Recycling, Religious symbols, Trigrams, Warning signs and Weather.-Tables:Note: These...

 (128: 2600–267F)
12 106 118 31 108 127 128 4 45 14 6 2 24 125 1 12 25 33
Dingbats (Zapf Dingbats) (175: 2700–27BF) 160 6 2 174 2 170 1 174 2 174 174 2 12 14 3 174 174
Miscellaneous Mathematical Symbols-A (28: 27C0–27EF) 3 2 2 2 15 6 1 16 16
Supplemental Arrows-A (32: 27F0–27FF) 8 2 23 2 28 22 2 4 2 12 28
Braille Patterns
Braille
The Braille system is a method that is widely used by blind people to read and write, and was the first digital form of writing.Braille was devised in 1825 by Louis Braille, a blind Frenchman. Each Braille character, or cell, is made up of six dot positions, arranged in a rectangle containing two...

 (256: 2800–28FF)
Supplemental Arrows-B (128: 2900–297F) 6 6 119
Miscellaneous Mathematical Symbols-B (128: 2980–29FF) 2 13 1
Supplemental Mathematical Operators (256: 2A00–2AFF) 10 72 194 10 1
Miscellaneous Symbols and Arrows (Supplemental Symbols and Arrows) (87: 2B00–2BFF) 82 35 49 15
Reserved
Glagolitic
Glagolitic alphabet
The Glagolitic alphabet , also known as Glagolitsa, is the oldest known Slavic alphabet. The name was not coined until many centuries after its creation, and comes from the Old Slavic glagolъ "utterance" . The verb glagoliti means "to speak"...

 (94: 2C00–2C5F)
86
Latin Extended-C (32: 2C60-2C7F) 21 29 12 31 12 29 12 5 17 21 17 21 21
Coptic
Coptic alphabet
The Coptic alphabet is the script used for writing the Coptic language. The repertoire of glyphs is based on the Greek alphabet augmented by letters borrowed from the Demotic and is the first alphabetic script used for the Egyptian language...

 (121: 2C80–2CFF)
114 66 114
Georgian Supplement (38: 2D00–2D2F)
Tifinagh
Tifinagh
Tifinagh is a series of abjad and alphabetic scripts used by some Berber peoples, notably the Tuareg, to write their language.A modern derivate of the traditional script, known as Neo-Tifinagh, was introduced in the 20th century...

 (55: 2D30–2D7F)
Ethiopic Extended (79: 2D80–2DDF)
Cyrillic Extended-A (16: 2DE0–2DFF)

Range

Font  

Range 

2E00-4DBF


Range

Font  

Range 
Supplemental Punctuation (50: 2E00–2E7F) 1 24 49 6 1 22 9 1 1 1 1
CJK Radicals (Supplement)
Radical (Chinese character)
A Chinese radical is a component of a Chinese character. The term may variously refer to the original semantic element of a character, or to any semantic element, or, loosely, to any element whatever its origin or purpose...

 (115: 2E80–2EFF)
Kangxi Radicals (214: 2F00–2FDF) 214
Ideographic Description Characters (16: 2FE0–2FEF) 12 12
CJK Symbols and Punctuation (64: 3000–303F) 57 12 9 40 18 17 31 46 45
Hiragana
Hiragana
is a Japanese syllabary, one basic component of the Japanese writing system, along with katakana, kanji, and the Latin alphabet . Hiragana and katakana are both kana systems, in which each character represents one mora...

 (93: 3040–309F)
90 90 90 86 83 90 93
Katakana
Katakana
is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji, and in some cases the Latin alphabet . The word katakana means "fragmentary kana", as the katakana scripts are derived from components of more complex kanji. Each kana represents one mora...

 (96: 30A0–30FF)
94 94 94 92 86 94 95
Bopomofo
Bopomofo
Zhuyin fuhao , often abbreviated as zhuyin and colloquially called bopomofo, was introduced in the 1910s as the first official phonetic system for transcribing Chinese, especially Mandarin....

 (41: 3100–312F)
40 37 39 40 37
Hangul Compatibility Jamo (94: 3130–318F) 93 1
Kanbun
Kanbun
The Japanese word originally meant "Classical Chinese writings, Chinese classic texts, Classical Chinese literature". This evolved into a Japanese method of reading annotated Classical Chinese in translation . Much Japanese literature was written in literary Chinese using this annotated style...

 (16: 3190–319F)
14 16 14
Bopomofo Extended
Bopomofo
Zhuyin fuhao , often abbreviated as zhuyin and colloquially called bopomofo, was introduced in the 1910s as the first official phonetic system for transcribing Chinese, especially Mandarin....

 (24: 31A0–31BF)
4
CJK Strokes (36: 31C0–31EF)
Katakana Phonetic Extensions (16: 31F0–31FF)
Enclosed CJK Letters and Months (254: 3200–32FF) 202 242 58 1 58 232 206 179
CJK Compatibility (256: 3300–33FF) 249 105 11 80 249 154
CJK Unified Ideographs Extension A (6646: 3400–4DBF) 1 64 180

Range

Font  

Range 

4DC0-FAFF


Range

Font  

Range 
Yijing Hexagram Symbols (70: 4DC0–4DFF) 64 6 6
CJK Unified Ideographs
CJK Unified Ideographs
The Chinese, Japanese and Korean scripts share a common background. In the process called Han unification the common characters were identified, and named "CJK Unified Ideographs"...

 (Han Unification
Han unification
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the so-called CJK languages into a single set of unified characters. Han characters are a common feature of written Chinese , Japanese , Korean , and—at least historically—other...

) (20940: 4E00–9FFF)
79
Yi Syllables
Yi script
The Yi script, also historically known as Cuan Wen or Wei Shu , is used to write the Yi languages.-Classical Yi:Classical Yi is a syllabic logographic system that was reputedly devised during the Tang dynasty by someone called Aki...

 (1024: A000–A3FF)
Yi Radicals (32: A490–A4AF) 0
Lisu
Lisu
The Lisu people are a Tibeto-Burman ethnic group who inhabit the mountainous regions of Burma , Southwest China, Thailand, and the Indian state of Arunachal Pradesh....

(48: A4D0–A4FF)
Vai
Vai
Vai has several possible meanings:* Vaï, Moroccan-French Canadian rapper* Vai people** Vai language** Vai syllabary* Vai * Văi, a village in Lupşa Commune, Alba County, Romania* Steve Vai, guitarist* Steve Vai's band Vai - see Steve Vai...

(300: A500–A63F)
Cyrillic Extended-B (78: A640–A69F) 31
Modifier Tone Letters (32: A700–A71F) 9 1 20 1 1 1 9 4 9 9
Latin Extended D (114: A720-A7FF) 7 28 57 28 81 28 97 3 7 2 7 7
Syloti Nagri (44: A800–A82F)
Phags-pa (56: A840–A87F)
Saurashtra (81: A880–A8DF)
Kayah Li (48: A900–A92F)
Rejang
Rejang script
The Rejang script, sometimes spelt Redjang and locally known as Surat Ulu , is an abugida of the Brahmic family, and is related to other scripts of the region, like Batak, Buginese, and others. Rejang is a member of the closely related group of Surat Ulu scripts that include the script variants of...

 (37: A930–A95F)
Cham (83: AA00–AA5F)
Mayanmar Extended (28: AA60–AA7B)
Tai Viet (72: AA80–AADF)
Meetei Mayek (56: ABC0–ABF9)
Hangul Syllables
Hangul
Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean...

 (11172: AC00–D7AF)
6
High Surrogates (D800–DB7F)
High Private Use Surrogates (DB80–DBFF)
Low Surrogates (DC00–DFFF)
Private Use Area (Private/Corporate Use) (6400: E000–F8FF) 55 43 20 565 148 352 788 372 29 229 40 229 961 276 6 16 47 84 55
CJK Compatibility Ideographs
Ideograph
Ideograph is a term coined by rhetorical scholar and critic Michael Calvin McGee describing the use of particular words and phrases as political language in a way that captures particular ideological positions...

 (470: F900–FAFF)
302 302 437 302 16 466 1 455 363

Range

Font  

Range 

FB00-FFFF


Range

Font  

Range 
Alphabetic Presentation Forms (Latin Lig + Armenian Lig + Hebrew Lig) (58: FB00–FB4F) 48 57 26 53 42 5 52 5 50 5 7 7 32 3 48 2 48 48 53 7
Latin Ligatures (7: FB00–FB06) 2 2 5 5 5 5 5 2 2 2 2 2
Armenian Ligatures (5: FB13–FB17) 1
Hebrew Ligatures / Pointed Letters (46: FB1D–FB4F) 46 45 2 34 42 38 27
Arabic Presentation Forms-A (595: FB50–FDFF) 90 593 62 155 22 128 98 20 44 89 89 187 25
Variation Selectors (16: FE00–FE0F)
Vertical Forms (10: FE10–FE1F) 2
Combining Half Marks (7: FE20–FE2F) 4 4 4 4 4 4 4 4 4 4 4 4
CJK Compatibility Forms (32: FE30–FE4F) 28 28 28 27
Small Form Variants (26: FE50–FE6F) 18 2 25 25
Arabic Presentation Forms-B (140: FE70–FEFE) 85 139 120 139 1 125 1 131 1 1 53 87 85 139
Byte Order Mark (1: FEFF)
Halfwidth and Fullwidth Forms
Halfwidth and Fullwidth Forms
In CJK computing, graphic characters are traditionally classed into fullwidth and halfwidth characters...

 (225: FF00–FFEF)
223 166 171 186 152 209 100 224 157 101 172
Latin Full Width Forms (94: FF01-FF5E) 94
KataKana Half Width Forms (63: FF61-FF9F) 63
Hangul Jamo Half Width Forms (52: FFA0-FFDC) 15 51
Specials (5: FFF0–FFFF) 1 2 1 1 3 1 1 1 1 2 1 2 2 1

Range

Font  
  BMP (Plane 0)

List of SMP Unicode fonts

Of the many Unicode fonts rich with vast amount of SMP plane characters, few ones listed below are the most commonly used by a majority of users around the world on mainstream computing platforms. You must also see the first top font list, as some of them also contains vast amount of SMP plane characters. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.
List of Unicode Fonts with vast amount of SMP Characters
Font Char(s) Glyphs Kernpairs
(Standard)
Range(Point): Hinting/
Smoothing Behavior
Version & Date Filename,
Size
Font Family Font Weight,
style
Font
type
Serif
style
License Creator / Author (Copyright) Notes
Code2001 2,985 3,135 0 v0.919 sfnt rev 0.918945
2008-04-04
CODE2001.TTF
(485 KB)
Code2001 Medium (Normal), Book, Regular TTF  Any Freeware James Kass
New Athena Unicode 2,985 2,033 0 v3.710 sfnt rev 3.70999
2010-05-02
newathu.ttf
(769 KB)
New Athena Unicode Medium (Normal), Book, Regular TTF  Any Freeware. SIL OFL. American Philological Association http://www.apaclassics.org/.
MPH 2B Damase 2,743 2,895 192 v002.010 sfnt rev 1
2005-10-18
damase_v.2.ttf
(879 KB)
MPH 2B Damase Medium (Normal), Book, Regular TTF  Any Freeware. Public Domain 2005. Mark Williamson.

10000-1D7FF

  SMP (Plane 1)
Font  

Range 
Total Glyphs in SMP (5258: 10000–1FFFF) 222 2 304 2 745 2 649 203 994 87 106 210
Linear B Syllabary
Linear B
Linear B is a syllabic script that was used for writing Mycenaean Greek, an early form of Greek. It pre-dated the Greek alphabet by several centuries and seems to have died out with the fall of Mycenaean civilization...

 (88: 10000–1007F)
74
Linear B Ideograms (123: 10080–100FF) 73
Aegean Numbers (64: 10100–1013F) 57
Ancient Greek Numbers (75: 10140–1018F) 5
Ancient Symbols (12: 10190–101CF)
Phaistos Disc
Phaistos Disc
The Phaistos Disc is a disk of fired clay from the Minoan palace of Phaistos on the Greek island of Crete, possibly dating to the middle or late Minoan Bronze Age . It is about 15 cm in diameter and covered on both sides with a spiral of stamped symbols...

 (46: 101D0–101FF)
Lycian (29: 10280–1029F)
Alphabetic and Syllabic LTR scripts (311: 10300–107FF) 62 27 307 35
Old Italic
Old Italic alphabet
Old Italic refers to several now extinct alphabet systems used on the Italian Peninsula in ancient times for various Indo-European languages and non-Indo-European languages...

 (35: 10300–1032F)
Gothic
Gothic language
Gothic is an extinct Germanic language that was spoken by the Goths. It is known primarily from the Codex Argenteus, a 6th-century copy of a 4th-century Bible translation, and is the only East Germanic language with a sizable Text corpus...

 (27: 10330–1034F)
Ugaritic (31: 10380–1039F)
Old Persian
Old Persian language
The Old Persian language is one of the two directly attested Old Iranian languages . Old Persian appears primarily in the inscriptions, clay tablets, and seals of the Achaemenid era...

 (50: 103A0–103DF)
Deseret
Deseret alphabet
The Deseret alphabet is a phonemic English spelling reform developed in the mid-19th century by the board of regents of the University of Deseret under the direction of Brigham Young, second president of The Church of Jesus Christ of Latter-day Saints.In public statements, Young claimed the...

 (80: 10400–1044F)
Shavian (48: 10450–1047F)
Osmanya (40: 10480–104AF)
Cypriot syllabary
Cypriot syllabary
The Cypriot syllabary is a syllabic script used in Iron Age Cyprus, from ca. the 11th to the 4th centuries BCE, when it was replaced by the Greek alphabet. A pioneer of that change was king Evagoras of Salamis...

 (55: 10800–1083F)
Phoenician
Phoenician alphabet
The Phoenician alphabet, called by convention the Proto-Canaanite alphabet for inscriptions older than around 1050 BC, was a non-pictographic consonantal alphabet, or abjad. It was used for the writing of Phoenician, a Northern Semitic language, used by the civilization of Phoenicia...

 (29: 10900-1091F)
27 27
Kharosthi (65: 10A00–10A5F)
Cuneiform
Cuneiform script
Cuneiform script )) is one of the earliest known forms of written expression. Emerging in Sumer around the 30th century BC, with predecessors reaching into the late 4th millennium , cuneiform writing began as a system of pictographs...

 (879: 12000-123FF)
6
Sumero-Akkadian Cuneiform (879: 12000-123FF) 6
Byzantine Musical Symbols (246: 1D000–1D0FF) 33
Musical Symbols (220: 1D100–1D1FF) 44
Ancient Greek Musical Notation (70: 1D200–1D24F)
Tai Xuan Jing Symbols (87: 1D300–1D35F)
Chinese Counting Rod Numerals (18: 1D360–1D371)
Mathematical Alphanumeric Symbols
Mathematical alphanumeric symbols
Mathematical Alphanumeric Symbols is a Unicode block of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles .Unicode now includes many such symbols Mathematical Alphanumeric Symbols is a Unicode block of Latin and Greek...

 (994: 1D400–1D7FF)
13 2 2 2

Range

Font  
  SMP (Plane 1)

List of SIP Unicode fonts

Of the many Unicode fonts rich with vast amount of SIP plane characters, few ones listed below are the most commonly used by a majority of users around the world on mainstream computing platforms. You must also see the first top font list (BMP) and SMP fonts, as some of them also contains vast amount of SIP plane characters. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.
List of Unicode Fonts with vast amount of SIP Characters
Font Char(s) Glyphs Kernpairs
(Standard)
Range(Point): Hinting/
Smoothing Behavior
Version & Date Filename,
Size
Font Family Font Weight,
style
Font
type
Serif
style
License Creator / Author (Copyright) Notes
Code2002 20,419 30,469 0 v0.917 sfnt rev 0.916992
2005-04-04
CODE2002.TTF
(4.09 MB)
Code2002 Medium (Normal), Book, Regular TTF  Normal Sans Freeware James Kass

20000-2FFFF

  SIP (Plane 2)
Font  

Range 
Total Glyphs in SIP (47624: 20000–2FFFF) 12 543 387
CJK Unified Ideographs Extension B (42711: 20000-2A6D6) 12 1 342
CJK Unified Ideographs Extension C (4149: 2A700-2B734)
CJK Unified Ideographs Extension D (222: 2B740-2B81D)
CJK Compatibility Ideographs Supplement (542: 2F800-2FA1F) 44
  SIP (Plane 2)

E0000-EFFFF

  SSP (Plane 14; 0Ehexadecimal)
Font  

Range 
Total Glyphs in SSP (337: E0000–EFFFF) 12 16
Tag Characters (97: E0000–E007F) 12
  SSP (Plane 14; 0Ehexadecimal)

F0000-FFFFF

  PUA-A (Plane 15; 0Fhexadecimal)
Font  

Range 
Total Glyphs in PUA-A (65534: F0000–FFFFF) 462 292 73
  PUA-A (Plane 15; 0Fhexadecimal)

100000-10FFFF

  PUA-B (Plane 16; 10hexadecimal)
Font  

Range 
Total Glyphs in PUA-B (65534: 100000–10FFFF)
  PUA-B (Plane 16; 10hexadecimal)
Unicode Fonts

See also

  • Free software Unicode typefaces
    Free software Unicode typefaces
    A few projects exist to provide free and open-source Unicode typefaces, i.e. Unicode typefaces which are open-source and designed to contain glyphs of all Unicode characters. However there are also numerous projects aimed at providing only a certain script, such as the Arabeyes Arabic font...

  • Unicode fallback font
  • Comparison of Unicode encodings
    Comparison of Unicode encodings
    This article compares Unicode encodings. Two situations are considered: 8-bit-clean environments and environments that forbid use of byte values that have the high bit set. Originally such prohibitions were to allow for links that used only seven data bits, but they remain in the standards and so...

  • List of typefaces
  • Typographic unit
    Typographic unit
    Typographic units are the units of measurement used in typography or typesetting. The traditional units are different from common metric units, as they were established earlier. Even though these units are all very small, across a line of print they add up quickly...

  • Foundries
    Type foundry
    A type foundry is a company that designs or distributes typefaces. Originally, type foundries manufactured and sold metal and wood typefaces and matrices for line-casting machines like the Linotype and Monotype machines designed to be printed on letterpress printers...

  • HTML
    HTML
    HyperText Markup Language is the predominant markup language for web pages. HTML elements are the basic building-blocks of webpages....

  • Calligraphy
    Calligraphy
    Calligraphy is a type of visual art. It is often called the art of fancy lettering . A contemporary definition of calligraphic practice is "the art of giving form to signs in an expressive, harmonious and skillful manner"...

  • Font-management program
    Font-management program
    Font management software is utility software that computer users use to browse and preview fonts and typically to install and uninstall fonts. In addition, depending on the individual font management program's capabilities, users can also...

  • List of CJK fonts


  • Unicode Consortium
    Unicode Consortium
    The Unicode Consortium is a non-profit organization that coordinates the development of the Unicode standard. Its stated goal is to eventually replace existing character encoding schemes with Unicode and its standard Unicode Transformation Format schemes, claiming that many of the existing...

  • Alt codes
    Alt codes
    On personal computers running the Microsoft Windows or DOS operating systems, additional characters to those available in the current keyboard layout can be typed using an Alt code: pressing and holding the Alt key while entering a character code with the keyboard's numeric keypad...

  • Input method editor
    Input method editor
    An input method is an operating system component or program that allows any data, such as keyboard strokes or mouse movements, to be received as input. In this way users can enter characters and symbols not found on their input devices...

     (IME)
  • Chinese input methods for computers
    Chinese input methods for computers
    Hundreds of Chinese input methods are available for entry of Chinese characters into computers, but most keyboard-based methods rely on either pinyin phonetic readings or root shapes in Chinese characters...

  • Korean language and computers
    Korean language and computers
    This article addresses how computers are used to read and write Korean, using Hangul.-Character encodings:In RFC 1557, a method known as ISO-2022-KR for a 7-bit encoding of Korean characters in email was described.  Where 8 bits are allowed, the EUC-KR encoding is preferred.  These two...

  • Japanese input methods
    Japanese input methods
    Japanese input methods are the methods used to input Japanese characters on a computer.There are two main methods of inputting Japanese on computers. One is via a romanized version of Japanese called rōmaji , and the other is via keyboard keys corresponding to the Japanese kana...

  • List of input methods for UNIX platforms
  • Enabling complex text support for Indic scripts
  • Keyboard layout
    Keyboard layout
    A keyboard layout is any specific mechanical, visual, or functional arrangement of the keys, legends, or key–meaning associations of a computer, typewriter, or other typographic keyboard....

  • Code page
    Code page
    Code page is another term for character encoding. It consists of a table of values that describes the character set for a particular language. The term code page originated from IBM's EBCDIC-based mainframe systems, but many vendors use this term including Microsoft, SAP, and Oracle Corporation...


External links

The source of this article is wikipedia, the free encyclopedia.  The text of this article is licensed under the GFDL.
 
x
OK