Unicode blocks in ConTeXt

From Wiki
Revision as of 22:20, 24 October 2017 by Nyraghu (talk | contribs)
Jump to navigation Jump to search

A Unicode block is an interval of code points which represent characters that are semantically related to each other. For example, there is a Unicode block for characters from the Devanagari script which is used by several Indian languages. Another Unicode block corresponds to characters which denote mathematical operators, such as those that indicate the union and the intersection of sets.

ConTeXt has special names for all Unicode blocks. These names can be used to specify ranges of code points in the setups of several commands.

Unicode blocks

A Unicode block is an organisational unit of the Unicode code space. The Unicode code space is the set of all integers from 0 to 0x10FFF.

Every block is an interval of code points. Different blocks are disjoint from each other. In particular, the blocks form a partition of the set of all Unicode code points. Every block has a unique name that describes the common semantic nature of its code points.

A code block starts at a code point that is a multiple of 16. The number of code points in each block is also a multiple of 16. Thus, the first code point in a block is of the form 0xpqrs0, and the last code point in it is of the form 0xtuvwF.

The number of code points in a block varies. Some have just 16 code points, and some others have thousands of code points.

ConTeXt names for Unicode blocks

ConTeXt has its own names for all the Unicode blocks. These names are defined in the source file char-ini.lua. Most of them are obtained by converting the Unicode name of every blocks to the lower case, and removing the spaces between words in the name.

Table of blocks

Block Unicode name ConTeXt name Chart
0000–007F Basic Latin U0000.pdf
0080–00FF Latin-1 Supplement U0080.pdf
0100–017F Latin Extended-A U0100.pdf
0180–024F Latin Extended-B U0180.pdf
0250–02AF IPA Extensions U0250.pdf
02B0–02FF Spacing Modifier Letters U02B0.pdf
0300–036F Combining Diacritical Marks U0300.pdf
0370–03FF Greek and Coptic U0370.pdf
0400–04FF Cyrillic U0400.pdf
0500–052F Cyrillic Supplement U0500.pdf
0530–058F Armenian U0530.pdf
0590–05FF Hebrew U0590.pdf
0600–06FF Arabic U0600.pdf
0700–074F Syriac U0700.pdf
0750–077F Arabic Supplement U0750.pdf
0780–07BF Thaana U0780.pdf
07C0–07FF NKo U07C0.pdf
0800–083F Samaritan U0800.pdf
0840–085F Mandaic U0840.pdf
0860–086F Syriac Supplement U0860.pdf
08A0–08FF Arabic Extended-A U08A0.pdf
0900–097F Devanagari U0900.pdf
0980–09FF Bengali U0980.pdf
0A00–0A7F Gurmukhi U0A00.pdf
0A80–0AFF Gujarati U0A80.pdf
0B00–0B7F Oriya U0B00.pdf
0B80–0BFF Tamil U0B80.pdf
0C00–0C7F Telugu U0C00.pdf
0C80–0CFF Kannada U0C80.pdf
0D00–0D7F Malayalam U0D00.pdf
0D80–0DFF Sinhala U0D80.pdf
0E00–0E7F Thai U0E00.pdf
0E80–0EFF Lao U0E80.pdf
0F00–0FFF Tibetan U0F00.pdf
1000–109F Myanmar U1000.pdf
10A0–10FF Georgian U10A0.pdf
1100–11FF Hangul Jamo U1100.pdf
1200–137F Ethiopic U1200.pdf
1380–139F Ethiopic Supplement U1380.pdf
13A0–13FF Cherokee U13A0.pdf
1400–167F Unified Canadian Aboriginal Syllabics U1400.pdf
1680–169F Ogham U1680.pdf
16A0–16FF Runic U16A0.pdf
1700–171F Tagalog U1700.pdf
1720–173F Hanunoo U1720.pdf
1740–175F Buhid U1740.pdf
1760–177F Tagbanwa U1760.pdf
1780–17FF Khmer U1780.pdf
1800–18AF Mongolian U1800.pdf
18B0–18FF Unified Canadian Aboriginal Syllabics Extended U18B0.pdf
1900–194F Limbu U1900.pdf
1950–197F Tai Le U1950.pdf
1980–19DF New Tai Lue U1980.pdf
19E0–19FF Khmer Symbols U19E0.pdf
1A00–1A1F Buginese U1A00.pdf
1A20–1AAF Tai Tham U1A20.pdf
1AB0–1AFF Combining Diacritical Marks Extended U1AB0.pdf
1B00–1B7F Balinese U1B00.pdf
1B80–1BBF Sundanese U1B80.pdf
1BC0–1BFF Batak U1BC0.pdf
1C00–1C4F Lepcha U1C00.pdf
1C50–1C7F Ol Chiki U1C50.pdf
1C80–1C8F Cyrillic Extended-C U1C80.pdf
1CC0–1CCF Sundanese Supplement U1CC0.pdf
1CD0–1CFF Vedic Extensions U1CD0.pdf
1D00–1D7F Phonetic Extensions U1D00.pdf
1D80–1DBF Phonetic Extensions Supplement U1D80.pdf
1DC0–1DFF Combining Diacritical Marks Supplement U1DC0.pdf
1E00–1EFF Latin Extended Additional U1E00.pdf
1F00–1FFF Greek Extended U1F00.pdf
2000–206F General Punctuation U2000.pdf
2070–209F Superscripts and Subscripts U2070.pdf
20A0–20CF Currency Symbols U20A0.pdf
20D0–20FF Combining Diacritical Marks for Symbols U20D0.pdf
2100–214F Letterlike Symbols U2100.pdf
2150–218F Number Forms U2150.pdf
2190–21FF Arrows U2190.pdf
2200–22FF Mathematical Operators U2200.pdf
2300–23FF Miscellaneous Technical U2300.pdf
2400–243F Control Pictures U2400.pdf
2440–245F Optical Character Recognition U2440.pdf
2460–24FF Enclosed Alphanumerics U2460.pdf
2500–257F Box Drawing U2500.pdf
2580–259F Block Elements U2580.pdf
25A0–25FF Geometric Shapes U25A0.pdf
2600–26FF Miscellaneous Symbols U2600.pdf
2700–27BF Dingbats U2700.pdf
27C0–27EF Miscellaneous Mathematical Symbols-A U27C0.pdf
27F0–27FF Supplemental Arrows-A U27F0.pdf
2800–28FF Braille Patterns U2800.pdf
2900–297F Supplemental Arrows-B U2900.pdf
2980–29FF Miscellaneous Mathematical Symbols-B U2980.pdf
2A00–2AFF Supplemental Mathematical Operators U2A00.pdf
2B00–2BFF Miscellaneous Symbols and Arrows U2B00.pdf
2C00–2C5F Glagolitic U2C00.pdf
2C60–2C7F Latin Extended-C U2C60.pdf
2C80–2CFF Coptic U2C80.pdf
2D00–2D2F Georgian Supplement U2D00.pdf
2D30–2D7F Tifinagh U2D30.pdf
2D80–2DDF Ethiopic Extended U2D80.pdf
2DE0–2DFF Cyrillic Extended-A U2DE0.pdf
2E00–2E7F Supplemental Punctuation U2E00.pdf
2E80–2EFF CJK Radicals Supplement U2E80.pdf
2F00–2FDF Kangxi Radicals U2F00.pdf
2FF0–2FFF Ideographic Description Characters U2FF0.pdf
3000–303F CJK Symbols and Punctuation U3000.pdf
3040–309F Hiragana U3040.pdf
30A0–30FF Katakana U30A0.pdf
3100–312F Bopomofo U3100.pdf
3130–318F Hangul Compatibility Jamo U3130.pdf
3190–319F Kanbun U3190.pdf
31A0–31BF Bopomofo Extended U31A0.pdf
31C0–31EF CJK Strokes U31C0.pdf
31F0–31FF Katakana Phonetic Extensions U31F0.pdf
3200–32FF Enclosed CJK Letters and Months U3200.pdf
3300–33FF CJK Compatibility U3300.pdf
3400–4DBF CJK Unified Ideographs Extension A U3400.pdf
4DC0–4DFF Yijing Hexagram Symbols U4DC0.pdf
4E00–9FFF CJK Unified Ideographs U4E00.pdf
A000–A48F Yi Syllables UA000.pdf
A490–A4CF Yi Radicals UA490.pdf
A4D0–A4FF Lisu UA4D0.pdf
A500–A63F Vai UA500.pdf
A640–A69F Cyrillic Extended-B UA640.pdf
A6A0–A6FF Bamum UA6A0.pdf
A700–A71F Modifier Tone Letters UA700.pdf
A720–A7FF Latin Extended-D UA720.pdf
A800–A82F Syloti Nagri UA800.pdf
A830–A83F Common Indic Number Forms UA830.pdf
A840–A87F Phags-pa UA840.pdf
A880–A8DF Saurashtra UA880.pdf
A8E0–A8FF Devanagari Extended UA8E0.pdf
A900–A92F Kayah Li UA900.pdf
A930–A95F Rejang UA930.pdf
A960–A97F Hangul Jamo Extended-A UA960.pdf
A980–A9DF Javanese UA980.pdf
A9E0–A9FF Myanmar Extended-B UA9E0.pdf
AA00–AA5F Cham UAA00.pdf
AA60–AA7F Myanmar Extended-A UAA60.pdf
AA80–AADF Tai Viet UAA80.pdf
AAE0–AAFF Meetei Mayek Extensions UAAE0.pdf
AB00–AB2F Ethiopic Extended-A UAB00.pdf
AB30–AB6F Latin Extended-E UAB30.pdf
AB70–ABBF Cherokee Supplement UAB70.pdf
ABC0–ABFF Meetei Mayek UABC0.pdf
AC00–D7AF Hangul Syllables UAC00.pdf
D7B0–D7FF Hangul Jamo Extended-B UD7B0.pdf
D800–DB7F High Surrogates UD800.pdf
DB80–DBFF High Private Use Surrogates UDB80.pdf
DC00–DFFF Low Surrogates UDC00.pdf
E000–F8FF Private Use Area UE000.pdf
F900–FAFF CJK Compatibility Ideographs UF900.pdf
FB00–FB4F Alphabetic Presentation Forms UFB00.pdf
FB50–FDFF Arabic Presentation Forms-A UFB50.pdf
FE00–FE0F Variation Selectors UFE00.pdf
FE10–FE1F Vertical Forms UFE10.pdf
FE20–FE2F Combining Half Marks UFE20.pdf
FE30–FE4F CJK Compatibility Forms UFE30.pdf
FE50–FE6F Small Form Variants UFE50.pdf
FE70–FEFF Arabic Presentation Forms-B UFE70.pdf
FF00–FFEF Halfwidth and Fullwidth Forms UFF00.pdf
FFF0–FFFF Specials UFFF0.pdf
10000–1007F Linear B Syllabary U10000.pdf
10080–100FF Linear B Ideograms U10080.pdf
10100–1013F Aegean Numbers U10100.pdf
10140–1018F Ancient Greek Numbers U10140.pdf
10190–101CF Ancient Symbols U10190.pdf
101D0–101FF Phaistos Disc U101D0.pdf
10280–1029F Lycian U10280.pdf
102A0–102DF Carian U102A0.pdf
102E0–102FF Coptic Epact Numbers U102E0.pdf
10300–1032F Old Italic U10300.pdf
10330–1034F Gothic U10330.pdf
10350–1037F Old Permic U10350.pdf
10380–1039F Ugaritic U10380.pdf
103A0–103DF Old Persian U103A0.pdf
10400–1044F Deseret U10400.pdf
10450–1047F Shavian U10450.pdf
10480–104AF Osmanya U10480.pdf
104B0–104FF Osage U104B0.pdf
10500–1052F Elbasan U10500.pdf
10530–1056F Caucasian Albanian U10530.pdf
10600–1077F Linear A U10600.pdf
10800–1083F Cypriot Syllabary U10800.pdf
10840–1085F Imperial Aramaic U10840.pdf
10860–1087F Palmyrene U10860.pdf
10880–108AF Nabataean U10880.pdf
108E0–108FF Hatran U108E0.pdf
10900–1091F Phoenician U10900.pdf
10920–1093F Lydian U10920.pdf
10980–1099F Meroitic Hieroglyphs U10980.pdf
109A0–109FF Meroitic Cursive U109A0.pdf
10A00–10A5F Kharoshthi U10A00.pdf
10A60–10A7F Old South Arabian U10A60.pdf
10A80–10A9F Old North Arabian U10A80.pdf
10AC0–10AFF Manichaean U10AC0.pdf
10B00–10B3F Avestan U10B00.pdf
10B40–10B5F Inscriptional Parthian U10B40.pdf
10B60–10B7F Inscriptional Pahlavi U10B60.pdf
10B80–10BAF Psalter Pahlavi U10B80.pdf
10C00–10C4F Old Turkic U10C00.pdf
10C80–10CFF Old Hungarian U10C80.pdf
10E60–10E7F Rumi Numeral Symbols U10E60.pdf
11000–1107F Brahmi U11000.pdf
11080–110CF Kaithi U11080.pdf
110D0–110FF Sora Sompeng U110D0.pdf
11100–1114F Chakma U11100.pdf
11150–1117F Mahajani U11150.pdf
11180–111DF Sharada U11180.pdf
111E0–111FF Sinhala Archaic Numbers U111E0.pdf
11200–1124F Khojki U11200.pdf
11280–112AF Multani U11280.pdf
112B0–112FF Khudawadi U112B0.pdf
11300–1137F Grantha U11300.pdf
11400–1147F Newa U11400.pdf
11480–114DF Tirhuta U11480.pdf
11580–115FF Siddham U11580.pdf
11600–1165F Modi U11600.pdf
11660–1167F Mongolian Supplement U11660.pdf
11680–116CF Takri U11680.pdf
11700–1173F Ahom U11700.pdf
118A0–118FF Warang Citi U118A0.pdf
11A00–11A4F Zanabazar Square U11A00.pdf
11A50–11AAF Soyombo U11A50.pdf
11AC0–11AFF Pau Cin Hau U11AC0.pdf
11C00–11C6F Bhaiksuki U11C00.pdf
11C70–11CBF Marchen U11C70.pdf
11D00–11D5F Masaram Gondi U11D00.pdf
12000–123FF Cuneiform U12000.pdf
12400–1247F Cuneiform Numbers and Punctuation U12400.pdf
12480–1254F Early Dynastic Cuneiform U12480.pdf
13000–1342F Egyptian Hieroglyphs U13000.pdf
14400–1467F Anatolian Hieroglyphs U14400.pdf
16800–16A3F Bamum Supplement U16800.pdf
16A40–16A6F Mro U16A40.pdf
16AD0–16AFF Bassa Vah U16AD0.pdf
16B00–16B8F Pahawh Hmong U16B00.pdf
16F00–16F9F Miao U16F00.pdf
16FE0–16FFF Ideographic Symbols and Punctuation U16FE0.pdf
17000–187FF Tangut U17000.pdf
18800–18AFF Tangut Components U18800.pdf
1B000–1B0FF Kana Supplement U1B000.pdf
1B100–1B12F Kana Extended-A U1B100.pdf
1B170–1B2FF Nushu U1B170.pdf
1BC00–1BC9F Duployan U1BC00.pdf
1BCA0–1BCAF Shorthand Format Controls U1BCA0.pdf
1D000–1D0FF Byzantine Musical Symbols U1D000.pdf
1D100–1D1FF Musical Symbols U1D100.pdf
1D200–1D24F Ancient Greek Musical Notation U1D200.pdf
1D300–1D35F Tai Xuan Jing Symbols U1D300.pdf
1D360–1D37F Counting Rod Numerals U1D360.pdf
1D400–1D7FF Mathematical Alphanumeric Symbols U1D400.pdf
1D800–1DAAF Sutton SignWriting U1D800.pdf
1E000–1E02F Glagolitic Supplement U1E000.pdf
1E800–1E8DF Mende Kikakui U1E800.pdf
1E900–1E95F Adlam U1E900.pdf
1EE00–1EEFF Arabic Mathematical Alphabetic Symbols U1EE00.pdf
1F000–1F02F Mahjong Tiles U1F000.pdf
1F030–1F09F Domino Tiles U1F030.pdf
1F0A0–1F0FF Playing Cards U1F0A0.pdf
1F100–1F1FF Enclosed Alphanumeric Supplement U1F100.pdf
1F200–1F2FF Enclosed Ideographic Supplement U1F200.pdf
1F300–1F5FF Miscellaneous Symbols and Pictographs U1F300.pdf
1F600–1F64F Emoticons U1F600.pdf
1F650–1F67F Ornamental Dingbats U1F650.pdf
1F680–1F6FF Transport and Map Symbols U1F680.pdf
1F700–1F77F Alchemical Symbols U1F700.pdf
1F780–1F7FF Geometric Shapes Extended U1F780.pdf
1F800–1F8FF Supplemental Arrows-C U1F800.pdf
1F900–1F9FF Supplemental Symbols and Pictographs U1F900.pdf
20000–2A6DF CJK Unified Ideographs Extension B U20000.pdf
2A700–2B73F CJK Unified Ideographs Extension C U2A700.pdf
2B740–2B81F CJK Unified Ideographs Extension D U2B740.pdf
2B820–2CEAF CJK Unified Ideographs Extension E U2B820.pdf
2CEB0–2EBEF CJK Unified Ideographs Extension F U2CEB0.pdf
2F800–2FA1F CJK Compatibility Ideographs Supplement U2F800.pdf
E0000–E007F Tags UE0000.pdf
E0100–E01EF Variation Selectors Supplement UE0100.pdf
F0000–FFFFF Supplementary Private Use Area-A UF0000.pdf
100000–10FFFF Supplementary Private Use Area-B U100000.pdf