Open-source image
open-source
Hime logo

Table of content

Unicode blocks

This page summarizes the supported Unicode blocks. To refer to a block in a lexical rule, use the construct ub{NAME}.

In the table, the Start and End column are the bounds (included) of the corresponding block. They are Unicode code points. See Unicode blocks.

Block NameStartEnd
BasicLatinU+0000U+007F
Latin-1SupplementU+0080U+00FF
LatinExtended-AU+0100U+017F
LatinExtended-BU+0180U+024F
IPAExtensionsU+0250U+02AF
SpacingModifierLettersU+02B0U+02FF
CombiningDiacriticalMarksU+0300U+036F
GreekandCopticU+0370U+03FF
CyrillicU+0400U+04FF
CyrillicSupplementU+0500U+052F
ArmenianU+0530U+058F
HebrewU+0590U+05FF
ArabicU+0600U+06FF
SyriacU+0700U+074F
ArabicSupplementU+0750U+077F
ThaanaU+0780U+07BF
NKoU+07C0U+07FF
SamaritanU+0800U+083F
MandaicU+0840U+085F
SyriacSupplementU+0860U+086F
ArabicExtended-AU+08A0U+08FF
DevanagariU+0900U+097F
BengaliU+0980U+09FF
GurmukhiU+0A00U+0A7F
GujaratiU+0A80U+0AFF
OriyaU+0B00U+0B7F
TamilU+0B80U+0BFF
TeluguU+0C00U+0C7F
KannadaU+0C80U+0CFF
MalayalamU+0D00U+0D7F
SinhalaU+0D80U+0DFF
ThaiU+0E00U+0E7F
LaoU+0E80U+0EFF
TibetanU+0F00U+0FFF
MyanmarU+1000U+109F
GeorgianU+10A0U+10FF
HangulJamoU+1100U+11FF
EthiopicU+1200U+137F
EthiopicSupplementU+1380U+139F
CherokeeU+13A0U+13FF
UnifiedCanadianAboriginalSyllabicsU+1400U+167F
OghamU+1680U+169F
RunicU+16A0U+16FF
TagalogU+1700U+171F
HanunooU+1720U+173F
BuhidU+1740U+175F
TagbanwaU+1760U+177F
KhmerU+1780U+17FF
MongolianU+1800U+18AF
UnifiedCanadianAboriginalSyllabicsExtendedU+18B0U+18FF
LimbuU+1900U+194F
TaiLeU+1950U+197F
NewTaiLueU+1980U+19DF
KhmerSymbolsU+19E0U+19FF
BugineseU+1A00U+1A1F
TaiThamU+1A20U+1AAF
CombiningDiacriticalMarksExtendedU+1AB0U+1AFF
BalineseU+1B00U+1B7F
SundaneseU+1B80U+1BBF
BatakU+1BC0U+1BFF
LepchaU+1C00U+1C4F
OlChikiU+1C50U+1C7F
CyrillicExtended-CU+1C80U+1C8F
SundaneseSupplementU+1CC0U+1CCF
VedicExtensionsU+1CD0U+1CFF
PhoneticExtensionsU+1D00U+1D7F
PhoneticExtensionsSupplementU+1D80U+1DBF
CombiningDiacriticalMarksSupplementU+1DC0U+1DFF
LatinExtendedAdditionalU+1E00U+1EFF
GreekExtendedU+1F00U+1FFF
GeneralPunctuationU+2000U+206F
SuperscriptsandSubscriptsU+2070U+209F
CurrencySymbolsU+20A0U+20CF
CombiningDiacriticalMarksforSymbolsU+20D0U+20FF
LetterlikeSymbolsU+2100U+214F
NumberFormsU+2150U+218F
ArrowsU+2190U+21FF
MathematicalOperatorsU+2200U+22FF
MiscellaneousTechnicalU+2300U+23FF
ControlPicturesU+2400U+243F
OpticalCharacterRecognitionU+2440U+245F
EnclosedAlphanumericsU+2460U+24FF
BoxDrawingU+2500U+257F
BlockElementsU+2580U+259F
GeometricShapesU+25A0U+25FF
MiscellaneousSymbolsU+2600U+26FF
DingbatsU+2700U+27BF
MiscellaneousMathematicalSymbols-AU+27C0U+27EF
SupplementalArrows-AU+27F0U+27FF
BraillePatternsU+2800U+28FF
SupplementalArrows-BU+2900U+297F
MiscellaneousMathematicalSymbols-BU+2980U+29FF
SupplementalMathematicalOperatorsU+2A00U+2AFF
MiscellaneousSymbolsandArrowsU+2B00U+2BFF
GlagoliticU+2C00U+2C5F
LatinExtended-CU+2C60U+2C7F
CopticU+2C80U+2CFF
GeorgianSupplementU+2D00U+2D2F
TifinaghU+2D30U+2D7F
EthiopicExtendedU+2D80U+2DDF
CyrillicExtended-AU+2DE0U+2DFF
SupplementalPunctuationU+2E00U+2E7F
CJKRadicalsSupplementU+2E80U+2EFF
KangxiRadicalsU+2F00U+2FDF
IdeographicDescriptionCharactersU+2FF0U+2FFF
CJKSymbolsandPunctuationU+3000U+303F
HiraganaU+3040U+309F
KatakanaU+30A0U+30FF
BopomofoU+3100U+312F
HangulCompatibilityJamoU+3130U+318F
KanbunU+3190U+319F
BopomofoExtendedU+31A0U+31BF
CJKStrokesU+31C0U+31EF
KatakanaPhoneticExtensionsU+31F0U+31FF
EnclosedCJKLettersandMonthsU+3200U+32FF
CJKCompatibilityU+3300U+33FF
CJKUnifiedIdeographsExtensionAU+3400U+4DBF
YijingHexagramSymbolsU+4DC0U+4DFF
CJKUnifiedIdeographsU+4E00U+9FFF
YiSyllablesU+A000U+A48F
YiRadicalsU+A490U+A4CF
LisuU+A4D0U+A4FF
VaiU+A500U+A63F
CyrillicExtended-BU+A640U+A69F
BamumU+A6A0U+A6FF
ModifierToneLettersU+A700U+A71F
LatinExtended-DU+A720U+A7FF
SylotiNagriU+A800U+A82F
CommonIndicNumberFormsU+A830U+A83F
Phags-paU+A840U+A87F
SaurashtraU+A880U+A8DF
DevanagariExtendedU+A8E0U+A8FF
KayahLiU+A900U+A92F
RejangU+A930U+A95F
HangulJamoExtended-AU+A960U+A97F
JavaneseU+A980U+A9DF
MyanmarExtended-BU+A9E0U+A9FF
ChamU+AA00U+AA5F
MyanmarExtended-AU+AA60U+AA7F
TaiVietU+AA80U+AADF
MeeteiMayekExtensionsU+AAE0U+AAFF
EthiopicExtended-AU+AB00U+AB2F
LatinExtended-EU+AB30U+AB6F
CherokeeSupplementU+AB70U+ABBF
MeeteiMayekU+ABC0U+ABFF
HangulSyllablesU+AC00U+D7AF
HangulJamoExtended-BU+D7B0U+D7FF
PrivateUseAreaU+E000U+F8FF
CJKCompatibilityIdeographsU+F900U+FAFF
AlphabeticPresentationFormsU+FB00U+FB4F
ArabicPresentationForms-AU+FB50U+FDFF
VariationSelectorsU+FE00U+FE0F
VerticalFormsU+FE10U+FE1F
CombiningHalfMarksU+FE20U+FE2F
CJKCompatibilityFormsU+FE30U+FE4F
SmallFormVariantsU+FE50U+FE6F
ArabicPresentationForms-BU+FE70U+FEFF
HalfwidthandFullwidthFormsU+FF00U+FFEF
SpecialsU+FFF0U+FFFF
LinearBSyllabaryU+00010000U+0001007F
LinearBIdeogramsU+00010080U+000100FF
AegeanNumbersU+00010100U+0001013F
AncientGreekNumbersU+00010140U+0001018F
AncientSymbolsU+00010190U+000101CF
PhaistosDiscU+000101D0U+000101FF
LycianU+00010280U+0001029F
CarianU+000102A0U+000102DF
CopticEpactNumbersU+000102E0U+000102FF
OldItalicU+00010300U+0001032F
GothicU+00010330U+0001034F
OldPermicU+00010350U+0001037F
UgariticU+00010380U+0001039F
OldPersianU+000103A0U+000103DF
DeseretU+00010400U+0001044F
ShavianU+00010450U+0001047F
OsmanyaU+00010480U+000104AF
OsageU+000104B0U+000104FF
ElbasanU+00010500U+0001052F
CaucasianAlbanianU+00010530U+0001056F
LinearAU+00010600U+0001077F
CypriotSyllabaryU+00010800U+0001083F
ImperialAramaicU+00010840U+0001085F
PalmyreneU+00010860U+0001087F
NabataeanU+00010880U+000108AF
HatranU+000108E0U+000108FF
PhoenicianU+00010900U+0001091F
LydianU+00010920U+0001093F
MeroiticHieroglyphsU+00010980U+0001099F
MeroiticCursiveU+000109A0U+000109FF
KharoshthiU+00010A00U+00010A5F
OldSouthArabianU+00010A60U+00010A7F
OldNorthArabianU+00010A80U+00010A9F
ManichaeanU+00010AC0U+00010AFF
AvestanU+00010B00U+00010B3F
InscriptionalParthianU+00010B40U+00010B5F
InscriptionalPahlaviU+00010B60U+00010B7F
PsalterPahlaviU+00010B80U+00010BAF
OldTurkicU+00010C00U+00010C4F
OldHungarianU+00010C80U+00010CFF
RumiNumeralSymbolsU+00010E60U+00010E7F
BrahmiU+00011000U+0001107F
KaithiU+00011080U+000110CF
SoraSompengU+000110D0U+000110FF
ChakmaU+00011100U+0001114F
MahajaniU+00011150U+0001117F
SharadaU+00011180U+000111DF
SinhalaArchaicNumbersU+000111E0U+000111FF
KhojkiU+00011200U+0001124F
MultaniU+00011280U+000112AF
KhudawadiU+000112B0U+000112FF
GranthaU+00011300U+0001137F
NewaU+00011400U+0001147F
TirhutaU+00011480U+000114DF
SiddhamU+00011580U+000115FF
ModiU+00011600U+0001165F
MongolianSupplementU+00011660U+0001167F
TakriU+00011680U+000116CF
AhomU+00011700U+0001173F
WarangCitiU+000118A0U+000118FF
ZanabazarSquareU+00011A00U+00011A4F
SoyomboU+00011A50U+00011AAF
PauCinHauU+00011AC0U+00011AFF
BhaiksukiU+00011C00U+00011C6F
MarchenU+00011C70U+00011CBF
MasaramGondiU+00011D00U+00011D5F
CuneiformU+00012000U+000123FF
CuneiformNumbersandPunctuationU+00012400U+0001247F
EarlyDynasticCuneiformU+00012480U+0001254F
EgyptianHieroglyphsU+00013000U+0001342F
AnatolianHieroglyphsU+00014400U+0001467F
BamumSupplementU+00016800U+00016A3F
MroU+00016A40U+00016A6F
BassaVahU+00016AD0U+00016AFF
PahawhHmongU+00016B00U+00016B8F
MiaoU+00016F00U+00016F9F
IdeographicSymbolsandPunctuationU+00016FE0U+00016FFF
TangutU+00017000U+000187FF
TangutComponentsU+00018800U+00018AFF
KanaSupplementU+0001B000U+0001B0FF
KanaExtended-AU+0001B100U+0001B12F
NushuU+0001B170U+0001B2FF
DuployanU+0001BC00U+0001BC9F
ShorthandFormatControlsU+0001BCA0U+0001BCAF
ByzantineMusicalSymbolsU+0001D000U+0001D0FF
MusicalSymbolsU+0001D100U+0001D1FF
AncientGreekMusicalNotationU+0001D200U+0001D24F
TaiXuanJingSymbolsU+0001D300U+0001D35F
CountingRodNumeralsU+0001D360U+0001D37F
MathematicalAlphanumericSymbolsU+0001D400U+0001D7FF
SuttonSignWritingU+0001D800U+0001DAAF
GlagoliticSupplementU+0001E000U+0001E02F
MendeKikakuiU+0001E800U+0001E8DF
AdlamU+0001E900U+0001E95F
ArabicMathematicalAlphabeticSymbolsU+0001EE00U+0001EEFF
MahjongTilesU+0001F000U+0001F02F
DominoTilesU+0001F030U+0001F09F
PlayingCardsU+0001F0A0U+0001F0FF
EnclosedAlphanumericSupplementU+0001F100U+0001F1FF
EnclosedIdeographicSupplementU+0001F200U+0001F2FF
MiscellaneousSymbolsandPictographsU+0001F300U+0001F5FF
EmoticonsU+0001F600U+0001F64F
OrnamentalDingbatsU+0001F650U+0001F67F
TransportandMapSymbolsU+0001F680U+0001F6FF
AlchemicalSymbolsU+0001F700U+0001F77F
GeometricShapesExtendedU+0001F780U+0001F7FF
SupplementalArrows-CU+0001F800U+0001F8FF
SupplementalSymbolsandPictographsU+0001F900U+0001F9FF
CJKUnifiedIdeographsExtensionBU+00020000U+0002A6DF
CJKUnifiedIdeographsExtensionCU+0002A700U+0002B73F
CJKUnifiedIdeographsExtensionDU+0002B740U+0002B81F
CJKUnifiedIdeographsExtensionEU+0002B820U+0002CEAF
CJKUnifiedIdeographsExtensionFU+0002CEB0U+0002EBEF
CJKCompatibilityIdeographsSupplementU+0002F800U+0002FA1F
TagsU+000E0000U+000E007F
VariationSelectorsSupplementU+000E0100U+000E01EF
SupplementaryPrivateUseArea-AU+000F0000U+000FFFFF
SupplementaryPrivateUseArea-BU+00100000U+0010FFFF