Tags (Unicode block)
Tags | |
---|---|
Range |
U+E0000..U+E007F (128 code points) |
Plane | SSP |
Scripts | Common |
Assigned | 97 code points |
Unused |
31 reserved code points 1 deprecated |
Unicode version history | |
3.1 | 97 (+97) |
Note: [1][2] |
Tags is a Unicode block containing formatting tag characters (language tag and ASCII character tags).
U+E0001, U+E0020–U+E007F were originally intended for invisibly tagging texts by language[3] but that use is no longer recommended.[4] All of those characters were deprecated in Unicode 5.1.
With the release of Unicode 8.0, U+E0020–U+E007E are no longer deprecated characters. The change was made "to clear the way for the potential future use of tag characters for a purpose other than to represent language tags".[5] Unicode states that "the use of tag characters to represent language tags in a plain text stream is still a deprecated mechanism for conveying language information about text".[5]
With the release of Unicode 9.0, U+E007F is no longer a deprecated character. (U+E0001 LANGUAGE TAG remains deprecated.)
Tags[1][2][3] Official Unicode Consortium code chart (PDF) | ||||||||||||||||
0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | A | B | C | D | E | F | |
U+E000x | LANG | |||||||||||||||
U+E001x | ||||||||||||||||
U+E002x | SP | ! | " | # | $ | % | & | ' | ( | ) | * | + | , | - | . | / |
U+E003x | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | : | : | < | = | > | ? |
U+E004x | @ | A | B | C | D | E | F | G | H | I | J | K | L | M | N | O |
U+E005x | P | Q | R | S | T | U | V | W | X | Y | Z | [ | \ | ] | ^ | _ |
U+E006x | ` | a | b | c | d | e | f | g | h | i | j | k | l | m | n | o |
U+E007x | p | q | r | s | t | u | v | w | x | y | z | { | | | } | ~ | END |
Notes |
References
- ↑ "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
- ↑ "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
- ↑ "RFC2482: Language Tagging in Unicode Plain Text". Network Working Group. January 1999.
- ↑ "RFC6082: Deprecating Unicode Language Tag Characters: RFC 2482 is Historic". Internet Engineering Task Force (IETF). November 2010.
- 1 2 "Unicode 8.0.0, Implications for Migration". Unicode Consortium.