Tags (Unicode block)

Tags
Range U+E0000..U+E007F
(128 code points)
Plane SSP
Scripts Common
Assigned 97 code points
Unused 31 reserved code points
1 deprecated
Unicode version history
3.1 97 (+97)
Note: [1][2]

Tags is a Unicode block containing formatting tag characters (language tag and ASCII character tags).

U+E0001, U+E0020–U+E007F were originally intended for invisibly tagging texts by language[3] but that use is no longer recommended.[4] All of those characters were deprecated in Unicode 5.1.

With the release of Unicode 8.0, U+E0020–U+E007E are no longer deprecated characters. The change was made "to clear the way for the potential future use of tag characters for a purpose other than to represent language tags".[5] Unicode states that "the use of tag characters to represent language tags in a plain text stream is still a deprecated mechanism for conveying language information about text".[5]

With the release of Unicode 9.0, U+E007F is no longer a deprecated character. (U+E0001 LANGUAGE TAG remains deprecated.)

Tags[1][2][3]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+E000x  LANG 
U+E001x
U+E002x   SP     !     "     #     $     %     &     '     (     )     *     +     ,     -     .     /  
U+E003x   0     1     2     3     4     5     6     7     8     9     :     :     <     =     >     ?  
U+E004x   @     A     B     C     D     E     F     G     H     I     J     K     L     M     N     O  
U+E005x   P     Q     R     S     T     U     V     W     X     Y     Z     [     \     ]     ^     _  
U+E006x   `     a     b     c     d     e     f     g     h     i     j     k     l     m     n     o  
U+E007x   p     q     r     s     t     u     v     w     x     y     z     {     |     }     ~    END 
Notes
1.^ As of Unicode version 9.0
2.^ Grey areas indicate non-assigned code points
3.^ Unicode code points U+E0001 and U+E0020 through U+E007F were deprecated with Unicode version 5.1 however as of Unicode version 9.0 only U+E0001 remains deprecated

References

  1. "Unicode character database". The Unicode Standard. Retrieved 2016-07-09.
  2. "Enumerated Versions of The Unicode Standard". The Unicode Standard. Retrieved 2016-07-09.
  3. "RFC2482: Language Tagging in Unicode Plain Text". Network Working Group. January 1999.
  4. "RFC6082: Deprecating Unicode Language Tag Characters: RFC 2482 is Historic". Internet Engineering Task Force (IETF). November 2010.
  5. 1 2 "Unicode 8.0.0, Implications for Migration". Unicode Consortium.
This article is issued from Wikipedia - version of the 9/29/2016. The text is available under the Creative Commons Attribution/Share Alike but additional terms may apply for the media files.