Code page 1386
Code page 1386 (CP1386), also known under Microsoft Windows as Code page 936 (CP936), is Microsoft's character encoding for simplified Chinese, one of the four DBCSs for East Asian languages. Originally it was identical to GB 2312, and expanded to cover most part of GBK with the release of Windows 95. It was superseded by code page 54936 (GB 18030), but as of 2014 was still prevalent in use. The Windows command prompt uses CP936 as the default code page for simplified Chinese installations, although part of the GB 18030 was made mandatory for all software products sold in China. In 2002, the IANA Internet name GBK was registered for CP1386's mapping,[1] making it the official GBK charset on the Internet.
The concept "CP1386", "GBK"[lower-alpha 1] and "GB2312" are sometimes confused in various software products. Code page 1386 is not identical to GBK because a code page encodes characters while the GBK only defines code points. In addition, the Euro sign (€), encoded as 0x80 in CP1386, is not defined in GBK. On the other hand, 95 characters defined in GBK were initially not encoded into CP1386. This is partly resolved in later versions of Windows and, as in Windows 7, all GBK characters not in the Unicode BMP Private Use Area can be displayed using code page 1386, but encoding the 95 characters was still not supported as of 2014. However, "CP936" and "GBK" are often used interchangeably because of the popularity of Microsoft products on the Chinese market when GBK was then published. Since GBK superseded GB2312 long ago, these two terms have also become virtually equivalent to many users, so "CP1386", "GBK" and "GB2312" are misunderstood by many to mean the same thing while they actually differ significantly. Instead of supporting precisely GB2312, most modern-day software products mean partial support for GBK using CP1386 when they use the term "GB2312" as a character encoding option. This can be observed in products such as Microsoft Internet Explorer and Notepad++.
See also
Notes
- ↑ GBK 1.0
References
- ↑ "Character Sets". Retrieved 3 October 2016.
External links
- Microsoft's Reference for Code Page 1386 (936 in Microsoft Windows)
- ICU's Authoritative GBK mapping - part of GB18030 data
- Mapping of Code Page 1386 (936 in Microsoft Windows) to Unicode