diff options
author | Jarkko Hietaniemi <jhi@iki.fi> | 2002-03-04 21:44:42 +0000 |
---|---|---|
committer | Jarkko Hietaniemi <jhi@iki.fi> | 2002-03-04 21:44:42 +0000 |
commit | b27299349ad81150118bac89c15f9a07eb50cb15 (patch) | |
tree | c035dec83eab464f332070fcab0922b413ee5fce /ext/Encode/TW | |
parent | ef985a5e619db32e9d186349467c230900236933 (diff) | |
download | perl-b27299349ad81150118bac89c15f9a07eb50cb15.tar.gz |
More Han tweaks from Autrjius Tang: most importantly,
gbk is identical to cp936, so gbk can be removed and
taken care of by an alias.
p4raw-id: //depot/perl@15020
Diffstat (limited to 'ext/Encode/TW')
-rw-r--r-- | ext/Encode/TW/TW.pm | 46 |
1 files changed, 46 insertions, 0 deletions
diff --git a/ext/Encode/TW/TW.pm b/ext/Encode/TW/TW.pm index 7a68811ac3..90b046041b 100644 --- a/ext/Encode/TW/TW.pm +++ b/ext/Encode/TW/TW.pm @@ -6,3 +6,49 @@ XSLoader::load('Encode::TW',$VERSION); 1; __END__ +=head1 NAME + +Encode::TW - Taiwan-based Chinese Encodings + +=head1 SYNOPSIS + + use Encode::CN; + $big5 = encode("big5", $utf8); + $utf8 = encode("big5", $big5); + +=head1 DESCRIPTION + +This module implements Taiwan-based Chinese charset encodings. +Encodings supported are as follows. + + big5 The original Big5 encoding + big5-hkscs Big5 plus Cantonese characters in Hong Kong + cp950 Code Page 950 (Big5 + Microsoft vendor mappings) + +To find how to use this module in detail, see L<Encode>. + +=head1 NOTES + +Due to size concerns, C<EUC-TW> (Extended Unix Character) and C<BIG5PLUS> +(CMEX's Big5+) are distributed separately on CPAN, under the name +L<Encode::HanExtra>. That module also contains extra China-based encodings. + +=head1 BUGS + +The C<CNS11643> encoding files are not complete (only the first two planes, +C<11643-1> and C<11643-2>, exist in the distribution). For common CNS11643 +manipulation, please use C<EUC-TW> in L<Encode::HanExtra>, which contains +plane 1-7. + +ASCII part (0x00-0x7f) is preserved for all encodings, even though it +conflicts with mappings by the Unicode Consortium. See + +F<http://www.debian.or.jp/~kubota/unicode-symbols.html.en> + +to find why it is implemented that way. + +=head1 SEE ALSO + +L<Encode> + +=cut |