summaryrefslogtreecommitdiff
path: root/ext/Encode/TW
diff options
context:
space:
mode:
authorJarkko Hietaniemi <jhi@iki.fi>2002-03-04 21:44:42 +0000
committerJarkko Hietaniemi <jhi@iki.fi>2002-03-04 21:44:42 +0000
commitb27299349ad81150118bac89c15f9a07eb50cb15 (patch)
treec035dec83eab464f332070fcab0922b413ee5fce /ext/Encode/TW
parentef985a5e619db32e9d186349467c230900236933 (diff)
downloadperl-b27299349ad81150118bac89c15f9a07eb50cb15.tar.gz
More Han tweaks from Autrjius Tang: most importantly,
gbk is identical to cp936, so gbk can be removed and taken care of by an alias. p4raw-id: //depot/perl@15020
Diffstat (limited to 'ext/Encode/TW')
-rw-r--r--ext/Encode/TW/TW.pm46
1 files changed, 46 insertions, 0 deletions
diff --git a/ext/Encode/TW/TW.pm b/ext/Encode/TW/TW.pm
index 7a68811ac3..90b046041b 100644
--- a/ext/Encode/TW/TW.pm
+++ b/ext/Encode/TW/TW.pm
@@ -6,3 +6,49 @@ XSLoader::load('Encode::TW',$VERSION);
1;
__END__
+=head1 NAME
+
+Encode::TW - Taiwan-based Chinese Encodings
+
+=head1 SYNOPSIS
+
+ use Encode::CN;
+ $big5 = encode("big5", $utf8);
+ $utf8 = encode("big5", $big5);
+
+=head1 DESCRIPTION
+
+This module implements Taiwan-based Chinese charset encodings.
+Encodings supported are as follows.
+
+ big5 The original Big5 encoding
+ big5-hkscs Big5 plus Cantonese characters in Hong Kong
+ cp950 Code Page 950 (Big5 + Microsoft vendor mappings)
+
+To find how to use this module in detail, see L<Encode>.
+
+=head1 NOTES
+
+Due to size concerns, C<EUC-TW> (Extended Unix Character) and C<BIG5PLUS>
+(CMEX's Big5+) are distributed separately on CPAN, under the name
+L<Encode::HanExtra>. That module also contains extra China-based encodings.
+
+=head1 BUGS
+
+The C<CNS11643> encoding files are not complete (only the first two planes,
+C<11643-1> and C<11643-2>, exist in the distribution). For common CNS11643
+manipulation, please use C<EUC-TW> in L<Encode::HanExtra>, which contains
+plane 1-7.
+
+ASCII part (0x00-0x7f) is preserved for all encodings, even though it
+conflicts with mappings by the Unicode Consortium. See
+
+F<http://www.debian.or.jp/~kubota/unicode-symbols.html.en>
+
+to find why it is implemented that way.
+
+=head1 SEE ALSO
+
+L<Encode>
+
+=cut