summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorKarl Williamson <public@khwilliamson.com>2013-03-08 12:08:17 -0700
committerKarl Williamson <public@khwilliamson.com>2013-03-08 12:12:31 -0700
commitd5924ca660e2fe6f9a41f9be79d82972ea3c3616 (patch)
tree51176336faf33153b69e080f5c5864d045789688
parent3b2d14782fbfa2afbd16c5ee36653a25675e6fc5 (diff)
downloadperl-d5924ca660e2fe6f9a41f9be79d82972ea3c3616.tar.gz
perlebcdic.pod: Document 2 fcns; nits
The functions to convert code points from/to native were not previously documented. This also fixes some heading issues.
-rw-r--r--pod/perlebcdic.pod17
1 files changed, 13 insertions, 4 deletions
diff --git a/pod/perlebcdic.pod b/pod/perlebcdic.pod
index ecf94d14f3..2256fb1ef6 100644
--- a/pod/perlebcdic.pod
+++ b/pod/perlebcdic.pod
@@ -77,7 +77,7 @@ identification numbers (CCSID numbers) or code page numbers.
Perl can be compiled on platforms that run any of three commonly used EBCDIC
character sets, listed below.
-=head2 The 13 variant characters
+=head3 The 13 variant characters
Among IBM EBCDIC character code sets there are 13 characters that
are often mapped to different integer values. Those characters
@@ -92,7 +92,9 @@ one of the three Perl knows about, Perl will either fail to compile, or
mistakenly and silently choose one of the three.
They are:
-=head2 0037
+=over
+
+=item B<0037>
Character code set ID 0037 is a mapping of the ASCII plus Latin-1
characters (i.e. ISO 8859-1) to an EBCDIC set. 0037 is used
@@ -100,18 +102,20 @@ in North American English locales on the OS/400 operating system
that runs on AS/400 computers. CCSID 0037 differs from ISO 8859-1
in 237 places, in other words they agree on only 19 code point values.
-=head2 1047
+=item B<1047>
Character code set ID 1047 is also a mapping of the ASCII plus
Latin-1 characters (i.e. ISO 8859-1) to an EBCDIC set. 1047 is
used under Unix System Services for OS/390 or z/OS, and OpenEdition
for VM/ESA. CCSID 1047 differs from CCSID 0037 in eight places.
-=head2 POSIX-BC
+=item B<POSIX-BC>
The EBCDIC code page in use on Siemens' BS2000 system is distinct from
1047 and 0037. It is identified below as the POSIX-BC set.
+=back
+
=head2 Unicode code points versus EBCDIC code points
In Unicode terminology a I<code point> is the number assigned to a
@@ -683,6 +687,11 @@ code page you can use the Config module like so:
=head1 CONVERSIONS
+=head2 C<utf8::unicode_to_native()> and C<utf8::native_to_unicode()>
+
+These functions take an input numeric code point in one encoding and
+return what its equivalent value is in the other.
+
=head2 tr///
In order to convert a string of characters from one character set to