diff options
author | Steve Peters <steve@fisharerojo.org> | 2008-12-19 11:38:31 -0600 |
---|---|---|
committer | Steve Peters <steve@fisharerojo.org> | 2008-12-19 11:38:31 -0600 |
commit | 2bbc8d558d247c6ef91207a12a4650c0bc292dd6 (patch) | |
tree | f56c82008dc643d8e799b8e21fb9a3c36b64b3b4 /autodoc.pl | |
parent | 7df2e4bc09d8ad053532c5f9232b2d713856c938 (diff) | |
download | perl-2bbc8d558d247c6ef91207a12a4650c0bc292dd6.tar.gz |
Subject: PATCH 5.10 documentation
From: karl williamson <public@khwilliamson.com>
Date: Tue, 16 Dec 2008 16:00:34 -0700
Message-ID: <49483312.80804@khwilliamson.com>
Diffstat (limited to 'autodoc.pl')
-rw-r--r-- | autodoc.pl | 25 |
1 files changed, 24 insertions, 1 deletions
diff --git a/autodoc.pl b/autodoc.pl index f97af93acf..25fabf0ca4 100644 --- a/autodoc.pl +++ b/autodoc.pl @@ -238,7 +238,30 @@ Note that all Perl API global variables must be referenced with the C<PL_> prefix. Some macros are provided for compatibility with the older, unadorned names, but this support may be disabled in a future release. -The listing is alphabetical, case insensitive. +Perl was originally written to handle US-ASCII only (that is characters +whose ordinal numbers are in the range 0 - 127). +And documentation and comments may still use the term ASCII, when +sometimes in fact the entire range from 0 - 255 is meant. + +Note that Perl can be compiled and run under EBCDIC (See L<perlebcdic>) +or ASCII. Most of the documentation (and even comments in the code) +ignore the EBCDIC possibility. +For almost all purposes the differences are transparent. +As an example, under EBCDIC, +instead of UTF-8, UTF-EBCDIC is used to encode Unicode strings, and so +whenever this documentation refers to C<utf8> +(and variants of that name, including in function names), +it also (essentially transparently) means C<UTF-EBCDIC>. +But the ordinals of characters differ between ASCII, EBCDIC, and +the UTF- encodings, and a string encoded in UTF-EBCDIC may occupy more bytes +than in UTF-8. + +Also, on some EBCDIC machines, functions that are documented as operating on +US-ASCII (or Basic Latin in Unicode terminology) may in fact operate on all +256 characters in the EBCDIC range, not just the subset corresponding to +US-ASCII. + +The listing below is alphabetical, case insensitive. _EOB_ |