author Anton Tagunov <tagunov@motor.ru> 2002-03-05 00:14:39 +0300
committer Jarkko Hietaniemi <jhi@iki.fi> 2002-03-04 21:19:31 +0000
commit b31c5e315f5d5cd338d1988c0e246776f4a63890
tree 8d8b1daea54cb29427fcd89df12534cb911c36c6 /pod/perlunicode.pod
parent 54162f5c00ecbf69ece32b5c45c43c32cdb1626c
Based on

Subject: Re[2]: [ID 20020303.005] Patch perlinicode C API description
Message-ID: <1814137429.20020304211439@motor.ru>

p4raw-id: //depot/perl@15018
Diffstat (limited to 'pod/perlunicode.pod')
-rw-r--r-- pod/perlunicode.pod 15
1 files changed, 8 insertions, 7 deletions
diff --git a/pod/perlunicode.pod b/pod/perlunicode.pod
index c170d2c1da..44bd568b79 100644
--- a/pod/perlunicode.pod
+++ b/pod/perlunicode.pod
@@ -851,13 +851,14 @@ the following C APIs useful (see perlapi for details):
 DO_UTF8(sv) returns true if the UTF8 flag is on and the bytes pragma
 is not in effect. SvUTF8(sv) returns true is the UTF8 flag is on, the
 bytes pragma is ignored. The UTF8 flag being on does B<not> mean that
-there are any characters of code points greater than 255 (or 127) in the
-scalar, or that there even are any characters in the scalar. What the
-UTF8 flag means is that the sequence of octets in the representation
-of the scalar should be treated as UTF-8 encoding of a string.
-The UTF8 flag being off means that each octet in this representation
-encodes a single character with codepoint 0..255 within the string.
-Perl's Unicode model is not to use UTF-8 until it's really necessary.
+there are any characters of code points greater than 255 (or 127) in
+the scalar, or that there even are any characters in the scalar.
+What the UTF8 flag means is that the sequence of octets in the
+representation of the scalar is the sequence of UTF-8 encoded
+code points of the characters of a string. The UTF8 flag being
+off means that each octet in this representation encodes a single
+character with codepoint 0..255 within the string. Perl's Unicode
+model is not to use UTF-8 until it's really necessary.
 =item *