summaryrefslogtreecommitdiff
path: root/pod/perluniintro.pod
diff options
context:
space:
mode:
authorDanny Rathjens <unknown>2007-06-21 10:35:26 -0700
committerRafael Garcia-Suarez <rgarciasuarez@gmail.com>2007-06-25 12:10:10 +0000
commita365f2ce4defc0d7fecd4e9484f8f958454c9192 (patch)
tree0c68361d3dfd14bd1500cd81949c7bab09a59c13 /pod/perluniintro.pod
parent85594c3193fdbad2767cc92f3cce383a97bccd08 (diff)
downloadperl-a365f2ce4defc0d7fecd4e9484f8f958454c9192.tar.gz
Apply doc suggestion from:
Subject: [perl #43287] perluniintro inaccurate answer to testing encoding validity From: Danny Rathjens (via RT) <perlbug-followup@perl.org> Message-ID: <rt-3.6.HEAD-1276-1182472526-1129.43287-75-0@perl.org> p4raw-id: //depot/perl@31462
Diffstat (limited to 'pod/perluniintro.pod')
-rw-r--r--pod/perluniintro.pod12
1 files changed, 6 insertions, 6 deletions
diff --git a/pod/perluniintro.pod b/pod/perluniintro.pod
index 9337e5f919..dcfb11bdaf 100644
--- a/pod/perluniintro.pod
+++ b/pod/perluniintro.pod
@@ -656,10 +656,11 @@ Use the C<Encode> package to try converting it.
For example,
use Encode 'decode_utf8';
- if (decode_utf8($string_of_bytes_that_I_think_is_utf8)) {
- # valid
+ eval { decode_utf8($string, Encode::FB_CROAK) };
+ if ($@) {
+ # $string is valid utf8
} else {
- # invalid
+ # $string is not valid utf8
}
Or use C<unpack> to try decoding it:
@@ -667,9 +668,8 @@ Or use C<unpack> to try decoding it:
use warnings;
@chars = unpack("C0U*", $string_of_bytes_that_I_think_is_utf8);
-If invalid, a C<Malformed UTF-8 character (byte 0x##) in unpack>
-warning is produced. The "C0" means
-"process the string character per character". Without that the
+If invalid, a C<Malformed UTF-8 character> warning is produced. The "C0" means
+"process the string character per character". Without that, the
C<unpack("U*", ...)> would work in C<U0> mode (the default if the format
string starts with C<U>) and it would return the bytes making up the UTF-8
encoding of the target string, something that will always work.