author | Ævar Arnfjörð Bjarmason <avar@cpan.org> | 2007-04-06 21:41:57 +0000 |
---|---|---|
committer | Rafael Garcia-Suarez <rgarciasuarez@gmail.com> | 2007-04-12 13:57:49 +0000 |
commit | d38f6844f6115251e4fe0705e9b721f08bebf22b (patch) | |
tree | 2a492fc6dc1b0fb20073535cf077d6f86b7fad30 /pod/perlreguts.pod | |
parent | 102b13d314016f7ec14c00406088a88475fe52db (diff) | |
download | perl-d38f6844f6115251e4fe0705e9b721f08bebf22b.tar.gz |
Re: [PATCH] perlreguts.pod: use the unicode name for ß and show the codepoint
Message-ID: <51dd1af80704061441v4b972257ta4c95230bdbc47c5@mail.gmail.com>
p4raw-id: //depot/perl@30920
Diffstat (limited to 'pod/perlreguts.pod')
-rw-r--r-- | pod/perlreguts.pod | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/pod/perlreguts.pod b/pod/perlreguts.pod
index c61a9cf793..577f672bf4 100644
--- a/pod/perlreguts.pod
+++ b/pod/perlreguts.pod
@@ -694,8 +694,8 @@ Unicode. For instance, in ASCII, it is safe to assume that
 C<sizeof(char1) == sizeof(char2)>, but in UTF-8 it isn't. Unicode case folding is
 vastly more complex than the simple rules of ASCII, and even when not
 using Unicode but only localised single byte encodings, things can get
-tricky (for example, GERMAN-SHARP-ESS should match 'SS' in localised
-case-insensitive matching).
+tricky (for example, B<LATIN SMALL LETTER SHARP S> (U+00DF, E<szlig>)
+should match 'SS' in localised case-insensitive matching).
 
 Making things worse is that UTF-8 support was a later addition to the
 regex engine (as it was to perl) and this necessarily made things a lot
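
The two claims in the patched paragraph (characters differ in size under UTF-8, and U+00DF should match 'SS' case-insensitively) can be illustrated with a short sketch. This is not how perl's regex engine does it: Python's built-in `str.casefold()` stands in for full Unicode case folding, and the helper name `caseless_equal` is hypothetical.

```python
# Illustration only: Python string methods stand in for the folding
# logic discussed in perlreguts; this is not perl's implementation.
sharp_s = "\u00df"  # LATIN SMALL LETTER SHARP S

# In UTF-8, characters do not all occupy the same number of bytes.
ascii_len = len("s".encode("utf-8"))
sharp_s_len = len(sharp_s.encode("utf-8"))

# Full Unicode case folding maps this single character to two.
folded = sharp_s.casefold()


def caseless_equal(a: str, b: str) -> bool:
    """Compare two strings under full Unicode case folding (hypothetical helper)."""
    return a.casefold() == b.casefold()


print(ascii_len, sharp_s_len)         # 1 2
print(folded)                         # ss
print(caseless_equal(sharp_s, "SS"))  # True
print(sharp_s.lower() == "ss")        # False
```

Simple lowercasing leaves U+00DF unchanged, so the last comparison fails. A one-character-to-one-character mapping cannot express this fold, which is one reason the ASCII-era assumptions break down.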