Stop pos() from being confused by changing utf8ness

The value of pos() is stored as a byte offset. If it is stored on a tied variable or a reference (or glob), then the stringification could change, resulting in pos() now pointing to a different character off- set or pointing to the middle of a character: $ ./perl -Ilib -le '$x = bless [], chr 256; pos $x=1; bless $x, a; print pos $x' 2 $ ./perl -Ilib -le '$x = bless [], chr 256; pos $x=1; bless $x, "\x{1000}"; print pos $x' Malformed UTF-8 character (unexpected end of string) in match position at -e line 1. 0 So pos() should be stored as a character offset. The regular expression engine expects byte offsets always, so allow it to store bytes when possible (a pure non-magical string) but use char- acters otherwise. This does result in more complexity than I should like, but the alter- native (always storing a character offset) would slow down regular expressions, which is a big no-no.
author: Father Chrysostomos <sprout@cpan.org> 2013-07-23 13:15:34 -0700
committer: Father Chrysostomos <sprout@cpan.org> 2013-08-25 12:22:40 -0700
commit: 25fdce4a165b6305e760d4c8d94404ce055657a0 (patch)
tree: 7c3aa76b83b1518991bf23909ee072c55de29138 /sv.h
parent: 428ccf1e2d78d72b07c5e959e967569a82ce07ba (diff)
download: perl-25fdce4a165b6305e760d4c8d94404ce055657a0.tar.gz
1 files changed, 1 insertions, 2 deletions
diff --git a/sv.h b/sv.h
index 6d8a40e8f6..2f0eabc74a 100644
--- a/sv.h
+++ b/sv.h
@@ -1976,12 +1976,11 @@ mg.c:1024: warning: left-hand operand of comma expression has no effect
 #define sv_catpvn_nomg_maybeutf8(dsv, sstr, slen, is_utf8) \
 	sv_catpvn_flags(dsv, sstr, slen, (is_utf8)?SV_CATUTF8:SV_CATBYTES)
 
-#ifdef PERL_CORE
+#if defined(PERL_CORE) || defined(PERL_EXT)
 # define sv_or_pv_len_utf8(sv, pv, bytelen)	      \
     (SvGAMAGIC(sv)				       \
 	? utf8_length((U8 *)(pv), (U8 *)(pv)+(bytelen))	\
 	: sv_len_utf8(sv))
-# define sv_or_pv_pos_u2b(sv,s,p,lp) S_sv_or_pv_pos_u2b(aTHX_ sv,s,p,lp)
 #endif
 
 /*
author	Father Chrysostomos <sprout@cpan.org>	2013-07-23 13:15:34 -0700
committer	Father Chrysostomos <sprout@cpan.org>	2013-08-25 12:22:40 -0700
commit	25fdce4a165b6305e760d4c8d94404ce055657a0 (patch)
tree	7c3aa76b83b1518991bf23909ee072c55de29138 /sv.h
parent	428ccf1e2d78d72b07c5e959e967569a82ce07ba (diff)
download	perl-25fdce4a165b6305e760d4c8d94404ce055657a0.tar.gz