| Commit message (Collapse) | Author | Age | Files | Lines |
| |
|
|
|
|
|
|
|
|
|
| |
[DELTA]
1.04 Sat Dec 7 11:34:18 2013
- XS: a workaround for perl 5.6.x to handle U+FFFF correctly.
unpack_U() is implemented by using XS again as well as that in 1.02,
but now that is used only in the versions before perl 5.8.0.
|
|
|
|
|
|
|
|
| |
[DELTA]
1.03 Sun Dec 1 21:45:46 2013
- XS: now unpack_U() uses unpack('U*') in pure perl.
avoid XS for the internal "utf8" encoding of perl.
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
1.02 Sun Nov 10 18:39:37 2013
- POD: fix [rt.cpan.org #90170] about iso-8859-1 letters in pod.
E<> is used for the compatibility with perl 5.6.1 and possibly EBCDIC.
- 1.01 forgot to increase the version number of CJK/Korean.pm.
- modified tests: cjkrange.t, compatui.t, hangtype.t, illegal.t,
loc_ja.t, loc_ta.t, overcjk0.t, overcjk1.t, view.t in t.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
1.01 Sat Nov 2 19:00:38 2013
- DUCET is updated (for Unicode 6.3.0) as Collate/allkeys.txt.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- The default UCA_Version is 28.
- Locale/*.pl (except fr.pl) and CJK/Korean.pm are updated.
- modified tests: loc_es.t, loc_estr.t, rewrite.t, version.t in t.
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
1.00 Sun Oct 27 13:22:17 2013
- When a subroutine by 'overrideOut' taking a out-of-range value and
returning undef, now the value is treated as if it were U+FFFD.
* 0.99 wrongly calculates implicit weights based on out-of-range values.
- Assertion using unpack 'U' is added. If not only pack('U') but also
unpack('U') of CORE:: don't work as expected, this module will die.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
This was discussed on p5p:
http://www.nntp.perl.org/group/perl.perl5.porters/2013/10/msg208635.html
One LICENSE file (Locale-Codes) has been removed (contrary to one suggestion
in that thread) since it was the only one, and only says "This module is
free software; you can redistribute it and/or modify it under the same
terms as Perl itself." anyway.
Two files that I had previously suggested retaining for the sake of tests
have also been removed. In one case, a test script has been modified (in
Digest-MD5; I will send a ticket upstream); in the other case (in
Test-Harness) I have removed the test which depended on the MANIFEST files,
as suggested in the above thread.
|
| |
|
|
|
|
|
|
|
| |
[DELTA]
0.98 Sat Jun 15 19:44:06 2013
- typo (see [rt.cpan.org #85655] typo fixes)
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.97 Sat Dec 22 14:25:50 2012
- bug fix: XS of 0.96 (if UCA_Version is 9 to 11) wrongly referred to
DUCET for completely ignorable characters, even though the collator
don't use DUCET.
- separated t/notable.t from t/test.t.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.96 Sat Dec 15 19:43:10 2012
- special noncharancter tailorings ('highestFFFF' and 'minimalFFFE')
* some locales are modified for 'highestFFFF': as, bn, fa, gu, hi, hy,
kn, kok, mr, or, sa, si, si_dict, ta, te, th, ur.
- U::C::Locale now allows 'entry' to add or override mappings.
- bug fix: using DUCET through XS wrongly prevented completely ignorable
characters from tailoring.
- modified tests: default.t, loc_as.t, loc_bn.t, loc_fa.t, loc_gu.t,
loc_hi.t, loc_hy.t, loc_kn.t, loc_kok.t, loc_mr.t, loc_or.t, loc_sa.t,
loc_si.t, loc_sidt.t, loc_ta.t, loc_te.t, loc_test.t, loc_th.t,
loc_ur.t, nonchar.t in t.
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.95 Sat Dec 8 15:11:09 2012
- U::C::Locale newly supports locales: bs_Cyrl, ee.
- updated to CLDR 21: uk.
- updated to CLDR 22: th, to.
- added loc_bscy.t, loc_ee.t in t.
- modified tests: loc_th.t, loc_to.t, loc_uk.t in t.
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.94 Fri Nov 23 18:45:53 2012
- U::C::Locale newly supports locale: zh__zhuyin.
- added Unicode::Collate::CJK::Zhuyin for zh__zhuyin.
- doc: added CAVEAT to CJK/Stroke.pm
- modified tests: loc_cjk.t, loc_cjkc.t in t.
- added cjk_zy.t, loc_zhzy.t in t.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.93 Sun Nov 18 18:13:42 2012
- DUCET is updated (for Unicode 6.2.0) as Collate/allkeys.txt.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- The default UCA_Version is 26.
- Locale/*.pl (except fr.pl) and CJK/Korean.pm are updated.
- modified tests: loc_es.t, loc_estr.t, version.t in t.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.92 Wed Nov 14 20:58:19 2012
- fix: index() etc. with preprocess/normalization should be always croaked.
- doc: referred to the latest UTS #10 and updated its section numbers.
- supported the identical level (see 'identical' in POD).
- Now UCA_Version 26 (for Unicode 6.2.0) is supported.
- added ident.t in t.
- modified tests: cjkrange.t, compatui.t, hangtype.t, index.t,
overcjk0.t, overcjk1.t, test.t, view.t in t.
* But the default UCA_Version is still 24.
(In the next release, UCA_Version 26 will be the default.)
|
|
|
|
|
|
|
|
| |
[DELTA]
0.91 Sun Nov 4 17:00:20 2012
- XSUB: use PERL_NO_GET_CONTEXT (see perlguts)
(see [rt.cpan.org #80313])
|
|
|
|
|
|
|
|
| |
[DELTA]
0.90 Sun Sep 23 10:42:26 2012
- perl 5.11.0 or later: Install to 'site' instead of 'perl'
(see [rt.cpan.org #79800])
|
| |
|
|
|
|
|
|
|
|
| |
[DELTA]
0.87 Sat Nov 26 17:01:42 2011
- Now Locale/*.pl files are searched in @INC. (see [rt.cpan.org #72666])
- added locale_version method to access the version number of Locale/*.pl.
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.86 Wed Nov 23 17:16:00 2011
- tailored compatibility ideographs as well as unified ideographs for
the locales: ja, ko, zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke.
- added loc_cjkc.t in t.
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.85 Sat Nov 19 20:01:57 2011
- U::C::Locale newly supports locales: bn, sa.
- added loc_bn.t, loc_cjk.t, loc_sa.t in t.
- updated some locales to CLDR 2.0 : zh__pinyin, zh__stroke.
* supported compatibility decomposable characters and U+FDD0 indexes.
* updated CJK/Pinyin.pm and CJK/Stroke.pm.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.84 Sun Nov 6 14:44:51 2011
- U::C::Locale supports script codes.
- U::C::Locale newly supports locales: fa, sr_Latn, ur.
- added loc_fa.t, loc_srla.t, loc_ur.t in t.
0.83 Sun Oct 30 20:22:04 2011
- mklocale: auto-generate equivalents for suppressed contractions.
* be.txt, bg.txt, kk.txt, mk.txt, ru.txt, sr.txt, uk.txt in data
are simplified.
* but no Locale/*.pl will be modified.
0.82 Sun Oct 30 10:03:48 2011
- U::C::Locale newly supports locales: si, si__dictionary,
sv__reformed, ta, te, th, wae.
- added loc_si.t, loc_sidt.t, loc_svrf.t, loc_ta.t, loc_te.t,
loc_th.t, loc_wae.t in t.
- updated some locales to CLDR 2.0 : sk, sr, sv, uk.
- updated CJK/Pinyin.pm according to CLDR 2.0.
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.81 Sun Oct 23 21:32:36 2011
- U::C::Locale newly supports locales: ml, mr, or, pa.
- added loc_ml.t, loc_mr.t, loc_or.t, loc_pa.t in t.
- updated some locales to CLDR 2.0 : mk, mt, nb, nn, ro, ru.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.80 Sun Oct 9 21:00:21 2011
- U::C::Locale newly supports locales: bs, hi, kn, kok, ln.
- added loc_bs.t, loc_hi.t, loc_kn.t, loc_kok.t, loc_ln.t in t.
- updated some locales to CLDR 2.0 : ha, hr, kk, lt.
0.79 Sun Oct 2 20:31:01 2011
- pod: [rt.cpan.org #70241] Fix minor grammar error in manpage
by Harlan Lieberman-Berg.
- 'suppress' no longer affects contractions via 'entry'.
- U::C::Locale newly supports locales: as, fi__phonebook, gu.
- added loc_as.t, loc_fiph.t, loc_gu in t.
- updated some locales to CLDR 2.0 : ar, be, bg.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.79 Sun Oct 2 20:31:01 2011
- pod: [rt.cpan.org #70241] Fix minor grammar error in manpage
by Harlan Lieberman-Berg.
- 'suppress' no longer affects contractions via 'entry'.
- U::C::Locale newly supports locales: as, fi__phonebook, gu.
- added loc_as.t, loc_fiph.t, loc_gu in t.
- updated some locales to CLDR 2.0 : ar, be, bg.
|
|
|
|
|
|
|
|
| |
[DELTA]
0.78 Mon Jul 25 21:29:50 2011
- tried fixing the tarball with world writable files.
( http://www.perlmonks.org/?node_id=731935 )
|
|
|
|
|
|
|
| |
[DELTA]
0.77 Sun Jul 3 21:15:08 2011
- xs: [perl #93470] [PATCH] consting in Collate.xs by Robin Barker.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.76 Sun May 15 10:06:59 2011
- updated CJK/Pinyin.pm and CJK/Stroke.pm according to CLDR 1.9.1 using
type='pinyin' alt='short' and type='stroke' alt='short' respectively.
0.75 Sat May 7 21:07:38 2011
- supported ignore_level2 and rewrite.
- Added iglevel2.t and rewrite.t in t.
0.74 Mon Mar 21 19:07:38 2011
- removed sw (Swahili) collation according to CLDR 1.9.
(removed files: Collate/Locale/sw.pl and data/sw.txt)
- shifted primary weights of letters > Z for some languages.
(affected locales: da, fi, fo, kl, nb, nn, sv)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.73 Sun Mar 6 13:24:22 2011
- DUCET is updated (for Unicode 6.0.0) as Collate/allkeys.txt.
! However no maint perl has supported Unicode 6.0.0 yet;
wait for 5.14, or try developing 5.13.7 or later.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- The default UCA_Version is 22. Locale/*.pl and Korean.pm are updated.
- test: compare allkeys.txt's version with Base_Unicode_Version
in t/default.t.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Second attempt to integrate the XS version of
Unicode::Collate into core.
[DELTA]
0.72 Sat Jan 22 17:28:32 2011
- xs: fix mixing char* and U8*.
0.71 Tue Jan 18 22:29:44 2011
- t/loc_test.t should not fail without Unicode::Normalize.
0.70 Sun Jan 16 20:31:07 2011
- Now U::C::Locale->new will use the compiled DUCET via XS if available.
added some tests in t/loc_test.t.
0.69 Sat Jan 15 19:41:11 2011
- clarified about XSUB. revised INSTALL in README.
- xs: flag passed to utf8n_to_uvuni().
- doc and comments: [perl #81876] Fix typos by Peter J. Acklam.
|
|
|
|
| |
This reverts commit 211cc5012284f4bd900fcaa630adbcac69ca6112.
|
|
|
|
|
|
| |
This reverts commit cac3df65afe2fed9ad80147a24b5ae6ea601a609.
Backing out change to XS
|
|
|
|
|
|
|
| |
[DELTA]
0.71 Tue Jan 18 22:29:44 2011
- t/loc_test.t should not fail without Unicode::Normalize.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.70 Sun Jan 16 20:31:07 2011
- Now U::C::Locale->new will use the compiled DUCET via XS.
0.69 Sat Jan 15 19:41:11 2011
- clarified about XSUB. revised INSTALL in README.
- xs: flag passed to utf8n_to_uvuni().
- doc and comments: [perl #81876] Fix typos by Peter J. Acklam.
A pure-perl version of this module was previously installed,
this commit enables the XS version of the module instead.
|
| |
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.68 Tue Nov 23 20:17:22 2010
- doc: clarified about (backwards => [ ]) and (backwards => undef).
- separated t/backwds.t from t/test.t.
- added cjk_b5.t, cjk_gb.t, cjk_ja.t, cjk_ko.t, cjk_py.t, cjk_st.t in t
for CJK/*.pm without Locale.pm.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.67 Sun Nov 14 11:38:59 2010
- supported UCA_Version 22 for Unicode 6.0.0.
* 2B740..2B81D are new CJK unified ideographs.
* noncharacters (e.g. U+FFFF) should be overridable, not be ignored.
! DUCET is NOT updated, as no maint perl supports Unicode 6.0.0.
Thus the default UCA_Version is still 20.
- added t/nonchar.t.
- improved discontiguous contractions of 3 or more characters.
(e.g. 0FB2 0F71 0F80 and 0FB3 0F71 0F80)
- auxiliary: now 'mklocale' also copes with Korean.pm according to DUCET.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.66 Sun Nov 7 10:47:30 2010
- U::C::Locale newly supports locale: ko.
- added Unicode::Collate::CJK::Korean for ko.
- added t/loc_ko.t.
- 12 compat. ideographs (e.g. U+FA0E) are treated as unified ideographs.
(though DUCET also does it, now Unicode::Collate does it without DUCET.)
- added t/compatui.t.
! Ideographs Ext.B (U+20000..U+2A6D6) can be overrided with UCA_Version 8.
This is a long-standing behavior from Unicode::Collate 0.11 to 0.63.
A wrong fix at 0.64 should be abandoned.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.65 Wed Nov 3 13:10:20 2010
- U::C::Locale newly supports locale: zh and its some variants.
(zh__big5han, zh__gb2312han, zh__pinyin, zh__stroke)
- added Unicode::Collate::CJK::Big5 for zh__big5han.
- added Unicode::Collate::CJK::GB2312 for zh__gb2312han.
- added Unicode::Collate::CJK::Pinyin for zh__pinyin.
- added Unicode::Collate::CJK::Stroke for zh__stroke.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.64 Sun Oct 31 14:17:29 2010
- U::C::Locale newly supports locale: ja.
- added Unicode::Collate::CJK::JISX0208 for ja.
- a subroutine specified in 'overrideCJK' or 'overrideHangul' is allowed
to return an integer or undef value.
- fix: Ideographs Ext.A are assigned since UCA Version 9 (Unicode 3.1).
This fix should affect only the case of (UCA_Version => 8).
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.63 Sun Oct 10 22:13:21 2010
- supported suppress contractions (see 'suppress' in POD).
- internal for hangul_terminator in getSortKey().
- U::C::Locale newly supports locales: be, bg, kk, mk, ru, sr.
- added tailoring with U+0340 or U+0341 instead of U+0300 or U+0301.
(affected locales: hr, is, pl, se, to, wo)
|
|
|
|
|
|
|
|
| |
[DELTA]
0.62 Wed Oct 6 21:35:54 2010
- U::C::Locale newly supports locales: ar, hu, hy, se, to, uk.
- Vietnamese (vi): added tailoring for U+0340 and U+0341.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.61 Sat Oct 2 11:41:29 2010
- U::C::Locale newly supports locales: hr, ig, sq.
- precomposites of e-dot-below, o-dot-below, o-tilde are tailored as well.
(affected locales: et, yo)
- Vietnamese (vi): added contractions for non-blocked decompositions
* base + dot-below + diacritical such as "a\x{323}\x{306} etc.
* base + tone + horn such as "o\x{300}\x{31B}" etc.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.60 Thu Sep 23 21:37:36 2010
- bug fix: index() [and its friends including gmatch()] didn't remove
ignorable characters in the substring correctly.
Thanks for the bug report:
http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2010-09/msg00014.html
- U::C::Locale newly supports locales: de__phonebook, nso, om, tn, vi.
- precomposites of a-breve, a-circ, e-circ, o-circ are tailored as well.
(affected locales: ro, sk, sv)
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.59 Sun Sep 5 17:03:52 2010
- U::C::Locale newly supports locales: az, fil, ha, lt, mt, tr, wo, yo.
- precomposites of a-uml, o-uml, and u-uml are tailored as well.
(affected locales: da, et, fi, fo, is, kl, nb, nn, sk, sv)
|
|
|
|
|
|
|
| |
[DELTA]
0.58 Sun Aug 29 19:56:50 2010
- U::C::Locale newly supports locales: af, cy, da, fo, haw, is, kl, sw.
|
|
|
|
|
|
|
| |
[DELTA]
0.57 Sun Aug 22 22:39:58 2010
- Unicode::Collate::Locale newly supports locales: ca, et, fi, lv, sk, sl.
|
|
|
|
|
|
|
|
| |
[DELTA]
0.56 Sun Aug 8 20:24:03 2010
- Unicode::Collate::Locale newly supports locales: eo, nb, ro, sv.
! renamed t/locale_{xy}.t to t/loc_{xy}.t (for safer 8.3 names)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.55 Sun Aug 1 21:21:23 2010
- incorporated Unicode::Collate::Locale with some changes.
(see http://www.xray.mpe.mpg.de/mailing-lists/perl-unicode/2004-03/msg00030.html)
! added t/locale*.t that uses DUCET.
0.54 Sun Jul 25 21:37:04 2010
- Now UCA Revision 20 (based on Unicode 5.2.0).
- DUCET is also updated (for Unicode 5.2.0) as Collate/allkeys.txt,
which *is required* to test this module.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- U+9FC4..U+9FCB and U+2A700..U+2B734 are new CJK Unified Ideographs.
- Many hangul jamo are assigned (affecting hangul_terminator).
! DUCET will be compiled when XS is used. Explicit saying
<table => 'allkeys.txt'> (or using another table) will prevent
this module using the compiled DUCET.
! added t/default.t that uses DUCET.
|
|
|
|
|
|
|
|
|
|
|
|
| |
[DELTA]
0.53 Sun Feb 14 20:46:27 2010
- Now UCA Revision 18 (based on Unicode 5.1.0).
- DUCET is also upgrated (for Unicode 5.1.0) as Collate/allkeys.txt,
which is not required to test this module.
! Please notice that allkeys.txt will be overwritten if you have had
other allkeys.txt already.
- U+9FBC..U+9FC3 are new CJK Unified Ideographs.
|