path: root/test/test_unicode_normalize.rb
Commit message | Author | Age | Files | Lines
* Removed unused variables | Nobuyoshi Nakada | 2019-06-28 | 1 | -1/+1
* update to Unicode Version 12.1.0 (beta) | duerst | 2019-04-05 | 1 | -0/+4
  Unicode Version 12.1.0 adds one single character, U+32FF SQUARE ERA NAME REIWA, for the new Japanese era starting on May 1st. 12.1.0 will be finalized only on May 7th, so we go with the beta version because further changes in the data we need are highly unlikely, and we want to make sure Ruby is ready for the new era.
  * common.mk: change UNICODE_VERSION to 12.1.0, UNICODE_BETA to YES
  * enc/unicode/12.1.0, enc/unicode/12.1.0/casefold.h, enc/unicode/12.1.0/name2ctype.h: add directory and generated data files for new version
  * lib/unicode_normalize/tables.rb: update for new character
  * test/ruby/test_regexp.rb: add test for character property age=12.1
  * test/test_unicode_normalize.rb: add test for NFKC decomposition of new character
  This (mostly) completes issue #15195.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67441 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
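A check like the NFKC test described in this commit might look like the sketch below. It assumes a Ruby whose normalization tables include Unicode 12.1.0 or later. The expected decomposition to 令和 (U+4EE4 U+548C) is taken from the Unicode character data, not from this log, and the variable names are illustrative.

```ruby
# Assumption: this Ruby was built with Unicode 12.1.0 (or later) data.
reiwa    = "\u32FF"        # SQUARE ERA NAME REIWA
expected = "\u4EE4\u548C"  # the two kanji of the era name

# Compatibility normalization expands the square character.
nfkc = reiwa.unicode_normalize(:nfkc)

# Canonical normalization leaves it alone, because the mapping is
# a compatibility mapping only.
nfc = reiwa.unicode_normalize(:nfc)

puts "NFKC expands U+32FF: #{nfkc == expected}"
puts "NFC keeps U+32FF:    #{nfc == reiwa}"
```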
* fix range check for Hangul jamo trailers in Unicode normalization | duerst | 2018-07-28 | 1 | -0/+7
  * lib/unicode_normalize/normalize.rb: Fix the range check for trailing Hangul jamo characters in Unicode normalization. Unlike leading or vowel jamos, where LBASE and VBASE are actual characters, a value equal to TBASE expresses the absence of a trailing jamo. This fix is technically correct, but there was no bug because the regular expressions in lib/unicode_normalize/tables.rb eliminate jamos equal to TBASE from normalization processing.
  * test/test_unicode_normalize.rb: Add preventive test test_no_trailing_jamo based on https://github.com/python/cpython/commit/d134809cd3764c6a634eab7bb8995e3e2eff14d5 in case we ever get a regression.
  This closes issue #14934, thanks to MaLin (Lin Ma) for reporting.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64087 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
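The TBASE boundary described in this commit can be illustrated with a minimal sketch. The helper `trailing_jamo?` is hypothetical and is not the code in normalize.rb. The constant values (TBASE = U+11A7, TCOUNT = 28) come from the Hangul composition algorithm in the Unicode Standard, and the three inputs are modeled on the CPython test that the commit cites.

```ruby
# Hypothetical helper, for illustration only: a code point counts as a
# trailing jamo only when it lies strictly above TBASE.
def trailing_jamo?(code_point)
  tbase  = 0x11A7  # TBASE itself means "no trailing jamo"
  tcount = 28      # TCOUNT, including the "no trailer" slot
  code_point > tbase && code_point < tbase + tcount
end

# U+1176 is outside the composable vowel range, so nothing composes.
no_vowel = "\u1100\u1176\u11A8".unicode_normalize(:nfc)

# U+1100 U+1175 compose to U+AE30; U+11A7 (TBASE) must not be absorbed.
tbase_tail = "\u1100\u1175\u11A7".unicode_normalize(:nfc)

# U+11C3 is just past the trailing range and must not be absorbed either.
beyond_tail = "\u1100\u1175\u11C3".unicode_normalize(:nfc)

puts trailing_jamo?(0x11A7)
puts trailing_jamo?(0x11A8)
puts tbase_tail.codepoints.map { |cp| format("U+%04X", cp) }.join(" ")
```

The strict lower bound is the point of the fix: with `>=` instead of `>`, U+11A7 would be treated as a real trailer.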
* always run unicode normalization tests that do not depend on data file | duerst | 2018-07-28 | 1 | -0/+2
  * test/test_unicode_normalize.rb: extract tests that do not depend on NormalizationTest.txt data file from conditionally constructed part of TestUnicodeNormalize class, to always run them even if the data file isn't found.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64086 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* update unicode data files directory for normalization tests | duerst | 2018-07-28 | 1 | -2/+3
  * test/test_unicode_normalize.rb: search ucd directory first if it exists. This change follows r61415.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64085 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
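The "search ucd directory first" lookup could be sketched as below. The helper name `find_data_file` and the exact directory layout are assumptions made for illustration; the log does not show the real code.

```ruby
require "tmpdir"
require "fileutils"

# Hypothetical helper: prefer <base_dir>/ucd/<basename>, fall back to
# <base_dir>/<basename>, and return nil when neither file exists.
def find_data_file(base_dir, basename)
  candidates = [
    File.join(base_dir, "ucd", basename),  # checked first
    File.join(base_dir, basename)          # older layout, assumed
  ]
  candidates.find { |path| File.exist?(path) }
end

# Demonstration in a throwaway directory.
Dir.mktmpdir do |dir|
  FileUtils.mkdir_p(File.join(dir, "ucd"))
  FileUtils.touch(File.join(dir, "ucd", "NormalizationTest.txt"))
  puts find_data_file(dir, "NormalizationTest.txt")
end
```

Returning nil instead of raising lets the caller decide whether to skip the data-driven tests.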
* test/ruby/enc/test_case_comprehensive.rb, test_regex_casefold.rb, test/test_unicode_normalize.rb: Replace UNICODE_VERSION from UnicodeNormalize with RbConfig::CONFIG['UNICODE_VERSION'] from feature 12460. | duerst | 2016-07-03 | 1 | -1/+1
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55567 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
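Reading the version from RbConfig, as this commit describes, can be sketched as follows. It assumes a CRuby recent enough to define the `UNICODE_VERSION` key; the local variable name is illustrative.

```ruby
require "rbconfig"

# The Unicode version this Ruby was built against, as a string.
unicode_version = RbConfig::CONFIG["UNICODE_VERSION"]

puts "Unicode data version: #{unicode_version}"
```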
* Skip unavailable tests | nobu | 2016-06-28 | 1 | -10/+14
  * test/ruby/enc/test_case_comprehensive.rb: nothing to test unless Unicode data files are available. [ruby-core:76160] [Bug #12433]
  * test/test_unicode_normalize.rb: ditto.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55521 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test_unicode_normalize.rb: Add test to check for availability of Unicode data file; refactoring; fix an error with tests for destructive method (unicode_normalize!). | duerst | 2016-05-28 | 1 | -6/+15
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@55196 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
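The difference between the destructive and non-destructive methods can be shown with a short sketch. The strings and variable names are illustrative and are not taken from the test file.

```ruby
# "e" followed by COMBINING ACUTE ACCENT; dup gives a mutable copy.
decomposed = "e\u0301".dup
untouched  = "e\u0301"

# Non-destructive: returns a new string, receiver unchanged.
composed_copy = untouched.unicode_normalize(:nfc)

# Destructive: rewrites the receiver in place and returns it.
result = decomposed.unicode_normalize!(:nfc)

puts composed_copy.codepoints.length
puts untouched.codepoints.length
puts decomposed.codepoints.length
```

A test of the destructive method has to inspect the receiver afterwards; checking only the return value would not catch a method that silently behaves like the non-destructive one.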
* Add frozen_string_literal: false for all files | naruse | 2015-12-16 | 1 | -0/+1
  When you change this to true, you may need to add more tests.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@53141 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test_unicode_normalize.rb: get rid of a warning. | usa | 2015-06-14 | 1 | -2/+2
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@50883 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test_unicode_normalize.rb: skip only once | nobu | 2014-11-11 | 1 | -2/+3
  * test/test_unicode_normalize.rb (setup): skip only once, and silently ignore other tests.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48388 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test_unicode_normalize.rb: skip if not found | nobu | 2014-11-11 | 1 | -0/+2
  * test/test_unicode_normalize.rb (read_tests): do nothing if the test data file is not found.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48387 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* unicode_normalize: UNICODE_VERSION constant | nobu | 2014-11-10 | 1 | -1/+3
  * template/unicode_norm_gen.tmpl (UnicodeNormalize): embed the version of Unicode data files used to generate.
  * test/test_unicode_normalize.rb (TestUnicodeNormalize): use the embedded version to load the test data.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48357 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test_unicode_normalize.rb: Rename TestNormalize to TestUnicodeNormalize. Define constants under TestUnicodeNormalize. | akr | 2014-11-06 | 1 | -3/+4
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48301 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test_unicode_normalize.rb: added unicode version number to test data location. | hsbt | 2014-10-31 | 1 | -1/+3
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48202 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test-unicode_normalize.rb: added test_us_ascii. | duerst | 2014-10-25 | 1 | -0/+19
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48135 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
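What a US-ASCII test of this kind verifies can be sketched as below. The assertions in the real test_us_ascii are not shown in this log, so the checks here are an assumption about its intent: US-ASCII text is already normalized in every form and should come back unchanged.

```ruby
ascii = "abc".encode("US-ASCII")

# Every normalization form should leave pure ASCII text as it is.
results = [:nfc, :nfd, :nfkc, :nfkd].map do |form|
  ascii.unicode_normalize(form)
end

all_equal     = results.all? { |s| s == ascii }
already_clean = ascii.unicode_normalized?

puts "unchanged in all forms: #{all_equal}"
puts "reported as normalized: #{already_clean}"
```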
* remove trailing spaces. | svn | 2014-10-25 | 1 | -1/+1
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48133 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test-unicode_normalize.rb: added test_raise_exception_for_non_unicode_encoding. | duerst | 2014-10-25 | 1 | -0/+6
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48132 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
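The behavior that a test with this name checks can be sketched as follows. The choice of ISO-8859-1 as the non-Unicode encoding is an assumption for illustration; the log names only the test, not its contents.

```ruby
# A string in a non-Unicode encoding (ISO-8859-1 chosen as an example).
latin1 = "abc".encode("ISO-8859-1")

raised = false
begin
  latin1.unicode_normalize
rescue Encoding::CompatibilityError => e
  # Normalization is refused instead of guessing a conversion.
  raised = true
  puts "rejected: #{e.message}"
end

puts "raised: #{raised}"
```

Callers that hold legacy-encoded text would need to `encode` it to UTF-8 before normalizing.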
* test/test-unicode_normalize.rb: removed unused function. | duerst | 2014-10-25 | 1 | -8/+1
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48131 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test_unicode_normalize.rb: show error messages | nobu | 2014-10-23 | 1 | -23/+15
  * test/test_unicode_normalize.rb (generate_test_normalize): defer building explicit error messages until an assertion fails. This is 3% slower than @@debug = false.
    (generate_test_check_true): ditto.
    (generate_test_check_false): ditto.
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48113 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
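The idea of deferring message construction until a failure can be sketched with a block. The helper `check_equal` is hypothetical and is not the test suite's code; it only shows that the block building the message never runs for passing comparisons.

```ruby
# Hypothetical helper: the block that builds the message is called
# only when the comparison fails.
def check_equal(expected, actual)
  return true if expected == actual
  raise yield
end

messages_built = 0

# Passing check: the block is never evaluated.
check_equal("a", "a") { messages_built += 1; "unreachable" }
built_after_pass = messages_built

# Failing check: the block runs once and supplies the error text.
begin
  check_equal("a", "b") { messages_built += 1; "expected a, got b" }
rescue RuntimeError => e
  failure_message = e.message
end

puts built_after_pass
puts messages_built
puts failure_message
```

With thousands of generated cases, skipping string formatting on the passing path is where the saving comes from.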
* test/test_unicode_normalize.rb: set encoding when reading. | usa | 2014-10-22 | 1 | -1/+1
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48092 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test_unicode_normalize.rb: don't set default encodings globally | nobu | 2014-10-22 | 1 | -2/+0
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48090 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test-unicode_normalize.rb: as often said, ruby is sometimes built at non-srcdir. | usa | 2014-10-22 | 1 | -1/+1
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48087 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test-unicode_normalize.rb: Adjusted path for test data file (now ../enc/unicode/data/NormalizationTest.txt). | duerst | 2014-10-22 | 1 | -1/+1
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48086 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test-unicode_normalize.rb: Removed explicit require, changed method names, adjusted copyright. | duerst | 2014-10-22 | 1 | -19/+16
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48085 b2dd03c8-39d4-4d8f-98ff-823fe69b080e
* test/test-unicode_normalize.rb: Importing from https://github.com/duerst/eprun/blob/master/test/test_normalize.rb. (removing trailing whitespace, fixing EOLs and adding EOL property) | duerst | 2014-10-22 | 1 | -0/+171
  git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@48084 b2dd03c8-39d4-4d8f-98ff-823fe69b080e