author | Martin Dürst <duerst@it.aoyama.ac.jp> | 2019-07-14 10:58:50 +0900 |
---|---|---|
committer | Martin Dürst <duerst@it.aoyama.ac.jp> | 2019-07-14 10:58:50 +0900 |
commit | 369ff79394765ce198ac7cee872a8c739d895aaa (patch) | |
tree | d373a8e2a3b835f981a85dbb0e25e93730fba776 /enc/big5.c | |
parent | ac2866005b96baf986072f86ecd3dfd887f2bda3 (diff) | |
add encoding conversion from/to CESU-8
Add encoding conversion (transcoding) from UTF-8 to CESU-8
and back. CESU-8 is an encoding similar to UTF-8, but it encodes
each code point above U+FFFF as a UTF-16 surrogate pair, and
each surrogate is then encoded as three bytes, as if it were a
UTF-8 code point. As a result, binary sorting gives the same
order as in UTF-16. CESU-8 is also similar (although not
identical) to an encoding used internally by Java.
This completes issue #15995.
enc/trans/cesu_8.trans: Add encoding conversion from/to CESU-8
test/ruby/test_transcode.rb: Add tests for above
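The encoding rule described in the message can be sketched in plain Ruby. This is only an illustration under stated assumptions, not the code added in enc/trans/cesu_8.trans: the helper names `utf8_to_cesu8` and `three_byte_form` are hypothetical, and the input is assumed to be a valid UTF-8 string.

```ruby
# Hypothetical helper (not part of the commit): three-byte UTF-8-style
# form of a 16-bit value in the range 0x0800..0xFFFF.
def three_byte_form(unit)
  [0xE0 | (unit >> 12),
   0x80 | ((unit >> 6) & 0x3F),
   0x80 | (unit & 0x3F)].pack("C3")
end

# Hypothetical helper (not part of the commit): returns the CESU-8 bytes
# for a valid UTF-8 string as a binary (ASCII-8BIT) string.
def utf8_to_cesu8(str)
  out = "".b
  str.each_codepoint do |cp|
    if cp > 0xFFFF
      # Split the code point into a UTF-16 surrogate pair ...
      v = cp - 0x10000
      high = 0xD800 | (v >> 10)
      low  = 0xDC00 | (v & 0x3FF)
      # ... and encode each surrogate as three bytes.
      out << three_byte_form(high) << three_byte_form(low)
    else
      # Code points up to U+FFFF are encoded exactly as in UTF-8.
      out << [cp].pack("U").b
    end
  end
  out
end

bytes = utf8_to_cesu8("\u{10000}")
puts bytes.unpack("C*").map { |b| format("%02X", b) }.join(" ")
# prints "ED A0 80 ED B0 80"
```

In this sketch U+10000 becomes six bytes starting with 0xED, so it sorts before U+FFFF (EF BF BF) in a bytewise comparison, just as the surrogate pair D800 DC00 sorts before FFFF in UTF-16. In UTF-8 the order is the other way round, because U+10000 starts with 0xF0.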
Diffstat (limited to 'enc/big5.c')
0 files changed, 0 insertions, 0 deletions