summaryrefslogtreecommitdiff
path: root/update_unicode.sh
Commit message (Collapse)AuthorAgeFilesLines
* update_unicode.sh: delete the command groupbb/update-unicode-tableBeat Bolli2014-12-221-12/+11
| | | | | | | | Now that the whole file is generated by one single command, the command group is no longer needed. Signed-off-by: Beat Bolli <dev+git@drbeat.li> Signed-off-by: Junio C Hamano <gitster@pobox.com>
* update_unicode.sh: make the output structure visibleBeat Bolli2014-12-221-7/+9
| | | | | | | | By using a here document instead of the echo/uniset sequence, the final structure of the generated file becomes obvious. Signed-off-by: Beat Bolli <dev+git@drbeat.li> Signed-off-by: Junio C Hamano <gitster@pobox.com>
* update_unicode.sh: shorten uniset invocation pathBeat Bolli2014-12-221-2/+2
| | | | | | | | "uniset/uniset" is a relative path; there's no need to prefix it with "./". Signed-off-by: Beat Bolli <dev+git@drbeat.li> Signed-off-by: Junio C Hamano <gitster@pobox.com>
* update_unicode.sh: set UNICODE_DIR only onceBeat Bolli2014-12-221-2/+3
| | | | | | | | | | | The value is the same on both uniset invocations, so "Don't Repeat Yourself" applies. Since this is done as the last command in the sequence, there's no need to unset UNICODE_DIR at the end. Signed-off-by: Beat Bolli <dev+git@drbeat.li> Signed-off-by: Junio C Hamano <gitster@pobox.com>
* update_unicode.sh: simplify output captureBeat Bolli2014-12-221-8/+9
| | | | | | | | | Instead of capturing the output of each echo and uniset invocation, wrap the whole section in a group command and redirect its output all at once. Signed-off-by: Beat Bolli <dev+git@drbeat.li> Signed-off-by: Junio C Hamano <gitster@pobox.com>
* utf8: make it easier to auto-update git_wcwidth()tb/unicode-6.3-zero-widthTorsten Bögershausen2014-05-121-0/+37
The function git_wcwidth() returns for a given unicode code point the width on the display: -1 for control characters, 0 for combining or other non-visible code points 1 for e.g. ASCII 2 for double-width code points. This table had been originally been extracted for one Unicode version, probably 3.2. We now use two tables these days, one for zero-width and another for double-width. Make it easier to update these tables to a later version of Unicode by factoring out the table from utf8.c into unicode_width.h and add the script update_unicode.sh to update the table based on the latest Unicode specification files. Thanks to Peter Krefting <peter@softwolves.pp.se> and Kevin Bracey <kevin@bracey.fi> for helping with their Unicode knowledge. Signed-off-by: Torsten Bögershausen <tboegi@web.de> Signed-off-by: Junio C Hamano <gitster@pobox.com>