Discussion:
UAX #29 changes
(too old to reply)
Daiki Ueno
2016-10-19 14:11:17 UTC
Permalink
Raw Message
Hello,

I have been trying to update libunistring to Unicode 9.0.0. Initially I
planned it for the end of this month, but now I'm almost giving up,
because of the recent additions to the UAX #29 algorithms:

- The 3 rules added to the Grapheme Cluster Boundary Rules, namely
(GB10, GB12, GB13), involve 3 consequent characters, while the current
API uc_is_grapheme_break() only takes 2 characters

- The similar rules are also added to the Word Boundary Rules. Though
it wouldn't be a problem as uniwbrk.h doesn't expose such API, the
implementation of WB15 and WB16 could be complicated because it
requires lookahead of a next character

I have pushed the current version to my github repository:
https://github.com/ueno/gnulib/tree/wip/dueno/unicode-9.0.0

but I will be unlikely to have time to complete it any time soon.
Sorry. If anyone could finish it or rewrite it, it would be
appreciated.

By the way, I would like to change the "Maintainer:" field of the Gnulib
modules to "all", if nobody opposes.

Regards,
--
Daiki Ueno
Paul Eggert
2016-10-19 19:45:12 UTC
Permalink
Raw Message
Post by Daiki Ueno
I would like to change the "Maintainer:" field of the Gnulib
modules to "all", if nobody opposes.
Sounds good to me, and thanks for all the work you are doing.

Loading...