On 05/12/2012 19:46, Jay Carlson wrote:
Life is usually easier in NFC. In fact, Perl6 was proposing to collapse composed sequences without a single precomposed character into internal-only dynamically allocated codepoints past the end of plane U+10xxxx, and this has some things to recommend it.
_That_ would certainly remove one final issue (once all the others are solved, though). (We could also use the surrogate range for that, since surrogates are not needed once the text is decoded anyway. That may be easier for existing software to cope with, since many UTF-8 decoders reject values >= 0x110000 as invalid but let surrogate values pass through.)
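To make the idea concrete, here is a rough Python sketch of the allocation scheme, not anything Perl6 actually ships. The class name `ClusterTable` and its methods are hypothetical, "cluster" is simplified to a base character followed by marks with a nonzero combining class (real grapheme segmentation is more involved), and the input is assumed to contain no lone surrogates. Since Python's `chr()` stops at 0x10FFFF, the encoded form is a list of ints rather than a string.

```python
import unicodedata


class ClusterTable:
    """Hypothetical table mapping combining sequences that NFC cannot
    collapse into internal-only code values, allocated on demand."""

    def __init__(self, first=0x110000, last=None):
        # Default: allocate past the end of the Unicode code space.
        # Pass first=0xD800, last=0xDFFF to reuse the surrogate range.
        self.last = last
        self.next_free = first
        self.by_sequence = {}
        self.by_code = {}

    def intern(self, sequence):
        """Return the internal code for a sequence, allocating if needed."""
        code = self.by_sequence.get(sequence)
        if code is None:
            if self.last is not None and self.next_free > self.last:
                raise OverflowError("internal code range exhausted")
            code = self.next_free
            self.next_free += 1
            self.by_sequence[sequence] = code
            self.by_code[code] = sequence
        return code

    def _emit(self, cluster):
        # A single code point stands for itself; longer clusters get
        # an internal code.
        return ord(cluster) if len(cluster) == 1 else self.intern(cluster)

    def encode(self, text):
        """Normalize to NFC, then return one int per simplified cluster."""
        text = unicodedata.normalize("NFC", text)
        codes = []
        cluster = ""
        for ch in text:
            # A character with combining class 0 starts a new cluster.
            if cluster and unicodedata.combining(ch) == 0:
                codes.append(self._emit(cluster))
                cluster = ""
            cluster += ch
        if cluster:
            codes.append(self._emit(cluster))
        return codes

    def decode(self, codes):
        """Expand internal codes back into their NFC sequences."""
        return "".join(
            self.by_code[c] if c in self.by_code else chr(c) for c in codes
        )


table = ClusterTable()
codes = table.encode("e\u0301q\u0303x")   # e + acute, q + tilde, x
print([hex(c) for c in codes])
print(table.decode(codes) == unicodedata.normalize("NFC", "e\u0301q\u0303x"))
```

With the surrogate variant (`ClusterTable(first=0xD800, last=0xDFFF)`) only 2048 distinct sequences fit before the table overflows, which is the price of staying inside values that existing decoders tolerate.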
Denis