[Date Prev][Date Next][Thread Prev][Thread Next]
- Subject: Re: Unicode?
- From: Tuomo Valkonen <tuomov@...>
- Date: Fri, 13 Jun 2003 01:32:05 +0300
On Thu, Jun 12, 2003 at 10:47:55PM +0100, Abigail Brady wrote:
> I'm confused though, by the suggestion that a "sensible subset" of
> Unicode excludes Cyrillic and Greek. Perhaps you meant something else?
"Normalization Form KC" or something like that, see down from
perhaps NFKD could be used, but for as simple as possible string
processing, again, IMHO combining characters should have been coded
in a standard way to the charater numbers instead of being separate
"semi-characters". I don't care if it doesn't fit in 16 bits, UTF-8
is more sensible than UCS* anyway.