Re: question about Unicode

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: question about Unicode
From: Rici Lake <lua@...>
Date: Thu, 7 Dec 2006 18:02:02 -0500


On 7-Dec-06, at 5:55 PM, Mike Pall wrote:

Well, then there are also distinct characters that have the same
glyph shape, Like 'a' and '\u0430' (Cyrillic a). Normalization
won't help you here ... There is no perfect solution.


Absolutely, but one can minimize confusion.

I'm unlikely to accidentally type a Cyrillic a when I meant 'a', but itis very easy to accidentally have the wrong character encoding, or tobe using an input method which decomposition normalizes instead ofcomposition normalizing.

Protecting against the wrong character encoding is easy, though: justinsist that the source file be valid utf-8, which is a very fast test.

References:
- Re: question about Unicode, David Jones
- Re: question about Unicode, Roberto Ierusalimschy
- Re: question about Unicode, David Given
- Re: question about Unicode, Rici Lake
- Re: question about Unicode, Roberto Ierusalimschy
- Re: Re: question about Unicode, Ken Smith
- Re: question about Unicode, Adrian Perez
- Re: question about Unicode, David Given
- Re: question about Unicode, Mike Pall
- Re: question about Unicode, Rici Lake
- Re: question about Unicode, Mike Pall

Prev by Date: Re: question about Unicode
Next by Date: Re: question about Unicode
Previous by thread: Re: question about Unicode
Next by thread: Re: question about Unicode
Index(es):
- Date
- Thread