Re: Lost in Unicode

lua-l archive

Subject: Re: Lost in Unicode
From: Enrico Colombini <erix@...>
Date: Mon, 20 Oct 2003 16:19:46 +0200

On Monday 20 October 2003 15:41, Reuben Thomas wrote:
> There is a way, because ISO-8859-1 files are invalid unicode. 

Even if a 2-character sequence (both bytes in the high range) happens to be 
identical to the UTF-8 encoding of a valid Unicode character?
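
To make the question concrete, here is a minimal sketch in Python (not from the thread; the helper name is_valid_utf8 is made up for illustration). It shows that most ISO-8859-1 text with high-range bytes fails a strict UTF-8 check, but that some two-byte sequences pass it and so remain ambiguous:

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes as strict UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# ISO-8859-1 "e with acute" is the single byte 0xE9. In UTF-8, 0xE9 opens a
# three-byte sequence, so on its own it is rejected.
lone = "\u00e9".encode("latin-1")            # b'\xe9'

# ISO-8859-1 "A with tilde" + "copyright sign" is the byte pair 0xC3 0xA9,
# which is also the UTF-8 encoding of "e with acute".
ambiguous = "\u00c3\u00a9".encode("latin-1")  # b'\xc3\xa9'

print(is_valid_utf8(lone))        # False
print(is_valid_utf8(ambiguous))   # True
print(ambiguous.decode("utf-8") == "\u00e9")  # True
```

So a validity check can prove that a file is not UTF-8, but passing the check does not by itself prove the file was not written as ISO-8859-1.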

By the way, I gather that Roberto's "toISO" function would not work correctly 
if it encountered a combining sequence (e.g. "e" followed by "combining 
dieresis") instead of a single precomposed character ("e with dieresis"). 
Are combining sequences commonly produced by editors?
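
As an illustration of the combining-character concern, here is a small sketch in Python. The function to_latin1 is a hypothetical stand-in and not Roberto's "toISO". It assumes that composing the text to NFC before conversion is acceptable, which only helps when a precomposed form exists in ISO-8859-1:

```python
import unicodedata


def to_latin1(text: str) -> bytes:
    """Hypothetical converter: compose to NFC, then encode as ISO-8859-1."""
    composed = unicodedata.normalize("NFC", text)
    return composed.encode("latin-1")


# "e" followed by U+0308 COMBINING DIAERESIS (two code points).
decomposed = "e\u0308"

# A direct conversion fails, because U+0308 has no ISO-8859-1 equivalent.
try:
    decomposed.encode("latin-1")
    direct_ok = True
except UnicodeEncodeError:
    direct_ok = False

print(direct_ok)              # False
print(to_latin1(decomposed))  # b'\xeb', i.e. "e with dieresis"
```

A combining sequence with no precomposed ISO-8859-1 character would still raise UnicodeEncodeError here, so a real converter would need a policy for that case.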

  Enrico
