- Subject: Re: Lost in Unicode
- From: Enrico Colombini <erix@...>
- Date: Mon, 20 Oct 2003 16:19:46 +0200
On Monday 20 October 2003 15:41, Reuben Thomas wrote:
> There is a way, because ISO-8859-1 files are invalid unicode.
Even if a two-character ISO-8859-1 sequence (both bytes in the high range)
happens to coincide with the valid UTF-8 encoding of a Unicode character?
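
To make the question concrete, here is a minimal Python sketch (not code from this thread; the helper name looks_like_utf8 is made up for illustration). It assumes "invalid Unicode" means "does not decode as UTF-8". A lone high-range ISO-8859-1 byte fails that check, but some pairs of high-range bytes pass it, so the test is a heuristic rather than a proof:

```python
def looks_like_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


# "cafe" with e-acute, stored as ISO-8859-1: the final byte is 0xE9.
latin1_word = "caf\u00e9".encode("latin-1")

# A-tilde followed by the copyright sign, stored as ISO-8859-1: bytes 0xC3 0xA9.
# The same two bytes are also the UTF-8 encoding of e-acute (U+00E9).
ambiguous = "\u00c3\u00a9".encode("latin-1")

print(looks_like_utf8(latin1_word))  # typical Latin-1 text is rejected
print(looks_like_utf8(ambiguous))    # this particular pair is accepted
```

In ordinary text such pairs are rare, since a letter like A-tilde would have to be followed directly by a symbol in the 0x80-0xBF range, but they are possible.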
By the way, I gather that Roberto's "toISO" function would not work correctly
if a "combining character" sequence is encountered (e.g. "e" followed by
"combining dieresis") instead of a single precomposed character ("e with
dieresis").
Are they commonly used in editors?
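
A small Python sketch of the combining-character issue, under stated assumptions: the to_iso function below is a hypothetical stand-in and is not Roberto's "toISO". It shows that the decomposed form is two code points with no direct ISO-8859-1 equivalent, and that normalizing to NFC first (one possible remedy, using the standard unicodedata module) yields the single precomposed character:

```python
import unicodedata

decomposed = "e\u0308"   # "e" followed by COMBINING DIAERESIS (two code points)
precomposed = "\u00eb"   # LATIN SMALL LETTER E WITH DIAERESIS (one code point)


def to_iso(text: str) -> bytes:
    """Hypothetical converter: compose to NFC, then encode as ISO-8859-1."""
    return unicodedata.normalize("NFC", text).encode("latin-1")


# Without normalization the combining mark cannot be encoded at all.
try:
    decomposed.encode("latin-1")
    naive_ok = True
except UnicodeEncodeError:
    naive_ok = False

print(naive_ok)
print(to_iso(decomposed) == to_iso(precomposed))
```

Composition only helps when a precomposed character exists in ISO-8859-1; other combining sequences would still need a fallback.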