Re: io:lines() and \0

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: io:lines() and \0
From: Enrico Colombini <erix@...>
Date: Sat, 22 Feb 2014 13:07:52 +0100

On 22/02/2014 9.35, Thijs Schreijer wrote:

UTF8 was mentioned as a possible feature to be included in future
versions. If that happens, the arguments to get control characters
handled without data mangling, gets a lot stronger.

I may be mistaken, not being an Unicode expert (to put it mildly) but Iam under the impression that using a 'traditional' line input functionfor UTF-8 (with or without '\0') could open another, larger, can of worms.

The set of line terminators and white space characters seems to bedifferent; for example, U+2028 is a line separator and cannot berecognized by a simple test on the value returned by getc(). An UTF-8oriented line iterator would probably be needed.


P.S. It is not my intention to start a thread about what a line is :-)

--
  Enrico

Follow-Ups:
- Re: io:lines() and \0, Andrew Starks
- Re: io:lines() and \0, Francisco Olarte

References:
- io:lines() and \0, René Rebe
- Re: io:lines() and \0, steve donovan
- Re: io:lines() and \0, René Rebe
- Re: io:lines() and \0, Enrico Colombini
- Re: io:lines() and \0, steve donovan
- Re: io:lines() and \0, René Rebe
- Re: io:lines() and \0, Craig Barnes
- Re: io:lines() and \0, René Rebe
- Re: io:lines() and \0, Sean Conner
- Re: io:lines() and \0, René Rebe
- Re: io:lines() and \0, Tim Hill
- RE: io:lines() and \0, Thijs Schreijer

Prev by Date: Re: io:lines() and \0
Next by Date: Re: io:lines() and \0
Previous by thread: RE: io:lines() and \0
Next by thread: Re: io:lines() and \0
Index(es):
- Date
- Thread