[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: The Lua utf8 library (Was: Issues: Character 160 ...)
- From: Roberto Ierusalimschy <roberto@...>
- Date: Wed, 11 Jul 2018 11:45:46 -0300
> On Wed, Jul 11, 2018 at 3:10 PM, Hisham <h@hisham.hm> wrote:
> > Ultimately, the problem is: you would expect utf8.match("name:
> > %a*%d+", "name: Hélène123") to work, but that doesn't seem feasible to
> > do without adding Unicode knowledge.
>
> Which is a _heavy_ task, given the number of human scripts in common use!
>
> By the way, always been curious how non-English Lua people cope with
> the existing limitations of Lua patterns?
>
> Assume 'ASCII' punctuation and work around that?
Mainly. Either your text have all kinds of stuff, and then you need real
Unicode support, or else everything outside ASCII can be assumed to be
letters (accented letters and c-cedilla).
-- Roberto