Re: Should Lua be more strict about Unicode errors?

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Should Lua be more strict about Unicode errors?
From: Dirk Laurie <dirk.laurie@...>
Date: Sun, 30 Aug 2015 14:53:11 +0200

2015-08-30 14:30 GMT+02:00 Soni L. <fakedme@gmail.com>:
> LuaJIT recently added Lua 5.3's "\u{}" escapes. It's also more strict about
> Unicode errors than Lua 5.3[1].
>
> For example, "\u{d800}" is valid in Lua 5.3, but not in LuaJIT.
>
> Should Lua be more strict about Unicode errors?

Why should it be invalid? The `d` indicates that here should be
a codepoint of two bytes, and two bytes are given. Surely it depends
on the application, not the language, what to make of it. The utf8
section of the Lua  manual says:

This library provides basic support for UTF-8 encoding. It provides
all its functions inside the table utf8. This library does not provide
any support for Unicode other than the handling of the encoding.
Any operation that needs the meaning of a character, such as
character classification, is outside its scope.

It is not unreasonable for this rule to apply to \u too.

Remember that LuaJIT is not even 5.2 compliant, let alone 5.3.

Follow-Ups:
- Re: Should Lua be more strict about Unicode errors?, Soni L.
- Re: Should Lua be more strict about Unicode errors?, Jay Carlson

References:
- Should Lua be more strict about Unicode errors?, Soni L.

Prev by Date: Should Lua be more strict about Unicode errors?
Next by Date: Re: Should Lua be more strict about Unicode errors?
Previous by thread: Should Lua be more strict about Unicode errors?
Next by thread: Re: Should Lua be more strict about Unicode errors?
Index(es):
- Date
- Thread