Re: Lua interpreter and Lua files encoding

lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Subject: Re: Lua interpreter and Lua files encoding
From: Peter Odding <peter@...>
Date: Thu, 06 Jan 2011 11:45:43 +0100

Anyway, is this only an implementation artifact? Or is something that
will last? In this latter case a mention in the reference manual could
be useful, since utf8 is very common nowadays and generating utf8 files
using Lua, _without specialized libraries_ and without the hassle of
encoding literals with escape sequence, is really a useful!

The Lua 5.1 reference manual defines that "strings in Lua can containany 8-bit value" but it doesn't guarantee the same for literal stringsembedded in Lua source code. So if you really want to guaranteecompatibility with different Lua implementations (e.g. LuaJIT, Kahlua,luaj, Jill, LuaCLR, LuaToCee, the list* goes on for quite a while..)then it might be wise to encode UTF-8 string literals using escapesequences. On the other hand, the end of line normalization mentionedearlier should never corrupt valid UTF-8 sequences because UTF-8 wasspecifically designed to be compatible with ASCII.

I don't think the Lua reference manual should mention UTF-8 unless itwill guarantee that string literals with UTF-8 contents are passedthrough unharmed. However the writing in the Lua reference manual isgenerally quite conservative. I think one of the reasons for this is toease the implementation of Lua on a range of platforms with differentcharacteristics.


 - Peter Odding

* http://lua-users.org/wiki/LuaImplementations

PS. Given the above I don't see how you would need "specializedlibraries" to generate Lua source code containing literal strings withUTF-8 using escape sequences, i.e. the following should suffice tooutput such string literals:


function encode_literal(s)
  return '"' .. s:gsub('[^A-Za-z0-9 ]', function(c)
    return ('\\%d'):format(c:byte())
  end) .. '"'
end

print(encode_literal 'Ångström')

Follow-Ups:
- Re: Lua interpreter and Lua files encoding, Dirk Laurie

References:
- RE: Lua interpreter and Lua files encoding, jgiors
- Re: Lua interpreter and Lua files encoding, Lorenzo Donati

Prev by Date: Re: Lua Cookbook
Next by Date: Re: Lua Cookbook
Previous by thread: Re: Lua interpreter and Lua files encoding
Next by thread: Re: Lua interpreter and Lua files encoding
Index(es):
- Date
- Thread