lua-users home
lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]


On Tue, Mar 19, 2013 at 1:07 PM, Brad Jorsch <bjorsch@wikimedia.org> wrote:
> On Tue, Mar 19, 2013 at 3:57 PM, Vadim Peretokin <vperetokin@gmail.com>
> wrote:
>>
>> I'm curious about their utf-8 string library addition, has anyone tried it
>> and is it tailored for Wikipedia's setup or will it work on its own?
>
>
> There's two parts to it: one part (in the "ustring" subdirectory) is
> designed to work on its own, and then the second part overrides some of the
> slower bits with PHP callbacks for use inside MediaWiki.

Since I just posted it to another list and have it handy, here's what
I know of for UTF-8 string support:

One thing that's been missing from Batteries so far as I can tell is a
module/library for processing Unicode strings. I'd like to see that
added for both Lua 5.1 and Lua 5.2.

The lua-users wiki points to slnunicode as the solution.
<http://luaforge.net/projects/sln/>. But I've found zero documentation
for it, having checked in sources and on the Selene project web site.

I've been using this one for some time without problems on embedded
Lua 5.2. <http://www.curse.com/addons/wow/utf8>. It's implemented in
pure Lua. It's particularly nice because it includes the upper case <>
lower case mappings in a separate file and has functions for
converting strings to upper case and lower case.

There's another pure Lua implementation at
<https://github.com/alexander-yakushev/awesompd/blob/master/utf8.lua>.
I haven't tested it but from the code it looks less capable than the
last one mentioned.

Best regards,

Paul