On Fri, Mar 21, 2014 at 1:44 PM, Luiz Henrique de Figueiredo
<lhf@tecgraf.puc-rio.br> wrote:
> Lua 5.3.0 (work2) is now available for testing at
>         http://www.lua.org/work/lua-5.3.0-work2.tar.gz
>
> MD5     52bd13d0b40f637bc388a133b9bb8771  -
> SHA1    e52ea0acf4b2d7bf042f48bd01dddc149d517184  -
>
> This is a work version. An updated reference manual is included but
> all details may change in the final version. See
>         http://www.lua.org/work/doc/readme.html
>
> The main change in Lua 5.3.0 is the introduction of integers.
> For other changes, see
>         http://www.lua.org/work/doc/readme.html#changes
>
> The complete diffs from work1 are at
>         http://www.lua.org/work/diffs-lua-5.3.0-work1-work2.txt
>
> Enjoy. All feedback welcome. Thanks.
> --lhf
>
>

After having a look at the utf8 library in 5.3, I like what I see -- I
only wish for the addition of a utf8.sub() and a utf8.strip():
1) utf8.sub() would be a utf8-aware version of string.sub; 'nuff said
2) utf8.strip() would remove the multi-byte codepoints (anything
encoded in more than one byte) from a string -- it is possible to do
this with the utf8.codes() generator you guys provided, but in the
interest of speed I think this should be in C
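
To make the two requests concrete, here is a rough pure-Lua sketch of the
behavior I have in mind. The names utf8_sub and utf8_strip are hypothetical
(nothing like them exists in the library), the code assumes valid UTF-8
input, and it relies on utf8.len() and utf8.offset() behaving as described
in the 5.3 reference manual, which may not match work2 exactly.

```lua
-- Sketch only: utf8_sub and utf8_strip are hypothetical helpers, not part
-- of the Lua 5.3 utf8 library. Valid UTF-8 input is assumed.

-- Like string.sub, but i and j count characters instead of bytes.
local function utf8_sub(s, i, j)
  local len = utf8.len(s)
  if not len then error("invalid UTF-8", 2) end
  i, j = i or 1, j or -1
  -- Negative indices count from the end, as in string.sub.
  if i < 0 then i = len + i + 1 end
  if j < 0 then j = len + j + 1 end
  -- Clamp to the string, again mirroring string.sub.
  if i < 1 then i = 1 end
  if j > len then j = len end
  if i > j then return "" end
  local first = utf8.offset(s, i)         -- first byte of character i
  local last = utf8.offset(s, j + 1) - 1  -- last byte of character j
  return s:sub(first, last)
end

-- Remove every character that is encoded in more than one byte.
local function utf8_strip(s)
  local out = {}
  for _, cp in utf8.codes(s) do
    if cp < 0x80 then out[#out + 1] = string.char(cp) end
  end
  return table.concat(out)
end

-- For valid UTF-8 the same result can be had with a byte-class pattern,
-- which keeps the loop inside the C string library.
local function utf8_strip_gsub(s)
  return (s:gsub("[\x80-\xFF]+", ""))
end
```

The gsub variant is probably fast enough for most uses, but it drops any
stray high byte without complaint, whereas the utf8.codes() version raises
an error on invalid input.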

I really like utf8.offset(), that's something I should have added to
my own project: https://github.com/Pogs/lua-utf8-simple

Please do add utf8.sub() to be identical to string.sub()'s behavior
(apart from being utf8-aware).  I imagine a lot of people would wind
up rolling their own with varying subtle differences (like some
accepting negative indices and some not).  If you *really* want to
impress, make a generator/iterator that can cycle through a utf8 string
backward -- I would love that!  ((I also want a string.find() that
iterates backward))
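
A backward iterator can already be approximated in Lua on top of
utf8.offset(), so here is a minimal sketch of the semantics I mean. The
name utf8_rcodes is hypothetical, and the code assumes valid UTF-8 and the
utf8.offset()/utf8.codepoint() behavior from the 5.3 reference manual.

```lua
-- Sketch only: utf8_rcodes is a hypothetical helper, not a library function.
-- Yields (byte position, code point) pairs from the last character of s
-- to the first. Valid UTF-8 input is assumed.
local function utf8_rcodes(s)
  local pos = #s + 1  -- one past the final byte
  return function()
    if pos <= 1 then return nil end
    -- Step back to the start of the character that precedes pos.
    pos = utf8.offset(s, -1, pos)
    if not pos then return nil end
    return pos, utf8.codepoint(s, pos)
  end
end
```

It would be used like utf8.codes(): `for p, c in utf8_rcodes(s) do ... end`.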

Anyway, hope I was helpful :-)