lua-users home
lua-l archive

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]



On Oct 04, 2007, at 15:30, Aladdin Lampé wrote:

I am building a large application needing (Lua - of course) and also UTF-8 support.

While we are at it, there is also something like Sean M. Burke's Text::Unidecode:

http://interglacial.com/~sburke/tpj/as_html/tpj22.html
http://search.cpan.org/~sburke/Text-Unidecode-0.04/lib/Text/Unidecode.pm

Here is an example of an implementation of Unidecode in Lua [1]:

local Unidecode = require( 'Unidecode' )

print( Unidecode( 'Москва́' ) )
print( Unidecode( '北京' ) )
print( Unidecode( 'Ἀθηνᾶ' ) )
print( Unidecode( '서울' ) )
print( Unidecode( '東京' ) )
print( Unidecode( '京都市' ) )
print( Unidecode( 'नेपाल' ) )
print( Unidecode( 'תֵּל־אָבִיב-יָפוֹ' ) )
print( Unidecode( 'تَلْ أَبِيبْ يَافَا' ) )
print( Unidecode( 'تهران' ) )
print( Unidecode( 'Géometrie Différentielle' ) )

> Moskva
> beijing
> Athena
> seoul
> dongjing
> jingdushi
> nepaal
> te'labiyb-yapvo
> tal 'abiyb yaafaa
> thran
> Geometrie Differentielle

[1] http://dev.alt.textdrive.com/browser/HTTP/Unidecode.lua