lua-users home
lua-l archive

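That depends on how far you want to go with normalisation and the like: the same text can give a different count depending on whether its characters are in composed or decomposed form. I hope you have some time on your hands :) See Unicode Normalization Forms (UAX #15): http://unicode.org/reports/tr15/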
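
To illustrate why normalisation matters for counting, here is a minimal sketch in Python (not Lua, only because Python's standard `unicodedata` module implements the TR15 forms). The helper name `count_code_points` is made up for this example. It counts code points, not user-perceived characters, which is a further simplification.

```python
import unicodedata

# "é" written two ways: one precomposed code point,
# or a plain "e" followed by a combining acute accent.
composed = "\u00e9"
decomposed = "e\u0301"

def count_code_points(text, form=None):
    """Count code points, optionally after normalising to `form` (e.g. "NFC")."""
    if form is not None:
        text = unicodedata.normalize(form, text)
    return len(text)

print(count_code_points(composed))           # 1
print(count_code_points(decomposed))         # 2
print(count_code_points(decomposed, "NFC"))  # 1
print(count_code_points(composed, "NFD"))    # 2
```

So an "authoritative" count only exists once you have fixed which normalisation form (if any) you count in.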

On 6 January 2011 18:40, Henning Diedrich <hd2010@eonblast.com> wrote:
I am using this text [1] to test UTF-8 character counting.

Does anybody know how to get an authoritative count of how many characters that should actually be? Minus any invalid ones, should there be any in that text?
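
One way to cross-check a counter is to compare it against a strict decoder that reports where decoding fails. The sketch below is in Python rather than Lua and is only an assumption about what "count minus invalid ones" should mean: it counts the code points that decode cleanly and, separately, the bytes the decoder rejects. The function name `count_utf8` and the sample bytes are invented for illustration.

```python
def count_utf8(data: bytes):
    """Return (valid_code_points, invalid_bytes) for a byte string."""
    valid = 0
    invalid = 0
    pos = 0
    while pos < len(data):
        chunk = data[pos:]
        try:
            # Everything that is left decodes cleanly: count it and stop.
            valid += len(chunk.decode("utf-8"))
            break
        except UnicodeDecodeError as err:
            # err.start and err.end are offsets into `chunk`.
            valid += len(chunk[:err.start].decode("utf-8"))
            invalid += err.end - err.start
            pos += err.end
    return valid, invalid

# "a", "é", "€", one stray 0xFF byte, then "z"
sample = "a\u00e9\u20ac".encode("utf-8") + b"\xff" + b"z"
print(count_utf8(sample))  # (4, 1)
```

How many bytes a decoder groups into one "invalid" unit is a policy choice, so other tools may report a different figure for the invalid part while agreeing on the valid one.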