- Subject: Re: question about Unicode
- From: Matt Campbell <mattc@...>
- Date: Mon, 04 Dec 2006 12:22:52 -0600
It depends on whether you want to use the encoding specified by the
current locale, or always use UTF-8. The former is the more general
solution and is probably preferred on Unix; GNU/Linux distributions are
moving toward UTF-8 as the default locale encoding anyway. However, the
locale-based approach is problematic on Windows; someone please correct
me if I'm wrong, but I believe that UTF-8 is never (or rarely) the
encoding associated with the system locale on Windows. So if you always
want to use UTF-8, it's probably better to use a hand-written converter.
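
For the converter route, here's a rough sketch of what I have in mind,
assuming you use the Win32 MultiByteToWideChar call; the function name
utf8_to_wide is just something I made up, and the caller frees the
result with free():

#include <stdlib.h>
#include <windows.h>

/* Convert a NUL-terminated UTF-8 string to a newly allocated
   UTF-16 (wchar_t) string; returns NULL on error. */
static wchar_t *utf8_to_wide(const char *utf8)
{
    /* The first call reports how many wide characters are needed,
       including the terminating NUL (because -1 is passed). */
    int len = MultiByteToWideChar(CP_UTF8, 0, utf8, -1, NULL, 0);
    if (len == 0)
        return NULL;  /* invalid UTF-8 or other error */
    wchar_t *wide = malloc(len * sizeof(wchar_t));
    if (wide == NULL)
        return NULL;
    MultiByteToWideChar(CP_UTF8, 0, utf8, -1, wide, len);
    return wide;
}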
By the way, I find it annoying that on Windows, you have to use special
wide-character functions if you want your code to be Unicode-aware;
passing UTF-8 strings around would be a much more portable solution. I
have considered developing an alternative C runtime library for Windows
in which, among other differentiating features, UTF-8 would be assumed
as the encoding for non-wide-character strings (so for example, you'd
pass a UTF-8 string to fopen instead of having to call _wfopen). Would
there be any interest in this among Lua users?
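
To make that concrete, a wrapper in such a runtime might look roughly
like this; utf8_fopen is just a name I made up, and it assumes the
utf8_to_wide helper from the sketch above is in the same file:

#include <stdio.h>
#include <stdlib.h>
#include <wchar.h>

FILE *utf8_fopen(const char *path, const char *mode)
{
    /* Convert the UTF-8 arguments to UTF-16 and forward to the
       wide-character CRT function _wfopen. */
    wchar_t *wpath = utf8_to_wide(path);
    wchar_t *wmode = utf8_to_wide(mode);
    FILE *f = NULL;
    if (wpath != NULL && wmode != NULL)
        f = _wfopen(wpath, wmode);
    free(wpath);
    free(wmode);
    return f;
}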
--
Matt Campbell
Lead Programmer
Serotek Corporation
www.freedombox.info
"The Accessibility Anywhere People"