On Mon, Jul 9, 2012 at 7:55 AM, Owen Shepherd wrote:
> The Unicode byte order mark is defined by the Unicode standard to be the
> character U+FEFF (formerly "zero width non breaking space"). The use of the
> BOM character to identify text files encoded using a Unicode Transformation
> Format is encouraged by the Unicode Consortium for cases where out of band
> signalling is not available

... and the same standard says that a BOM is neither required nor
recommended for UTF-8.

Of course, given the "be liberal in what you accept, conservative in
what you send" principle, it's not unreasonable to implement
BOM-skipping code, even for UTF-8.
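
For illustration only, here is a minimal sketch of what such BOM-skipping
could look like on the accepting side. It is written in Python rather than
Lua, and the function name skip_utf8_bom is made up for this example; it is
not taken from any existing loader. U+FEFF encodes in UTF-8 as the three
bytes EF BB BF, which Python's standard codecs module exposes as
codecs.BOM_UTF8.

```python
import codecs


def skip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if one is present.

    Hypothetical helper for illustration; only a complete three-byte BOM
    at the very start of the input is removed.
    """
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# Liberal in what we accept: input with or without a BOM yields the same bytes.
with_bom = codecs.BOM_UTF8 + b"print('hello')"
without_bom = b"print('hello')"
assert skip_utf8_bom(with_bom) == skip_utf8_bom(without_bom)
```

The "conservative in what you send" half would simply mean never writing
the BOM back out when producing UTF-8.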