[Date Prev][Date Next][Thread Prev][Thread Next]
- Subject: Re: utf8.len and BOM
- From: Marco Mastropaolo <marco@...>
- Date: Fri, 16 Jan 2015 18:22:45 +0100
From the Unicode standard:
>> The serialized order of the bytes must not depart from the order defined by the UTF-
>> 8 encoding form. Use of a BOM is neither required nor recommended for UTF-8, but may
>> be encountered in contexts where UTF-8 data is converted from other encoding forms that
>> use a BOM or where the BOM is used as a UTF-8 signature.
But, in a nutshell: having a BOM breaks unix utilities, not having it might break windows ones.