[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: question about Unicode
 
- From: "Brian Weed" <bw@...>
 
- Date: Thu, 07 Dec 2006 15:44:05 -0500
 
Asko Kauppi wrote:
But there may be some identifier "stamp" that can be used to know a 
file is UTF-8, no?
There are two that I know of.  I don't know how "standard" they are.  
One is called a BOM Header, which is some binary code in the first 2 
bytes of the "text" file.  The other is the occurrence of this text 
"charset=utf-8", anywhere in the file (at least according to the editor 
I use: UltraEdit).
Brian
- References:
- question about Unicode, Roberto Ierusalimschy
 
- Re: question about Unicode, Matt Campbell
 
- Re: question about Unicode, Roberto Ierusalimschy
 
- Re: question about Unicode, David Jones
 
- Re: question about Unicode, Roberto Ierusalimschy
 
- Re: question about Unicode, David Given
 
- Re: question about Unicode, Rici Lake
 
- Re: question about Unicode, Roberto Ierusalimschy
 
- Re: Re: question about Unicode, Ken Smith
 
- Re: question about Unicode, Adrian Perez
 
- Re: question about Unicode, Asko Kauppi