[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Clearing up misconceptions about characters vs bytes in the manual
- From: Rob Hoelz <rob@...>
- Date: Fri, 2 Nov 2012 14:27:16 +0100
Hi list,
A user came on the IRC channel today asking about Unicode support.
When I tried to explain that string.sub(2, 2) and io.read(1) wouldn't
work as expected on UTF-8 data by explaining that Lua only uses 8-bit
clean strings and doesn't understand UTF-8 data beyond a string of
bytes, the user pointed out that the manual speaks in terms of
characters, not bytes.
Would it be a good idea to make a distinction between characters and
bytes, or do you guys feel that this is already clear in the manual
(and PiL)?
-Rob
Attachment:
signature.asc
Description: PGP signature