[Date Prev][Date Next][Thread Prev][Thread Next]
[Date Index]
[Thread Index]
- Subject: Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8
- From: Sean Conner <sean@...>
- Date: Tue, 10 Jul 2018 18:17:18 -0400
It was thus said that the Great Gregg Reynolds once stated:
> On Tue, Jul 10, 2018, 4:44 PM Dirk Laurie <dirk.laurie@gmail.com> wrote:
> ...
>
> >
> > I. Am. Not. Asking. For. Unicode.
> >
> > I am merely asking for extra functions along the lines of what the
> > utf8 library already does.
> > E.g. Sam's examples:
> >
> > > s1 = "Hélène"
> > > s2 = "Hélène"
They look similar, but they are construct differently.
> FYI these look identical on Android.
>
> > > utf8.len(s1)
> > 6
> > > utf8.len(s2)
> > 7
> >
> > If you really not understand what I mean, I can elaborate.
>
> Please do.
>
> What does "len" mean? Number of Unicode chars ot number of bytes?
The number of Unicode code points. The second one has a letter 'e'
followed by a combining accent (I'm not sure which accent is the combining
one), thus the different number of Unicode code points.
-spc
- References:
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Alysson Cunha
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Axel Kittenberger
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Lorenzo Donati
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Albert Chan
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Sean Conner
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Lorenzo Donati
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Dirk Laurie
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Gregg Reynolds
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Dirk Laurie
- Re: Issues: Character 160 - Non-breaking space + Additional Issue with UTF-8, Gregg Reynolds