User talk:Celada

From Wikipedia, the free encyclopedia

Welcome!

Hello, Celada, and welcome to Wikipedia! Thank you for your contributions. I hope you like the place and decide to stay. Here are a few good links for newcomers:

I hope you enjoy editing here and being a Wikipedian! Please sign your name on talk pages using four tildes (~~~~); this will automatically produce your name and the date. If you need help, check out Wikipedia:Where to ask a question, ask me on my talk page, or place {{helpme}} on your talk page and someone will show up shortly to answer your questions. Again, welcome!  --Flockmeal 03:23, Nov 20, 2004 (UTC)

UTF-8 overlong sequences

You wrote to sunny256:

Hi,

You put a note in the UTF-8 page back in March to the effect that reducing the UTF-8 range from 6 bytes to 4 bytes gets rid of the problem of overlong sequences. In fact I do not believe this is true, but I'm willing to be enlightened if there's something I'm missing!

For example, E0 8E B1 is an overlong encoding of α (alpha, U+03B1), whose correct encoding is CE B1, and it doesn't use any of the newly invalid byte values. Celada 03:24, 2004 Nov 24 (UTC)
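The byte arithmetic behind that example can be checked with a short sketch. The helper `decode_lenient` below is hypothetical and written only for illustration: it deliberately skips the overlong check that a conforming decoder must perform, and reports instead whether the sequence was longer than necessary. It handles only one sequence of up to four bytes.

```python
def decode_lenient(data):
    """Decode a single UTF-8 sequence WITHOUT rejecting overlong forms.

    Hypothetical helper for illustration only; returns (code_point, is_overlong).
    """
    first = data[0]
    # The lead byte determines the sequence length and carries the top payload bits.
    if first < 0x80:
        length, value = 1, first
    elif first & 0xE0 == 0xC0:
        length, value = 2, first & 0x1F
    elif first & 0xF0 == 0xE0:
        length, value = 3, first & 0x0F
    elif first & 0xF8 == 0xF0:
        length, value = 4, first & 0x07
    else:
        raise ValueError("invalid lead byte")
    if len(data) != length:
        raise ValueError("sequence length does not match lead byte")
    # Each continuation byte has the form 10xxxxxx and adds six payload bits.
    for b in data[1:]:
        if b & 0xC0 != 0x80:
            raise ValueError("invalid continuation byte")
        value = (value << 6) | (b & 0x3F)
    # Smallest code point that genuinely needs a sequence of this length.
    minimum = (0x0, 0x80, 0x800, 0x10000)[length - 1]
    return value, value < minimum


print(decode_lenient(bytes.fromhex("E08EB1")))  # three-byte form of alpha
print(decode_lenient(bytes.fromhex("CEB1")))    # shortest form of alpha
```

Both sequences yield the code point 0x3B1, but only the three-byte form is flagged as overlong, and every byte in it is below F5. A strict decoder such as Python's built-in `bytes.decode("utf-8")` rejects E0 8E B1 outright.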

Yes, it seems you are right. I got that impression from a discussion on the linux-utf8 mailing list, but I don't remember the actual thread.
I have removed the section you refer to. I also tried to find out why RFC 3629 reduces the maximum sequence length from six bytes to four, but haven't been able to find anything. I'll add a note about this on the UTF-8 talk page.
Thanks for finding this incorrect information. -- sunny256 2004-12-06