User talk:Celada
Welcome!
Hello, Celada, and welcome to Wikipedia! Thank you for your contributions. I hope you like the place and decide to stay. Here are a few good links for newcomers:
- The five pillars of Wikipedia
- How to edit a page
- Help pages
- Tutorial
- How to write a great article
- Manual of Style
I hope you enjoy editing here and being a Wikipedian! Please sign your name on talk pages using four tildes (~~~~); this will automatically produce your name and the date. If you need help, check out Wikipedia:Where to ask a question, ask me on my talk page, or place {{helpme}}
on your talk page and someone will show up shortly to answer your questions. Again, welcome! --Flockmeal 03:23, Nov 20, 2004 (UTC)
UTF-8 overlong sequences
[edit]You wrote to sunny256:
Hi,
You put a note in the UTF-8 page back in March to the effect that reducing the UTF-8 range from 6 bytes to 4 bytes gets rid of the problem of overlong sequences. In fact I do not believe this is true, but I'm willing to be enlightened if there's something I'm missing!
For example, E0 8E B1 is an overlong encoding for α (alpha) (correct: CE B1) and it doesn't use any of the new invalid characters. Celada 03:24, 2004 Nov 24 (UTC)
- Yes, it seems you are right. I got the impression that this was the case in a discussion on the linux-utf8 mailing list, I don't remember the actual thread.
- I have removed the section you refer to, and I tried to find the reason why RFC3629 limits the sequence length from six to four bytes, but haven't been able to find anything. I'll add a note about this on the UTF-8 talk page.
- Thanks for finding this incorrect information. -- sunny256 2004-12-06