Should old acquaintance *not* be forgot, code pages may screw up their names anyhow

by Michael S. Kaplan, published on 2007/08/12 00:01 -04:00, original URI: http://blogs.msdn.com/b/michkap/archive/2007/08/11/4342275.aspx


I can always count on Cristian Secară to send me mail about random internationalization issues, whether or not they are in Microsoft products. :-)

Just the other day he pointed a new one out to me, this one in Community Server (the technology behind SIAO, the rest of the MSDN Blogs, and others):

Hi,

On all the MSDN blogs, including yours, there is a problem with my "stored" name after posting a comment (the "remember me" feature), when I come back or when I simply refresh the page.

See the snapshots. Usually I am using Opera, but it does the same with IE.

It does the same on Tudor Galoș's blog, for example (http://studentclub.ro/tudorg/default.aspx), whose blog structure I suspect is based on the same software as the MSDN one.

At the same time, there are other various blogs around, some also utf-8 encoded, where my remembered name always appears correctly.

It is not that tragic, but it's annoying. When posting, if I forget to delete the two „Äƒ” UTF-8 low-level characters displayed and replace them with my „ă”, then the posted comment will also be displayed back with „Äƒ”.

Maybe you can report this to whoever might be interested in correcting this bug, assuming it is not a known bug ...

Thank you,
Cristi

I'll include the screenshots he included so you know what we are talking about (both for IE and for Opera):

 

 

Regular readers (including Cristi!) might be reminded of Consistent garbage text can be incorrect encoding identification (or detection) or Do not adjust your browser, a.k.a. sometimes two wrongs DO make a right, a.k.a. dumb quotes or even the side effect I discussed in Linguistic and Unicode considerations (or Language-specific Processing #4).

This time the bug is in the site cookie that Community Server creates, which is writing out the name to remember in UTF-8 but then reading it back as something else (probably Encoding.Default?).
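
To make that suspected mismatch concrete, here is a minimal sketch in Python (not the .NET code Community Server actually runs, which the post does not show). The function names are hypothetical, and the choice of Windows-1252 as the "something else" is an assumption standing in for whatever Encoding.Default resolves to on the server:

```python
# Hypothetical helpers: they only model the suspected write/read mismatch,
# not Community Server's real cookie handling.

def write_cookie_bytes(name: str) -> bytes:
    """Model the 'write' side: the remembered name is stored as UTF-8."""
    return name.encode("utf-8")


def read_cookie_wrong(raw: bytes) -> str:
    """Model the suspected 'read' side: the same bytes decoded as cp1252.

    Assumption: the server's default ANSI code page is Windows-1252.
    """
    return raw.decode("cp1252")


def read_cookie_right(raw: bytes) -> str:
    """What the read side should do: decode with the encoding used to write."""
    return raw.decode("utf-8")


if __name__ == "__main__":
    for name in ("Cristian Secară", "Johannes Rössel"):
        raw = write_cookie_bytes(name)
        # The wrong decode turns each two-byte UTF-8 sequence into two characters.
        print(name, "->", read_cookie_wrong(raw), "|", read_cookie_right(raw))
```

Under that assumption, „ă” (U+0103, bytes C4 83 in UTF-8) comes back as the two characters „Äƒ”, and „ö” comes back as „Ã¶”, which matches the kind of garbage described in the mail.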

Probably something for the Telligent folks to take a look at (they are the owners of Community Server)....

 

This post brought to you by ƒ (U+0192, a.k.a. LATIN SMALL LETTER F WITH HOOK)


# Dan on 12 Aug 2007 6:15 AM:

How do I receive updates to the posts?

-Dan Grosp
CEO
grosp.com

# Michael S. Kaplan on 12 Aug 2007 8:53 AM:

????

Not sure what you mean, Dan.

# Kemp on 12 Aug 2007 11:49 AM:

Directly under the post (above this comment section) it states:

"Comment Notification

If you would like to receive an email when updates are made to this post, please register here

Subscribe to this post's comments using RSS"

Is that what you are looking for?

# Johannes Roessel on 12 Aug 2007 3:40 PM:

Hmm, yeah, that name issue happens to me, too. That's why I switched to oe instead of ö.

# Michael S. Kaplan on 12 Aug 2007 5:18 PM:

Of course you could just not save the name, and type in 'Johannes Rössel' each time until the bug gets fixed. If you ever forget, you already have the 'oe' name remembered. :-)


