New in Vista Beta 1: Updated OS casing tables

by Michael S. Kaplan, published on 2005/08/20 16:15 -04:00, original URI: http://blogs.msdn.com/b/michkap/archive/2005/08/20/454071.aspx


Not too long ago, I posted about New in Vista Beta 1: giving more strings weight, where I explained how in Vista we are adding all of the assigned code points in Unicode 4.1 to the collation tables.

Well, there is another set of tables that needed such an update -- tables that have not been updated in a long time. The casing tables.

Over time this just gets worse, especially as most of the characters that have been added to Unicode have made their way into our collation tables and our fonts. The result is that NTFS and the rest of the "case insensitive" parts of the operating system look as if they are only case insensitive some of the time.

It had actually been getting a little embarrassing, so this was a good thing to get fixed.

Finally, in Vista, the casing table is being updated to cover all of the Unicode 4.1 values.

Now as a bonus, the Georgian issue gets addressed as a part of this update. :-)
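To make the "only case insensitive some of the time" effect concrete, here is a minimal sketch of a table-driven, case-insensitive name comparison. It is not the actual Windows or NTFS code: the helpers `upcase` and `same_name`, the two tiny tables, and the file names are all hypothetical, made up for illustration. The only fact it relies on is that Unicode 4.1 maps U+2D00 (GEORGIAN SMALL LETTER AN) to U+10A0 (GEORGIAN CAPITAL LETTER AN) as its uppercase form.

```python
def upcase(text, table):
    """Upper-case text one character at a time using a lookup table.

    Characters missing from the table are left unchanged, which is
    exactly how a stale table ends up treating newer letters.
    """
    return "".join(table.get(ch, ch) for ch in text)


def same_name(a, b, table):
    """Hypothetical case-insensitive comparison: upcase both, then compare."""
    return upcase(a, table) == upcase(b, table)


# Hypothetical stale table: knows ASCII a-z only.
old_table = {chr(c): chr(c - 32) for c in range(ord("a"), ord("z") + 1)}

# Hypothetical updated table: the same entries plus one Unicode 4.1 pair,
# U+2D00 (small letter an) -> U+10A0 (capital letter an).
new_table = dict(old_table)
new_table["\u2d00"] = "\u10a0"

lower_name = "\u2d00.txt"
upper_name = "\u10a0.TXT"

print(same_name(lower_name, upper_name, old_table))
print(same_name(lower_name, upper_name, new_table))
```

With the stale table the ASCII extension matches but the Georgian letter does not, so the two names compare as different; with the updated table they compare as equal.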

 

This post brought to you by "ⴀ" (U+2d00, GEORGIAN SMALL LETTER AN)
One of the new Nuskhuri Georgian letters, added in Unicode 4.1


# JS on 24 Oct 2007 3:42 PM:

$UpCase is still 131,072 bytes on Vista -- are non-BMP characters always "case sensitive" as far as NTFS is concerned?

# Michael S. Kaplan on 24 Oct 2007 5:20 PM:

Correct, the OS does not upcase the "off the BMP" characters....
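The arithmetic behind this exchange can be sketched as follows, assuming (as the 131,072-byte size suggests) that the table holds one 2-byte UTF-16 code unit for each of the 65,536 BMP values. The table built here is filled from Python's own Unicode data purely for illustration; it is not the contents of $UpCase, and `build_table`, `to_units` and `upcase_units` are hypothetical helper names.

```python
import struct

BMP_CODE_UNITS = 0x10000   # 65,536 possible 16-bit code units
BYTES_PER_UNIT = 2         # one UTF-16 code unit per entry (assumption)


def build_table():
    """Build an illustrative per-code-unit upcase table.

    Entries come from Python's Unicode data, not from NTFS. Any mapping
    that does not fit in a single BMP code unit is left as identity.
    """
    table = []
    for unit in range(BMP_CODE_UNITS):
        upper = chr(unit).upper()
        if len(upper) == 1 and ord(upper) < BMP_CODE_UNITS:
            table.append(ord(upper))
        else:
            table.append(unit)
    return table


def to_units(text):
    """Encode text as a list of UTF-16 code units."""
    data = text.encode("utf-16-le", "surrogatepass")
    return list(struct.unpack("<%dH" % (len(data) // 2), data))


def upcase_units(units, table):
    """Upcase one code unit at a time, as a 65,536-entry table must."""
    return [table[u] for u in units]


table = build_table()
table_bytes = len(table) * BYTES_PER_UNIT
print(table_bytes)

# U+10428 (DESERET SMALL LETTER LONG I) lies outside the BMP, so it is
# stored as two surrogate code units, and surrogates have no case mapping.
small = to_units("\U00010428")
capital = to_units("\U00010400")
print(upcase_units(small, table) == capital)
```

A table indexed by single code units only ever sees the two surrogate halves of a supplementary character, never the character itself, so under this model the small and capital Deseret letters stay distinct while a BMP letter such as U+2D00 is upcased normally.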



referenced by

2007/12/27 No upproblems with $UpCase when you do a upVista upgrade

2007/10/24 In Case you have problems that you might think are ǸȦȘȚȲ

2007/04/26 No Regex in the Unicode room! (and no sex in the champagne room, either!)

2007/04/25 The nature of OrdinalIgnoreCase vs. intuitive expectations
