Sorting it All Out: Search Archives

Sorted descending by date published.


  1. The last word on the FINAL SIGMA (2005/05/26)

    Back in the beginning of April, I explained about the one scenario where casing does not need to roundtrip in .NET -- the Greek final sigma. Anyway, the day before yesterday I got an email from someone…

  2. Similar descriptions does not mean similar methodologies (2005/05/08)

    …rsions of Win9x. Kind of ironic when you note the small behavior differences between them all, huh? 2) Given the Georgian casing issue, it is a little sad that almost all of these functions that convert prior to comparison use a lowercasing operation…

  3. Intelligent unmanaged string comparison (2005/04/26)

    …file system (or of Win32 namespace objects like the names for events, named pipes, mutexes, etc.) then you should be uppercasing the string and then doing a binary comparison. If you are not, then you have to ask yourself why are you bothering to com…

  4. Invariant and Ordinal Redux (2005/04/13)

    …es surrounding Invariant and Ordinal comparisons will probably have a prominent place in that list (as will some of those casing issues). This post brought to you by "ლ" (U+10da, a.k.a. GEORGIAN LETTER LAS) # Ben on 13 Apr 2005 10:39 PM: Michael, I a…

  5. What the %#$* is wrong with German sorting? (2005/04/10)

    …rman language, the Sharp S ("ß" or U+00df) is a lowercase letter, and it capitalizes to the letters "SS". Now Microsoft's casing tables only support simple Unicode casing, which does not include any rules that would change the size of the string such…

  6. When casing does not need to roundtrip in .NET (2005/04/04)

    …ER FINAL SIGMA. You can read about it here. In the deep, dark past I had also talked about the meaning of "linguistic casing" on Windows. I never did talk about when/if that setting is used in .NET Framework (I wonder if anyone noticed I palmed t…

  7. Every character has a story #8: U+03c2 (GREEK SMALL LETTER FINAL SIGMA) (2005/03/31)

    … same result. For example, both a sigma and a final sigma uppercase to a capital sigma. and Q: Near the end of the SpecialCasing.txt, there are the two lines on SIGMA that look weird to me. Can you explain them: # 03C3; 03C2; 03A3; 03A3; FINAL; # GRE…

  8. What does the the third letter in GIFT stand for? (2005/03/30)

    … But their work is no less important, in fact I would argue it is often more important since the utility of collation and casing and encoding is pretty limited if you can't see what the characters are (only people on the NLS team get good at speaking…

  9. Linguistic and Unicode considerations (or Language-specific Processing #4) (2005/03/21)

    …act that I will tentatively consider depressing for Turkic languages like Azeri and Turkish but might hope for linguistic casing rules that would do the right thing for these scenarios. :-) Nonbreaking Spaces actually talks about all sorts of charact…

  10. Dere are qvestions? In zat case... (2005/03/10)

    …; "straße" is wrong (unless there is a verb "strassen"). Windows and the .NET Framework mainly support simple, reversible casing -- which is to say single code point casing that has ToUpper() and ToLower() as inverse operations that can "undo" each …

  11. And yet another blogger to keep an eye on! (2005/03/04)

    …nce that she already thinks of the over-overloaded functionality in LCMapString as a huge hack!). There was no linguistic casing flag added, to match LCMAP_LINGUISTIC_CASING (I describe what that flag does when I asked What does "linguistic casing" m…

  12. "Michael, why does ToTitleCase suck so much?" (2005/03/04)

    …case." It does so by defining the word breaking characters as follows: The following are valid word separators for proper casing: Null (Chr$(0)), horizontal tab (Chr$(9)), linefeed (Chr$(10)), vertical tab (Chr$(11)), form feed (Chr$(12)), carriage r…

  13. An interview question (by popular demand) (2005/02/28)

    … almost any point. I have asked similar questions with Unicode, encodings, parsing, formatting, calendars, normalization, casing, string iteration, and more. They seem to work well, and I can usually get a good idea of how a candidate approaches a ne…

  14. My apparent obsession with "case" puns (2005/01/16)

    …e to make a posting about case without involving a pun in the title of the posting. To wit: Get off my [lower] case! (or: Casing, the 1st) The [Upper]Case of the Turkish İ (or: Casing, the 2nd) How [case-]insensitive (apologies to Frank Sinatra) Now …

  15. How [case-]insensitive (apologies to Frank Sinatra) (2005/01/16)

    Tor Lillqvist noted that in some of my previous entries on casing (cf: Get off my [lower] case! (or: Casing, the 1st) and The [Upper]Case of the Turkish İ (or: Casing, the 2nd)) I made so…

  16. Comparison confusion: INVARIANT vs. ORDINAL (2004/12/29)

    …ome people wanted a case-insensitive ordinal comparison?!? Now the closet linguist in me shudders at this concept since a casing operation is essentially a linguistic one while an ordinal one is specifically not -- it's lexicographic. So people are a…

  17. What does "linguistic casing" mean? (2004/12/11)

    The LCMapString API is the ultimate Win32 API that does casing. All of those functions in USER32 (CharUpper, CharLower, CharUpperBuff, and CharLowerBuff) are wrappers around calls to L…

  18. "My language is better than yours!" he said. He was wrong.... (2004/12/06)

    …irtual for over a decade? Well I have had to look over code in the NT boot loader a few months ago that loads the NLS casing tables, the very same ones that are later used by kernel32.dll. This data gets used so early in the boot cycle that it ha…

  19. The [Upper]Case of the Turkish İ (or: Casing, the 2nd) (2004/12/03)

    …res of developers scan through thousands of files in the .NET Framework to (among other things) properly not use "Turkic" casing when trying to look at the filesystem or the registry. It's amazing how difficult and expensive it can be to make a produc…

  20. Get off my [lower] case! (or: Casing, the 1st) (2004/12/02)

    …this, he was referring to legal jurisprudence. But believe it or not, the quote can be just as applicable to alphabetical casing operations! For reasons that surpass my understanding, there are many developers who run code that uppercases a lowercasi…

  21. They ask me "why is my Korean text in random order?" (2004/11/30)

    …n on whether or not to ignore spaces (or what C calls whitespace) is separate from a decision on whether or not to ignore casing (upper/lower/title) is separate from a decision on whether or not to ignore width (at least in Japanese character sets) i…

Showing results 101–121.