I Kana understand you, could you repeater that? (Part 1)

by Michael S. Kaplan, published on 2007/03/29 04:01 -04:00, original URI: http://blogs.msdn.com/b/michkap/archive/2007/03/29/1985735.aspx

I have made bad puns in the titles of blog posts talking about Kana previously (e.g. Knock knock! Who's there? Kana! Kana Who?).

Now, in a bold bit of irony, I am going to repeat that experience here and talk about the repeater. Somewhat repeatedly in a series....

There are several different kinds of repeaters. One of them is (U+3005, a.k.a. IDEOGRAPHIC ITERATION MARK).

As I pointed out in that previous post:

An iteration mark duplicates a letter, so it duplicates the weight and adds a little bit (so it sorts right after)

But now let's add in another character to the mix:


The name is kind of implies the usage -- it is like its non-vertical counterpart, but used in vertical writing. Which is cool since sometimes Kana and also Kanji is written vertically.

(And yes, usually one would see the Hiragana or Katakana iteration marks with Kana, but let's leave that aside for the time being, I am trying to make a different point here!)

It is not like CompareString has a vertical directionality parameter, so what would one do with U+303b?

I mean, is there really more that one could do other than just blindly doing the same thing that one does with U+3005?

Although the name implies the usage, the fact is that they simply have two different appearances. Perhaps one character could have been encoded which conditionally changed its appearance based on whether one was displaying vertically or not, but these characters were added to JIS long before Unicode or OpenType were around, so even if that were desirable, it is too late now.

And whether is desirable is also somewhat questionable -- what about situations where one needs to display U+303b in a horizontal context (say if one is documenting its usage)?

And since the collation functions do not have any kind of directionality-specific behavior and it is unclear whether one should treat the character differently if it is used in the wrong context -- that would be like punishing a spelling error by screwing with the sort order. Hardly a kind behavior....

So in the end, in Vista the two characters have the same behavior. Which is not such a bad thing. :-)

I'll be talking more about the repeater in an upcoming post in this series....


This post brought to you by (U+303b, a.k.a. VERTICAL IDEOGRAPHIC ITERATION MARK)

# wagahai on 29 Mar 2007 8:13 AM:

You example きき -> き々 is not valid.

U+3005 is used to repeat kanji, not kana.

To repeat kana, you may use U+309D, U+309E, U+30FD, or U+30FE. Note that kana repetition marks are fairly old style and not that common anymore; there may be valid literacy problems for younger Japanese. On the other hand, the kanji repetition mark U+3005 is used so often it's hard to go an hour without missing in Japan.

Tōkyō, Japan

# wagahai on 29 Mar 2007 8:15 AM:


The name for U+3005 (IDEOGRAPHIC ITERATION MARK) is also a big clue. Kana, as opposed to kanji (or Han), are not ideographic.

You saw my note here on this issue, right? As I said

(And yes, usually one would see the Hiragana or Katakana iteration marks with Kana, but let's leave that aside for the time being, I am trying to make a different point here!)

I was focusing on a different issue here, to wit the verticalality issue.... - Michael

# Michael S. Kaplan on 29 Mar 2007 8:21 AM:

Well, although not valid, they actually work in Windows, technically (since the repeaters will repeat the previous character, whatever it may be).

So you could think of it as more of a functional example. :-)

# roxfan on 29 Mar 2007 5:11 PM:

Here's a good reference on repetition marks:


referenced by

2007/04/01 I Kana understand you, could you repeater that? (Part 2)

go to newer or older post, or back to index or month or day