Google does not seem to be indexing the Vista SDK site

by Michael S. Kaplan, published on 2005/10/02 10:59 -04:00, original URI: http://blogs.msdn.com/b/michkap/archive/2005/10/02/476212.aspx


That is right, if you want to get results from windowssdk.msdn.microsoft.com for new Vista NLS API functions like GetDurationFormat, Google is not really able to help very much at the moment, though start.com and search.msn.com are. Yet another reason to keep an open mind about search engines -- you never know who is going to have what indexed. :-)

The funny part -- Google does have my blog indexed and after that crawl has gotten the skinny on new functions I have blogged about like FindNLSString. I am curious how long it will take them to pick up the rest of the new functions that no one has given them the links to yet? :-)


# Maurits [MSFT] on 3 Oct 2005 8:37 PM:

Go to http://windowssdk.msdn.microsoft.com/library/

Look at the source:

<HTML>
<HEAD>
<META NAME="Robots" CONTENT="noindex">
(more...)

This is robot-ese for "go away"

# Michael S. Kaplan on 3 Oct 2005 8:56 PM:

Ok, I have changed my mind -- that makes it a[n obnoxious] bug in MSN Search!

# Michael S. Kaplan on 4 Oct 2005 10:43 AM:

Ok, flip-flopping again. :-)

From the folks on the MSN Search team:

---------------------------

We index anchor text from links that point to the URL and the URL itself for docs that contain <META NAME="Robots" CONTENT="noindex">.

We do not index the document content. There is no contextual description or link to cached page:

windowssdk.msdn.microsoft.com/library

---------------------------

So I am not worried anymore. :-)

# Michael S. Kaplan on 4 Oct 2005 10:57 AM:

Also, it is worth noting that the topic sites for the functions themselves do not have this meta tag. So it looks like Google just hasn't hit this one yet....

They must be too busy working on their new product with Sun. :-)

# Maurits [MSFT] on 4 Oct 2005 11:36 AM:

Yup, my bad. msdn.microsoft.com/library also has the "Robots: noindex" meta tag, and it's indexed.

The inner frame (where the content is) has a <meta name="robots" content="">, which is robot-ese for "pull up a chair, get comfy"

/me wakes up in cold sweat shouting Frames... Frames... Frames...

So I'm guessing no-one gave Google a heads-up that the site exists!

But since Google spiders this blog, it should figure it out eventually.

# Maurits [MSFT] on 8 Oct 2005 5:33 AM:

http://www.google.com/search?hl=en&q=site%3Awindowssdk.msdn.microsoft.com

202K results

# Michael S. Kaplan on 8 Oct 2005 5:48 AM:

Makes you wonder if they should be sending me a thank you card, huh? :-)

# Michael S. Kaplan on 8 Oct 2005 6:25 PM:

Of course, 202,000 links and they still do not have one to GetDurationFormat....

go to newer or older post, or back to index or month or day