Last summer I wrote some thoughts about something like a mod_ping for Apache so that search engines could be easily notified when pages on a site change. I was trying to abstract the idea of pings and trackbacks in use by blogs into a general feature that could be used for any site, even one made out of static files.
The announcement about Google Sitemaps reminded me very much about my mod_ping idea. It isn’t the same, but the goal seems to be the same, providing a way for search engines to discover URLs and when they change. SearchEngineWatch has an article about it which provides a brief overview of what Google is up to. More information can be found in the help for there sitemap generator tool (and the Source Forge site for the tool), the Google Sitemaps FAQ and the Sitemap protocol page.
They specifically mention a hope that servers (Apache & IIS) will support this in the future. In the mean time you can manually ping Google for sitemap updates using something like curl, wget or even your web browser I suppose. I’d expect this feature to be built into certain web tools, like blogs and content management systems. I’m sure someone will get around to writing a tool for WordPress to generate a sitemap file, adding to it each time an entry is published and then ping Google to let them know it has been updated.
Will other web search companies adopt this? Keep an eye on Yahoo!, MSN, AOL, A9 and IceRocket to see if this goes anywhere. I don’t think that this will be limited to the “traditional” search folks, I’d think that someone at Technorati, PubSub and maybe Bloglines might come up with some clever uses for this. If we are really lucky people will learn from history and come up with something like feedmesh for sitemap pings.
For now I’ve whipped up a very basic sitemap file at http://joseph.randomnetworks.com/sitemap.xml and pinged Google to let them know about it.
2 replies on “Google Sitemaps”
in all the material about sitemaps i do not see a way to tap into the information from outside – for instance what if i wanted to get the sitemap (or RSS feed) from another site – it would be great if i could get it from google sitemaps directly rather than use something like feedburner
[…] I stopped by Joseph’s blog a few days ago and read the article about Vint Cerf moving to Google. This got me thinking about an earlier Google-related post about sitemaps on Joseph’s blog, where he wondered about a WordPress plugin to generate Google sitemaps. I figured that the WordPress community had probably finished with this by now, and it seems that they have. This article discusses the development of several such plugins and their features. If you check this link, you can peruse a menu of sitemap plugins. […]