Pete Freitag

Google Sitemaps Protocol

web

Google has introduced an XML schema for describing your site's structure and update frequency, called the Google Sitemap Protocol. The protocol is offered under a Creative Commons license.

The Sitemap Protocol allows you to inform search engine crawlers about URLs on your Web sites that are available for crawling. A Sitemap consists of a list of URLs and may also contain additional information about those URLs, such as when they were last modified, how frequently they change, etc.

The Google Sitemap Protocol is especially useful for websites with Flash user interfaces, because Google may not otherwise index the links.

It is also handy because you can save bandwidth by telling Google when each page was last modified, so the crawler does not need to re-fetch pages that have not changed.

Here's an example sitemap XML file:

<?xml version="1.0" encoding="utf-8" ?>
<urlset xmlns="http://www.google.com/schemas/sitemap/0.84">
	<url>
		<loc>https://www.petefreitag.com/</loc>
		<lastmod>2005-06-06</lastmod>
		<changefreq>daily</changefreq>
		<priority>0.7</priority>
	</url>
	<url>
		<loc>https://www.petefreitag.com/tags/</loc>
		<priority>0.2</priority>
	</url>
</urlset>
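
If you would rather generate a file like the one above than write it by hand, here is a minimal Python sketch using the standard library's xml.etree.ElementTree. The function name build_sitemap and the dictionary-based input format are my own assumptions for illustration and are not part of the protocol; only the element names and the namespace come from the example above.

```python
import xml.etree.ElementTree as ET

# Namespace taken from the example sitemap above.
SITEMAP_NS = "http://www.google.com/schemas/sitemap/0.84"


def build_sitemap(entries):
    """Return sitemap XML as UTF-8 bytes.

    `entries` is a list of dicts (a hypothetical input format). Each dict
    needs a "loc" key and may have "lastmod", "changefreq", and "priority".
    """
    # Serialize the sitemap namespace as the default xmlns, without a prefix.
    ET.register_namespace("", SITEMAP_NS)
    urlset = ET.Element(f"{{{SITEMAP_NS}}}urlset")
    for entry in entries:
        url = ET.SubElement(urlset, f"{{{SITEMAP_NS}}}url")
        for tag in ("loc", "lastmod", "changefreq", "priority"):
            if tag in entry:
                child = ET.SubElement(url, f"{{{SITEMAP_NS}}}{tag}")
                # ElementTree escapes &, <, and > in text when serializing.
                child.text = str(entry[tag])
    return ET.tostring(urlset, encoding="utf-8", xml_declaration=True)


if __name__ == "__main__":
    pages = [
        {"loc": "https://www.petefreitag.com/", "lastmod": "2005-06-06",
         "changefreq": "daily", "priority": 0.7},
        {"loc": "https://www.petefreitag.com/tags/", "priority": 0.2},
    ]
    print(build_sitemap(pages).decode("utf-8"))
```

Because the XML library does the escaping, a URL containing & ends up as &amp; in the output without any extra work.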

A couple of notes on the sitemaps protocol:

  • Sitemap files must use UTF-8 encoding
  • You can compress sitemap files using gzip
  • A sitemap file can't be larger than 10 MB uncompressed
  • A sitemap can't contain more than 50,000 URLs
  • The priority is relative only to pages on the same site
  • The default priority is 0.5
  • URLs must be XML-safe (in ColdFusion, use XMLFormat())
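
A few of the notes above (XML-safe URLs, the size and URL-count limits, gzip compression) can be checked in code before you publish a sitemap. The following Python sketch is only an illustration: the names xml_safe and check_and_compress are hypothetical, and I am assuming that "10 MB" means 10 * 1024 * 1024 bytes.

```python
import gzip
from xml.sax.saxutils import escape

MAX_URLS = 50_000
# Assumption: the 10 MB limit is read as 10 * 1024 * 1024 bytes.
MAX_BYTES = 10 * 1024 * 1024


def xml_safe(url):
    """Escape &, <, >, and both quote characters for use in XML text."""
    return escape(url, {'"': "&quot;", "'": "&apos;"})


def check_and_compress(xml_bytes, url_count):
    """Check the limits listed above, then return the gzip-compressed sitemap."""
    if url_count > MAX_URLS:
        raise ValueError(f"{url_count} URLs exceeds the limit of {MAX_URLS}")
    if len(xml_bytes) > MAX_BYTES:
        raise ValueError("uncompressed sitemap is larger than 10 MB")
    return gzip.compress(xml_bytes)


if __name__ == "__main__":
    print(xml_safe("https://example.com/?a=1&b=2"))
    compressed = check_and_compress(b"<urlset></urlset>", url_count=0)
    print(len(compressed), "bytes after gzip")
```

Note that the size check runs on the uncompressed bytes, since that is what the limit applies to.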

You can find the sitemap protocol specification here.

Google recommends that you store your sitemap at /sitemap.gz. You need to manually add your sitemaps to the search engines; Google's page for that is here.



Google Sitemaps Protocol was first published on June 06, 2005.


Comments

I have created a pretty nice little application that will manage all of this for your site(s). It's also gzip enabled. CHECK IT: http://googlebotsnacks.com/index.cfm/fuseaction/snacks.SitePageSnack/Google_coldfusion_code/fusebox/searchtext/Google%20Site%20Page%20Snack%20more/site_page_snack.htm
by JOSH on 04/20/2007 at 10:23:01 AM UTC
Hi, I just added a sitemap.xml file to my site http://www.orchard-endowments.com but when I checked it for errors I found that it should have a doctype. Could you let me know the correct syntax for this page element, as I'm a little foggy? Thanks.
by shaun on 05/25/2007 at 7:22:18 PM UTC
praveen, to make a sitemap and be successful with Google, I use Google Webmaster or the GoDaddy hosting service. They have a sitemap creator and it works great.
by VILLY on 12/06/2008 at 7:09:58 AM UTC