Video SEO

Tuesday, 5 September 2006

Better details about when Googlebot last visited a page

Posted on 07:34 by Unknown
Most people know that Googlebot downloads pages from web servers to crawl the web. Not as many people know that if Googlebot sends a request qualified with an If-Modified-Since header and gets a 304 (Not Modified) response, Googlebot doesn't download the contents of that page. This reduces the bandwidth consumed on your web server.
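
To make the exchange concrete, here is a minimal sketch of the decision a web server makes when it sees an If-Modified-Since header. It illustrates the general HTTP mechanism only, not how any particular server or Googlebot is implemented; the function name `respond` and the placeholder page body are assumptions made up for this example.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime

# Hypothetical page content, used only for illustration.
PAGE_BODY = b"<html><body>Hello</body></html>"


def respond(last_modified, if_modified_since=None):
    """Return (status, body) for a request to a single page.

    last_modified: timezone-aware datetime of the page's last change.
    if_modified_since: raw header value sent by the client, or None.
    """
    if if_modified_since is not None:
        try:
            since = parsedate_to_datetime(if_modified_since)
        except (TypeError, ValueError):
            since = None  # unparseable header: treat the request as unconditional
        if since is not None:
            if since.tzinfo is None:
                # Assumption for this sketch: a date without a zone is UTC.
                since = since.replace(tzinfo=timezone.utc)
            if last_modified <= since:
                return 304, b""  # Not Modified: no body is sent
    return 200, PAGE_BODY


# The page last changed on April 12; the crawler asks again on August 27.
page_changed = datetime(2006, 4, 12, 20, 2, 6, tzinfo=timezone.utc)
header = format_datetime(
    datetime(2006, 8, 27, 13, 13, 37, tzinfo=timezone.utc), usegmt=True
)
status, body = respond(page_changed, header)
```

In this sketch the second request gets a 304 with an empty body, which is where the bandwidth saving comes from.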

When you look at Google's cache of a page (for instance, by using the cache: operator or clicking the Cached link under a URL in the search results), you can see the date that Googlebot retrieved that page. Previously, the date we listed for the page's cache was the date that we last successfully fetched the content of the page. This meant that even if we visited a page very recently, the cache date might be quite a bit older if the page hadn't changed since the previous visit. This made it difficult for webmasters to use the cache date we display to determine Googlebot's most recent visit. Consider the following example:
  1. Googlebot crawls a page on April 12, 2006.
  2. Our cached version of that page notes that "This is G o o g l e's cache of http://www.example.com/ as retrieved on April 12, 2006 20:02:06 GMT."
  3. Periodically, Googlebot checks to see if that page has changed, and each time it receives a Not Modified response. For instance, on August 27, 2006, Googlebot checks the page, receives a Not Modified response, and therefore doesn't download the contents of the page.
  4. On August 28, 2006, our cached version of the page still shows the April 12, 2006 date (the date we last downloaded the page's contents), even though Googlebot last visited the day before.
We've recently changed the date we show for the cached page to reflect when Googlebot last accessed it (whether the page had changed or not). This should make it easier for you to determine the most recent date Googlebot visited the page. For instance, in the above example, the cached version of the page would now say "This is G o o g l e's cache of http://www.example.com/ as retrieved on August 27, 2006 13:13:37 GMT."
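
The difference between the two behaviors can be sketched as a toy model. This is not Google's implementation; the class `CacheRecord` and its field and method names are hypothetical, chosen only to mirror the example above.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import Optional


@dataclass
class CacheRecord:
    """Toy bookkeeping for one cached page (hypothetical names)."""
    content_fetched: Optional[datetime] = None  # last time the body was downloaded
    last_visited: Optional[datetime] = None     # last time the page was checked

    def record_visit(self, status: int, when: datetime) -> None:
        # Every visit counts as an access, whether or not the page changed.
        self.last_visited = when
        # Only a 200 response carries new content to store.
        if status == 200:
            self.content_fetched = when

    def old_cache_date(self) -> Optional[datetime]:
        # Previous behavior: the date of the last successful content fetch.
        return self.content_fetched

    def new_cache_date(self) -> Optional[datetime]:
        # Current behavior: the date of the most recent access.
        return self.last_visited


record = CacheRecord()
record.record_visit(200, datetime(2006, 4, 12, 20, 2, 6, tzinfo=timezone.utc))
record.record_visit(304, datetime(2006, 8, 27, 13, 13, 37, tzinfo=timezone.utc))
```

After these two visits, `old_cache_date()` still returns the April 12 timestamp, while `new_cache_date()` returns the August 27 one, matching the before and after wording of the cache notice.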

Note that this change will be reflected for individual pages as we update those pages in our index.
Posted in crawling and indexing, search results