Video SEO

Thursday, 31 August 2006

How search results may differ based on accented characters and interface languages

Posted on 17:22 by Unknown
When a searcher enters a query that includes a word with accented characters, our algorithms consider web pages that contain versions of that word both with and without the accent. For instance, if a searcher enters [México], we'll return results for pages about both "Mexico" and "México."



Conversely, if a searcher enters a query without using accented characters, but a word in that query could be spelled with them, our algorithms consider web pages with both the accented and non-accented versions of the word. So if a searcher enters [Mexico], we'll return results for pages about both "Mexico" and "México."
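To make the idea concrete, here is a minimal sketch of accent-insensitive matching. It is only an illustration, not a description of how our algorithms are implemented: it assumes that "equivalent" means "equal once combining accent marks are removed," and the helper names `strip_accents` and `matches` are hypothetical.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Remove combining accent marks, e.g. 'México' -> 'Mexico'."""
    # NFD splits an accented letter into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def matches(query_word: str, page_word: str) -> bool:
    """Treat accented and unaccented spellings as the same word (assumption)."""
    # Case-insensitive comparison is an extra assumption of this sketch.
    return strip_accents(query_word).casefold() == strip_accents(page_word).casefold()
```

With this sketch, `matches("Mexico", "México")` and `matches("México", "Mexico")` are both true, which mirrors the two cases described above.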



How the searcher's interface language comes into play
The searcher's interface language is taken into account during this process. For instance, the set of accented characters that are treated as equivalent to non-accented characters varies based on the searcher's interface language, as language-level rules for accenting differ.
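As a rough illustration of language-dependent equivalence, the sketch below folds accents differently per interface language. The `PRESERVED` table and the `fold_for_language` function are hypothetical, and the rule shown (keeping "ñ" distinct from "n" for a Spanish interface, since Spanish treats it as a separate letter) is an assumption chosen for the example, not a statement of the actual rules we apply.

```python
import unicodedata

# Hypothetical per-language rules: characters that are NOT folded because
# the language treats them as distinct letters. Illustrative only.
PRESERVED = {
    "es": {"ñ", "Ñ"},
    "en": set(),
}


def fold_for_language(text: str, lang: str) -> str:
    """Strip accents from text, except characters preserved for this language."""
    keep = PRESERVED.get(lang, set())
    out = []
    # NFC first, so each accented letter is a single character we can look up.
    for ch in unicodedata.normalize("NFC", text):
        if ch in keep:
            out.append(ch)
            continue
        decomposed = unicodedata.normalize("NFD", ch)
        out.append("".join(c for c in decomposed if not unicodedata.combining(c)))
    return "".join(out)
```

Under these assumed rules, "año" stays "año" for a Spanish interface but folds to "ano" for an English one, while "México" folds to "Mexico" in both.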

Also, documents in the chosen interface language tend to be considered more relevant. If a searcher's interface language is English, our algorithms assume that the queries are in English and that the searcher prefers to see English-language documents.

This means that the search results for the same query can vary depending on the searcher's interface language. They can also vary depending on the searcher's location (which is based on IP address) and on whether the searcher chooses to see results only in a specified language. If the searcher has personalized search enabled, that will also influence the search results.

The example below illustrates the results returned when a searcher queries [Mexico] with the interface language set to Spanish.



Note that when the interface language is set to Spanish, more results with accented characters are returned, even though the query didn't include the accented character.

How to restrict search results
To obtain search results for only a specific version of the word (with or without accented characters), you can place a + before the word. For instance, the search [+Mexico] returns only pages about "Mexico" (and not "México"), and the search [+México] returns only pages about "México" (and not "Mexico"). Note that you may see some search results that don't appear to use the version of the word you specified in your query. In those cases, that version of the word may appear within the content of the page or in anchor text pointing to the page, rather than in the title or description listed in the results. (You can see the top anchor text used to link to your site by choosing Statistics > Page analysis in webmaster tools.)
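The difference between a plain term and a + term can be sketched as follows. This is a simplified model under stated assumptions, not our actual query handling: the function name `term_matches` is hypothetical, a + term is assumed to require the exact accented or unaccented spelling, and case is assumed to be ignored in both modes.

```python
import unicodedata


def strip_accents(text: str) -> str:
    """Remove combining accent marks, e.g. 'México' -> 'Mexico'."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def term_matches(query_term: str, page_word: str) -> bool:
    """Match a query term against a word found on a page or in anchor text."""
    if query_term.startswith("+"):
        # Exact-version match: accents must agree (case still ignored).
        wanted = unicodedata.normalize("NFC", query_term[1:]).casefold()
        found = unicodedata.normalize("NFC", page_word).casefold()
        return wanted == found
    # Default: accented and unaccented versions are treated as equivalent.
    return strip_accents(query_term).casefold() == strip_accents(page_word).casefold()
```

Here `term_matches("+Mexico", "México")` is false, while `term_matches("Mexico", "México")` is true.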

The example below illustrates the results returned when a searcher queries [+Mexico].

Posted in localization, search results