twitter  facebook  feedburner  youtube  linkedin

 

Google Links, Scraping Content Policy, and A Polite Bot

GOOGLE Link Tool

You asked, and we listened: We’ve extended our support for querying links to your site to much beyond the link: operator you might have used in the past. Now you can use webmaster tools to view a much larger sample of links to pages on your site that we found on the web. Unlike the link: operator, this data is much more comprehensive and can be classified, filtered, and downloaded. All you need to do is verify site ownership to see this information.

To make this data even more useful, we have divided the world of links into two types: external and internal. Let’s understand what kind of links fall into which bucket.

http://googlewebmastercentral.blogspot.com/2007/02/discover-your-links.html

GooGLE View of Scraping Content

Sites with more content can have more opportunities to rank well in Google. It makes sense that having more pages of good content represent more chances to rank in search engine result pages (SERPs). Some SEOs however, do not focus on the user’s needs, but instead create pages solely for search engines. This approach is based on the false assumption that increasing the volume of web pages with random, irrelevant content is a good long-term strategy for a site. These techniques are usually accomplished by abusing qlweb style catalogues or by scraping content from sources known for good, valid content, like Wikipedia or the Open Directory Project.

http://googlewebmastercentral.blogspot.com/2007/03/site-content-and-use-of-web-catalogues.html

GOOGLE Bot Playing Nice

Search engine robots, including our very own Googlebot, are incredibly polite. They work hard to respect your every wish regarding what pages they should and should not crawl. How can they tell the difference? You have to tell them, and you have to speak their language, which is an industry standard called the Robots Exclusion Protocol.

http://googlewebmastercentral.blogspot.com/2007/03/all-about-robots.html

No related posts.

Comments are closed