Spiderline
custom search engine solutions
Your Own Search Engine.
Just seconds after registering, your web site can be searchable with the features you want and reliability you need. No software to install or maintenance required. Search results can match your website design seamlessly.

Site Search Knowledge Base

Search  
   
Browse by Category
Site Search Knowledge Base .: Crawl Questions .: Robot META tags Tutorial.

Robot META tags Tutorial.

The Robots META tag is another method that may be used to indicate to visiting robots whether a page should be indexed (crawled), or links on the page should be followed. It differs from the Protocol for Robots Exclusion in that you need no effort or permission from your Web Server Administrator.

The content of the robots META tag contains directives separated by commas. You can define [no]index, [no]follow, all, or none. The INDEX directive specifies if an indexing robot should index the page. While a robot crawls around your web site, it collects information about the words and links on each page; this is the process of indexing. The FOLLOW directive specifies if a robot is to follow links on the page. The defaults are INDEX and FOLLOW. The values ALL and NONE set all directives on or off: all=index,follow and none=noindex,nofollow. NOTE: The "robots" name of the tag and the content are case insensitive.

Like any META tag it should be placed beteen the <head></head> tags of an HTML page:

  <html>
<head>
<meta name="robots" content="none">
<meta name="description" content="This page ....">
<title>...</title>
</head>
<body>
...


Examples:

HTML page you do not want crawled/indexed:
  <meta name="robots" content="noindex">
HTML page you want crawled, but do not want the robot to follow the links on that page:
  <meta name="robots" content="nofollow">
HTML page you do not want crawled AND do not want the robot to follow the links on that page:
  <meta name="robots" content="none">

How helpful was this article to you?

Related Articles

article Robot Exclusion Guide
The robots.txt file and robot META tags are methods used to allow and disallow crawling portions of your site by robots (web robots, spiders). Website administrators and content providers can...

  2005-01-20    Views: 9327   
article How do robot meta tags work?
The Robots META tag is another method that may be used to indicate to visiting robots whether a page should be indexed (crawled), or links on the page should be followed. It differs from the...

(No rating)  2005-04-27    Views: 4635   
article Excluding crawler from sections of pages.
This help topic describes how to prevent sections of a document from being indexed. To prevent an entire document from being indexed, see the topics above. Spiderline supports the proprietary...

(No rating)  2005-01-20    Views: 178504   


.: Powered by Lore 1.5.3

Powered by Lucene