Question:
Please tell me about robots.txt?
hbk_star2006
2009-04-07 02:11:14 UTC
Hi friends,
Please tell me about the robots.txt file on a web server.
We also use the meta robots tag in web pages.
Please explain everything in detail; I have to use this tag on my website.
Four answers:
Nithya
2009-04-07 03:19:52 UTC
The robots.txt is a TEXT file (not HTML!) which has a section for each robot to be controlled. Each section has a user-agent line which names the robot to be controlled and has a list of "disallows" and "allows". Each disallow will prevent any address that starts with the disallowed string from being accessed. Similarly, each allow will permit any address that starts with the allowed string from being accessed. The (dis)allows are scanned in order, with the last match encountered determining whether an address is allowed to be used or not. If there are no matches at all then the address will be used.



Here's an example:



user-agent: FreeFind
disallow: /mysite/test/
disallow: /mysite/cgi-bin/post.cgi?action=reply
disallow: /a



In this example the following addresses would be ignored by the spider:



http://adomain.com/mysite/test/index.html
http://adomain.com/mysite/cgi-bin/post.cgi?action=reply&id=1
http://adomain.com/mysite/cgi-bin/post.cgi?action=replytome
http://adomain.com/abc.html



and the following ones would be allowed:



http://adomain.com/mysite/test.html
http://adomain.com/mysite/cgi-bin/post.cgi?action=edit
http://adomain.com/mysite/cgi-bin/post.cgi
http://adomain.com/bbc.html



It is also possible to use an "allow" in addition to disallows. For example:



user-agent: FreeFind
disallow: /cgi-bin/
allow: /cgi-bin/Ultimate.cgi
allow: /cgi-bin/forumdisplay.cgi



This robots.txt file prevents the spider from accessing any cgi-bin address except Ultimate.cgi and forumdisplay.cgi.



Using allows can often simplify your robots.txt file.
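
The matching rule described above (prefix match, scanned in order, last match wins, no match means allowed) can be sketched in a few lines of Python. This is only an illustration of the rule as described in this answer, not FreeFind's actual code; the function name is_allowed and the (kind, prefix) rule format are made up for the example.

```python
from urllib.parse import urlsplit

def is_allowed(rules, url):
    """Apply (kind, prefix) rules in file order; the last matching rule wins.

    kind is "allow" or "disallow". This mirrors the rule described in the
    answer and is a hypothetical helper, not any spider's real code.
    """
    parts = urlsplit(url)
    # Match against the path plus the query string, as in the post.cgi example.
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query

    allowed = True  # no match at all: the address is used
    for kind, prefix in rules:
        # An empty prefix (a bare "disallow:") blocks nothing, so skip it.
        if prefix and target.startswith(prefix):
            allowed = (kind == "allow")
    return allowed

first_example = [
    ("disallow", "/mysite/test/"),
    ("disallow", "/mysite/cgi-bin/post.cgi?action=reply"),
    ("disallow", "/a"),
]
cgi_example = [
    ("disallow", "/cgi-bin/"),
    ("allow", "/cgi-bin/Ultimate.cgi"),
    ("allow", "/cgi-bin/forumdisplay.cgi"),
]

print(is_allowed(first_example, "http://adomain.com/abc.html"))            # False
print(is_allowed(first_example, "http://adomain.com/bbc.html"))            # True
print(is_allowed(cgi_example, "http://adomain.com/cgi-bin/Ultimate.cgi"))  # True
print(is_allowed(cgi_example, "http://adomain.com/cgi-bin/other.cgi"))     # False
```

Other robots may resolve conflicts between allow and disallow differently (for example, first match or longest match), so check the documentation of the spider you care about.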



Here's another example which shows a robots.txt with two sections in it: one for "all" robots and one for the FreeFind spider:



user-agent: *
disallow: /cgi-bin/

user-agent: FreeFind
disallow:



In this example all robots except the FreeFind spider will be prevented from accessing files in the cgi-bin directory. FreeFind will be able to access all files (a disallow with nothing after it means "allow everything").
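
If you want to check how a two-section file like this is read, Python's standard urllib.robotparser module can parse the text directly. A small sketch follows; the helper name can_fetch and the robot name "SomeOtherBot" are made up for the example, and the parser is Python's, so it only approximates what any particular spider does.

```python
from urllib.robotparser import RobotFileParser

# The two-section example from above, as it would appear in the file.
ROBOTS_TXT = """\
user-agent: *
disallow: /cgi-bin/

user-agent: FreeFind
disallow:
"""

def can_fetch(agent, url, robots_txt=ROBOTS_TXT):
    """Return True if `agent` may fetch `url` according to `robots_txt`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)

# FreeFind has its own section with an empty disallow, so nothing is blocked.
print(can_fetch("FreeFind", "http://adomain.com/cgi-bin/post.cgi"))
# "SomeOtherBot" (a hypothetical robot) falls back to the "*" section.
print(can_fetch("SomeOtherBot", "http://adomain.com/cgi-bin/post.cgi"))
print(can_fetch("SomeOtherBot", "http://adomain.com/index.html"))
```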





Examples



This section has a few handy examples.



To prevent FreeFind from indexing your site at all:



user-agent: FreeFind
disallow: /



To prevent FreeFind from indexing common FrontPage image map junk:



user-agent: FreeFind
disallow: /_vti_bin/shtml.exe/



To prevent FreeFind from indexing a test directory and a private file:



user-agent: FreeFind
disallow: /test/
disallow: /private.html



To let FreeFind index everything but prevent other robots from accessing certain files:



user-agent: *
disallow: /cgi-bin/
disallow: /this.html
disallow: /and.html
disallow: /that.html

user-agent: FreeFind
disallow:
anonymous
2016-10-17 14:44:40 UTC
In basic terms, a hatchback would fit your budget. So the Indica, Alto K10 and Santro will all fit your budget. All three are good, efficient models from reputed manufacturers with good after-sales service centers. But mind you, hatchbacks don't have much luggage space. If you can stretch your budget a little more, you could go for the Tata Indigo CS, which isn't expensive and has a separate boot for your luggage. Make up your mind and go for the one that suits you best. Enjoy yourself and drive safe.
cyberdoc
2009-04-07 02:23:16 UTC
Robots.txt is a text file that tells bots which folders they may visit and which folders are off limits.

A typical robots.txt file looks like this:



User-agent: *
Allow: /Website-folder/
Allow: /images/
Disallow: /restricted-folder/
Disallow: /allowed-folder/restricted-folder/
Disallow: /Personal-folder/
Disallow: /xyz/
Sitemap: http://site.com/sitemap.xml
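
A quick way to sanity-check a file like this is Python's standard urllib.robotparser (site_maps() needs Python 3.8 or later). The sketch below assumes the rules sit under a "User-agent: *" line, since Allow and Disallow lines only take effect inside a user-agent section; the robot name "AnyBot" and the page names are made up.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the rules belong to a "User-agent: *" section.
ROBOTS_TXT = """\
User-agent: *
Allow: /Website-folder/
Allow: /images/
Disallow: /restricted-folder/
Disallow: /Personal-folder/
Disallow: /xyz/
Sitemap: http://site.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "AnyBot" is a hypothetical robot name; it is matched by the "*" section.
print(parser.can_fetch("AnyBot", "http://site.com/images/logo.png"))
print(parser.can_fetch("AnyBot", "http://site.com/restricted-folder/page.html"))

# Sitemap lines are collected separately from the allow/disallow rules.
print(parser.site_maps())
```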
anonymous
2009-04-07 02:21:09 UTC
http://gettinginfo.info.tm

A very informative website. Kindly stay a minute on the website and enter your email address to get your answer.


This content was originally posted on Y! Answers, a Q&A website that shut down in 2021.