Webmaster Knowledge Base | "Robots"
The "robots.txt" file only prevents content that has no business there, for example demo pages, from ending up in the indices (= databases) of the search engines. Pages that are not linked from anywhere do not need to be blocked, because the robots cannot find them anyway.

It works as follows: When a robot visits your website, it first looks for robots.txt and the information contained in it. The file robots.txt must be in the root directory. There may be only one per domain. Always use lower case: "robots.txt" and never "Robots.txt" or "robots.TXT".

The asterisk * in the "User-agent" line is a wildcard character and means that the following lines apply to all robots. With "Disallow", certain entries (files or directories) are closed to the robots. Each entry needs its own line.
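As a small illustration of the location rule above, the following Python sketch derives the address where a robot would look for the file, given any page of a site. The function name robots_url and the example.com address are assumptions made for this example only; real robots may implement this step differently.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the address of robots.txt for the site that serves page_url."""
    parts = urlsplit(page_url)
    # Keep only the scheme and host. The file always sits in the root
    # directory and its name is always written in lower case.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical page address, used only to show the call.
print(robots_url("http://www.example.com/demo/page.html?id=7"))
```

Whatever directory the page is in, the result points at the root of the domain, which is why a robots.txt placed in a subdirectory is not found.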
Excluding all robots:

    User-agent: *
    Disallow: /

Inviting all robots:

    User-agent: *
    Disallow:

(Just create a file called "robots.txt" with a text editor, write in the robot information, and upload it to your web server's main directory.)

    # my robots info (just a comment)
    User-agent: *        # applies to all robots
    Disallow: /cgi-bin   # block the directory "cgi-bin" with all its content
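The three rule sets above can be tried out with Python's standard urllib.robotparser module before uploading anything. This is only a sketch: the helper is_allowed, the robot name "ExampleBot" and the test paths are assumptions for illustration, and the rules are written out here in full syntax ("User-agent" with a hyphen, "Disallow: /" to exclude everything, an empty "Disallow:" to allow everything).

```python
from urllib.robotparser import RobotFileParser

# The three rule sets from the text, written in full syntax.
EXCLUDE_ALL = "User-agent: *\nDisallow: /\n"
INVITE_ALL = "User-agent: *\nDisallow:\n"
BLOCK_CGI = (
    "# my robots info (just a comment)\n"
    "User-agent: *\n"
    "Disallow: /cgi-bin\n"
)


def is_allowed(robots_txt: str, path: str, agent: str = "ExampleBot") -> bool:
    """Report whether the given rules let the robot fetch the path."""
    parser = RobotFileParser()
    # parse() takes the file content as a list of lines, so no download
    # is needed to test a draft.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


# "ExampleBot" and the paths below are hypothetical.
print(is_allowed(EXCLUDE_ALL, "/index.html"))
print(is_allowed(INVITE_ALL, "/index.html"))
print(is_allowed(BLOCK_CGI, "/cgi-bin/form.pl"))
print(is_allowed(BLOCK_CGI, "/index.html"))
```

Keep in mind that this only shows how a well-behaved robot reads the rules; the file is a request to robots, not an access control.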