Using robots.txt to hide from search engines
The robots.txt protocol is used to tell search engines (Google, MSN, etc) which parts of a website should not be crawled.
For Jira instances where non-logged-in users are able to view issues, a robots.txt file is useful for preventing unnecessary crawling of the Issue Navigator views (and unnecessary load on your Jira server).
Editing robots.txt
Jira (version 3.7 and later) installs the following robots.txt
file at the root of the Jira web app ($JIRA-INSTALL/atlassian-jira
):
# robots.txt for Jira
# You may specify URLs in this file that will not be crawled by search engines (Google, MSN, etc)
#
# By default, all SearchRequestViews in the IssueNavigator (e.g.: Word, XML, RSS, etc) and all IssueViews
# (XML, Printable and Word) are excluded by the /sr/ and /si/ directives below.
User-agent: *
Disallow: /sr/
Disallow: /si/
Alternatively, if you already have a robots.txt
file, simply edit it and add Disallow: /sr/
and Disallow: /si/
.
Publishing robots.txt
The robots.txt
file needs to be published at the root of your Jira internet domain, e.g. jira.mycompany.com/robots.txt
.
If your Jira instance is published at jira.mycompany.com/jira
, change the contents of the file to Disallow: /jira/sr/
and Disallow: /jira/sr/
. However, you still need to put robots.txt
file in the root directory, i.e. jira.mycompany.com/robots.txt
(not jira.mycompany.com/jira/robots.txt
).