# If the Joomla site is installed within a folder such as at # e.g. www.example.com/joomla/ the robots.txt file MUST be # moved to the site root at e.g. www.example.com/robots.txt # AND the joomla folder name MUST be prefixed to the disallowed # path, e.g. the Disallow rule for the /administrator/ folder # MUST be changed to read Disallow: /joomla/administrator/ # # For more information about the robots.txt standard, see: # http://www.robotstxt.org/orig.html # # For syntax checking, see: # http://www.sxw.org.uk/computing/robots/check.html User-agent: * Allow: /*.js* Allow: /*.css* Allow: /*.png* Allow: /*.jpg* Allow: /*.gif* Disallow: /administrator/ Disallow: /cache/ Disallow: /cli/ Disallow: /includes/ Disallow: /installation/ Disallow: /language/ Disallow: /libraries/ Disallow: /logs/ Disallow: /tmp/ Allow: /index.php?option=com_xmap&view=xml&tmpl=component&id=1 Disallow: /*/events-calendar/* Disallow: /events-calendar/* Disallow: /*/blog/tag/* Disallow: /blog/tag/* Disallow: /*/blog/calendar/* Disallow: /blog/calendar/* Crawl-delay: 5 #SEO Disallows Disallow: /component/ Disallow: /channels/ Disallow: /*?* #Block Common Spam Bots User-agent: MJ12bot Disallow: / User-agent: Yandex Disallow: / User-agent: Baiduspider Disallow: / User-agent: AhrefsBot Disallow: / User-agent: SemrushBot Disallow: / User-agent: dotbot Disallow: / User-agent: Baiduspider Disallow: / User-agent: SeznamBot Disallow: / User-agent: Twingly Recon Disallow:/ User-agent: sogou spider Disallow: / User-agent: YoudaoBot Disallow: / User-agent: NaverBot User-agent: Yeti Disallow: / User-agent: moget User-agent: ichiro Disallow: / User-agent: Twitterbot Disallow: # Crawl-delay: 1