You can use a robots.txt file to request that search engines remove your site from their indexes and to prevent robots from crawling it in the future. (Note that if a robot discovers your site by other means, for example by following a link to your URL from another site, your content may still appear in our index and our search results. To keep a page out of the Google index entirely, even if other sites link to it, use a noindex meta tag.)
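As a rough illustration of the noindex meta tag mentioned above, the sketch below checks whether a page's HTML carries a robots meta tag with a noindex directive. The sample pages and the names `RobotsMetaFinder` and `has_noindex` are hypothetical, made up for this example; the tag itself is commonly written as `<meta name="robots" content="noindex">`.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Records whether a <meta name="robots"> tag contains 'noindex'."""

    def __init__(self):
        super().__init__()
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute names arrive lowercased; values may be None.
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        directives = [d.strip().lower() for d in attr.get("content", "").split(",")]
        if "noindex" in directives:
            self.noindex = True


def has_noindex(html: str) -> bool:
    """Return True if the HTML contains a robots meta tag with noindex."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return finder.noindex


# Hypothetical sample pages, used only to exercise the check.
blocked_page = '<html><head><meta name="robots" content="noindex"></head></html>'
normal_page = "<html><head><title>Hello</title></head></html>"

print(has_noindex(blocked_page))  # True
print(has_noindex(normal_page))   # False
```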
To prevent robots from crawling your site, place the following robots.txt file in your server root:
User-agent: *
Disallow: /
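One way to sanity-check these rules before deploying them is to feed them to Python's standard `urllib.robotparser` module. This is only a sketch: the helper name `is_allowed` and the bot name `SomeOtherBot` are made up for illustration, and Python's parser may not match every search engine's behavior in edge cases.

```python
from urllib.robotparser import RobotFileParser

# The rules quoted above, parsed from a string instead of fetched over the network.
BLOCK_ALL = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Every robot is refused every path.
print(is_allowed(BLOCK_ALL, "Googlebot", "http://yourserver.com/"))              # False
print(is_allowed(BLOCK_ALL, "SomeOtherBot", "http://yourserver.com/page.html"))  # False
```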
To remove your site from Google only and prevent just Googlebot from crawling your site in the future, place the following robots.txt file in your server root:
User-agent: Googlebot
Disallow: /
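The difference from the catch-all file can be checked the same way. The sketch below, again using Python's standard `urllib.robotparser`, suggests that these rules refuse Googlebot while leaving other robots unaffected. The helper `can_crawl` and the bot name `SomeOtherBot` are hypothetical names chosen for this example.

```python
from urllib.robotparser import RobotFileParser

# The Googlebot-only rules quoted above.
BLOCK_GOOGLEBOT = """\
User-agent: Googlebot
Disallow: /
"""


def can_crawl(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if user_agent may fetch url under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Googlebot is refused; a robot with no matching record is not restricted.
print(can_crawl(BLOCK_GOOGLEBOT, "Googlebot", "http://yourserver.com/"))     # False
print(can_crawl(BLOCK_GOOGLEBOT, "SomeOtherBot", "http://yourserver.com/"))  # True
```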
Each port must have its own robots.txt file. In particular, if you serve content via both http and https, you'll need a separate robots.txt file for each of these protocols. For example, to allow robots (including Googlebot) to crawl all http pages but no https pages, you'd use the robots.txt files below.
For the http protocol (http://yourserver.com/robots.txt):
User-agent: *
Allow: /
For the https protocol (https://yourserver.com/robots.txt):
User-agent: *
Disallow: /
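To make the per-protocol behavior concrete, the sketch below keeps one robots.txt per (scheme, host) pair and picks the matching file for each URL before checking it with Python's standard `urllib.robotparser`. The dictionary `ROBOTS_BY_ORIGIN` and the helper `may_crawl` are assumptions made for this example; a real crawler would fetch each file from its own origin instead.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# One robots.txt per origin, mirroring the two files shown above.
ROBOTS_BY_ORIGIN = {
    ("http", "yourserver.com"): "User-agent: *\nAllow: /\n",
    ("https", "yourserver.com"): "User-agent: *\nDisallow: /\n",
}


def may_crawl(user_agent: str, url: str) -> bool:
    """Check url against the robots.txt that belongs to its own scheme and host."""
    parts = urlsplit(url)
    robots_txt = ROBOTS_BY_ORIGIN[(parts.scheme, parts.netloc)]
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# The same path gets a different answer depending on the protocol.
print(may_crawl("Googlebot", "http://yourserver.com/page.html"))   # True
print(may_crawl("Googlebot", "https://yourserver.com/page.html"))  # False
```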
Removing your entire website using a robots.txt file
Wednesday, October 29, 2008 at 4:20 AM Posted by Vasu