How to Avoid the Search Engines from Indexing Non-Public Release Documents

by Raphael Nikolai on January 22, 2011 and Revised on May 1, 2012

in Tips and Tricks

Your confidential files are not safe once uploaded online! The web is full of crawlers/spiders that are hungry for information and they will find your content – Trust me, even if you have not shared it to anyone.

Why dont you try it your self. Try searching Google with the search string below:

“not for public release” -.edu -.gov -.mil

Most confidential documents are appropriately labeled as “not for public release”, so searching google for such strings will yield to some interesting results. Same goes for the terms confidential, Top Secret, Clandestine, and so on.

So how do you prevent spiders/crawlers form indexing these files?

Well there are two ways of instructing crawlers to skip your content.

  1. Meta-Noindex / Meta-Nofollow via Meta tags- here
  2. Noindex / Nofollow via Robots.txt – here

Feel free to search the web for more info on these techniques. Good luck! If you have something to add feel free to leave a comment below.

  • Donald

    Just wanted to thank you for helping me get my HP All-in-one reinstalled and recognized by HP Solution Center. Your “I DONT WANT IT TO FAIL” method worked after I had been scrapping at it for 2 days. I hope you got my donation to help keep you going.
    Donald

Previous post:

Next post: