# As a condition of accessing this website, you agree to abide by the following # content signals: # (a) If a Content-Signal = yes, you may collect content for the corresponding # use. # (b) If a Content-Signal = no, you may not collect content for the # corresponding use. # (c) If the website operator does not include a Content-Signal for a # corresponding use, the website operator neither grants nor restricts # permission via Content-Signal with respect to the corresponding use. # The content signals and their meanings are: # search: building a search index and providing search results (e.g., returning # hyperlinks and short excerpts from your website's contents). Search does not # include providing AI-generated search summaries. # ai-input: inputting content into one or more AI models (e.g., retrieval # augmented generation, grounding, or other real-time taking of content for # generative AI search answers). # ai-train: training or fine-tuning AI models. # ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF # RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT # AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET. # BEGIN Cloudflare Managed content User-agent: * Content-Signal: search=yes,ai-train=no Allow: / User-agent: Amazonbot Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: Bytespider Disallow: / User-agent: CCBot Disallow: / User-agent: ClaudeBot Disallow: / User-agent: Google-Extended Disallow: / User-agent: GPTBot Disallow: / User-agent: meta-externalagent Disallow: / # END Cloudflare Managed Content # ============================================================================ # Dear Nobody - robots.txt # https://dearnobody.org/robots.txt # ============================================================================ # We welcome search engines to help people discover our platform. # However, we prohibit scraping, harvesting, or automated data collection. # See our Terms of Service: https://dearnobody.org/terms # ============================================================================ # =================== # SEARCH ENGINE BOTS # =================== # Allow legitimate search engine crawling of public informational pages User-agent: Googlebot Allow: / # Clean URLs (preferred) Allow: /write Allow: /confess Allow: /explain Allow: /faq Allow: /guidelines Allow: /privacy Allow: /terms Allow: /contact Allow: /crisis Allow: /consent Allow: /transparency Allow: /publishing Allow: /letter Allow: /mailbox # Fallback .html URLs Allow: /index.html Allow: /write.html Allow: /confess.html Allow: /explain.html Allow: /faq.html Allow: /guidelines.html Allow: /privacy.html Allow: /terms.html Allow: /contact.html Allow: /crisis.html Allow: /consent-explained.html Allow: /transparency.html Allow: /publishing.html Allow: /letter.html Allow: /mailbox.html Allow: /images/ Allow: /css/ # Private pages (both formats) Disallow: /access Disallow: /access.html Disallow: /submission Disallow: /submission-details.html Disallow: /netlify/ Disallow: /api/ Disallow: /api/ Disallow: /database/ Disallow: /partials/ User-agent: Bingbot Allow: / # Clean URLs (preferred) Allow: /write Allow: /confess Allow: /explain Allow: /faq Allow: /guidelines Allow: /privacy Allow: /terms Allow: /contact Allow: /crisis Allow: /consent Allow: /transparency Allow: /publishing Allow: /letter Allow: /mailbox # Fallback .html URLs Allow: /index.html Allow: /write.html Allow: /confess.html Allow: /explain.html Allow: /faq.html Allow: /guidelines.html Allow: /privacy.html Allow: /terms.html Allow: /contact.html Allow: /crisis.html Allow: /consent-explained.html Allow: /transparency.html Allow: /publishing.html Allow: /letter.html Allow: /mailbox.html Allow: /images/ Allow: /css/ # Private pages (both formats) Disallow: /access Disallow: /access.html Disallow: /submission Disallow: /submission-details.html Disallow: /netlify/ Disallow: /api/ Disallow: /api/ Disallow: /database/ Disallow: /partials/ User-agent: DuckDuckBot Allow: / Disallow: /access Disallow: /access.html Disallow: /submission Disallow: /submission-details.html Disallow: /netlify/ Disallow: /api/ Disallow: /api/ Disallow: /database/ Disallow: /partials/ User-agent: Slurp Allow: / Disallow: /access Disallow: /access.html Disallow: /submission Disallow: /submission-details.html Disallow: /netlify/ Disallow: /api/ Disallow: /api/ Disallow: /database/ Disallow: /partials/ # =================== # AI TRAINING BOTS # =================== # We do not consent to AI training on user-submitted content User-agent: GPTBot Disallow: / User-agent: ChatGPT-User Disallow: / User-agent: CCBot Disallow: / User-agent: anthropic-ai Disallow: / User-agent: Claude-Web Disallow: / User-agent: Google-Extended Disallow: / User-agent: Bytespider Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: PerplexityBot Disallow: / User-agent: Omgilibot Disallow: / User-agent: Omgili Disallow: / # =================== # SCRAPERS & HARVESTERS # =================== # Block known scrapers, data harvesters, and email collectors User-agent: AhrefsBot Disallow: / User-agent: SemrushBot Disallow: / User-agent: MJ12bot Disallow: / User-agent: DotBot Disallow: / User-agent: PetalBot Disallow: / User-agent: DataForSeoBot Disallow: / User-agent: BLEXBot Disallow: / User-agent: SEOkicks Disallow: / User-agent: SEOkicks-Robot Disallow: / User-agent: Rogerbot Disallow: / User-agent: Exabot Disallow: / User-agent: Gigabot Disallow: / User-agent: Nutch Disallow: / User-agent: Scrapy Disallow: / User-agent: curl Disallow: / User-agent: wget Disallow: / User-agent: HTTrack Disallow: / User-agent: EmailCollector Disallow: / User-agent: EmailSiphon Disallow: / User-agent: WebBandit Disallow: / User-agent: EmailWolf Disallow: / User-agent: ExtractorPro Disallow: / User-agent: harvest Disallow: / User-agent: Harvest Disallow: / User-agent: sitecheck.internetseer.com Disallow: / User-agent: webalta Disallow: / User-agent: WebZIP Disallow: / User-agent: Offline Explorer Disallow: / User-agent: WebCopier Disallow: / User-agent: Teleport Disallow: / User-agent: TeleportPro Disallow: / User-agent: WebStripper Disallow: / User-agent: Sucker Disallow: / User-agent: Zeus Disallow: / # =================== # ARCHIVE BOTS # =================== # Allow Internet Archive (Wayback Machine) for historical preservation User-agent: ia_archiver Allow: / Disallow: /access Disallow: /access.html Disallow: /submission Disallow: /submission-details.html Disallow: /netlify/ Disallow: /api/ Disallow: /api/ Disallow: /database/ Disallow: /partials/ # =================== # DEFAULT RULE # =================== # Block all other bots by default - if you're legitimate, contact us User-agent: * Disallow: /access Disallow: /access.html Disallow: /submission Disallow: /submission-details.html Disallow: /netlify/ Disallow: /api/ Disallow: /api/ Disallow: /database/ Disallow: /partials/ Disallow: /report-modal.html Disallow: /report-modal-include.html Allow: / # =================== # CRAWL SETTINGS # =================== Crawl-delay: 1 # =================== # SITEMAP # =================== Sitemap: https://dearnobody.org/sitemap.xml # ============================================================================ # LEGAL NOTICE # ============================================================================ # Automated scraping, data harvesting, email collection, and unauthorized # access to this website is strictly prohibited and may violate our Terms # of Service (https://dearnobody.org/terms) and applicable laws including # the Computer Fraud and Abuse Act (18 U.S.C. ยง 1030). # # Violations may result in legal action. For legitimate access requests, # contact: support@dearnobody.org # ============================================================================