# robots.txt — Tanmiah Food Company # Official investor relations and corporate website (Umbraco CMS) # https://llmstxt.org/ | https://developers.google.com/search/docs/crawling-indexing/robots/robots_txt # # Pre-launch: HTML pages on dev/staging emit noindex (SiteGeo:AllowSearchIndexing). # At go-live set PreferredCanonicalHost and use absolute Sitemap / LLMs-Txt URLs if required. # ----------------------------------------------------------------------------- # Default rules (all crawlers) # ----------------------------------------------------------------------------- User-agent: * Allow: / # CMS, backoffice, and non-public application paths Disallow: /umbraco/ Disallow: /App_Plugins/ Disallow: /App_Data/ Disallow: /install/ # Error and utility pages (not useful for indexing) Disallow: /InternalServerError.html # ----------------------------------------------------------------------------- # Major search engines # (Explicit groups for clarity; same public-site rules apply.) # ----------------------------------------------------------------------------- User-agent: Googlebot User-agent: Googlebot-Image User-agent: Googlebot-News User-agent: Googlebot-Video User-agent: Bingbot User-agent: Slurp User-agent: DuckDuckBot User-agent: Baiduspider User-agent: YandexBot User-agent: Applebot Allow: / Disallow: /umbraco/ Disallow: /App_Plugins/ Disallow: /App_Data/ Disallow: /install/ Disallow: /InternalServerError.html # ----------------------------------------------------------------------------- # AI assistants, LLM crawlers, and answer engines (GEO) # Public investor-facing content may be crawled and cited. # Admin paths remain blocked for all agents below. # ----------------------------------------------------------------------------- User-agent: GPTBot User-agent: ChatGPT-User User-agent: OAI-SearchBot User-agent: ClaudeBot User-agent: anthropic-ai User-agent: PerplexityBot User-agent: Google-Extended User-agent: Bytespider User-agent: DeepSeekBot User-agent: ChatGLM-Spider User-agent: PanguBot User-agent: PetalBot User-agent: Baidubot User-agent: CCBot User-agent: Amazonbot User-agent: meta-externalagent User-agent: FacebookBot User-agent: Applebot-Extended User-agent: cohere-ai User-agent: Diffbot User-agent: YouBot User-agent: ImagesiftBot User-agent: Omgilibot Allow: / Disallow: /umbraco/ Disallow: /App_Plugins/ Disallow: /App_Data/ Disallow: /install/ Disallow: /InternalServerError.html # ----------------------------------------------------------------------------- # Sitemap (Umbraco SEO sitemap) # ----------------------------------------------------------------------------- Sitemap: https://ir.tanmiah.com/sitemap.xml # ----------------------------------------------------------------------------- # LLM / GEO — curated site summary for AI assistants (https://llmstxt.org/) # Also linked from HTML: # ----------------------------------------------------------------------------- LLMs-Txt: https://ir.tanmiah.com/llms.txt