411bz Authority Intelligence Crawler

Transparent. Respectful. Policy-driven.

What We Crawl

The 411bz Authority Intelligence Crawler analyzes publicly accessible web pages to measure AI cite-ability, authority structure, and competitive positioning within industry verticals.

We crawl only publicly accessible content — pages that any web browser can access without authentication, payment, or special access.

How We Identify

Our crawler identifies itself with the following User-Agent string:

411bz-AuthorityCrawler/1.0 (+https://411bz.ai/crawler-policy)

Every request our crawler makes carries this identifier, so you can find it in your server logs.
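Site owners who want to confirm crawler visits can search their access logs for the product token in the User-Agent string. The sketch below is illustrative only: it assumes logs in the common "combined" format, where the User-Agent is the last double-quoted field on each line, and the function names are hypothetical.

```python
import re

# Product token taken from the User-Agent string above.
CRAWLER_TOKEN = "411bz-AuthorityCrawler"

# Assumption: "combined" log format, where the User-Agent is the
# last double-quoted field on each line.
USER_AGENT_FIELD = re.compile(r'"([^"]*)"\s*$')


def is_crawler_hit(log_line):
    """Return True if the line's User-Agent field names the crawler."""
    match = USER_AGENT_FIELD.search(log_line)
    return bool(match) and CRAWLER_TOKEN in match.group(1)


def count_crawler_hits(log_lines):
    """Count the lines in an iterable of log lines made by the crawler."""
    return sum(1 for line in log_lines if is_crawler_hit(line))
```

For a real log file, something like count_crawler_hits(open("access.log")) would apply, where the file name depends on your server setup.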

What We Respect

robots.txt — We honor all directives including Disallow rules
Crawl-Delay — We respect specified delays between requests
Rate Limiting — Maximum 1 request per second per domain
Max Pages — We never crawl more than 50 pages per domain per session
No Authentication Bypass — We never attempt to access protected content
No Paywall Circumvention — We do not bypass any access restrictions
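
The rules above can be combined into a single planning step. The following is a minimal sketch of how such a step might look, not our production code: it uses Python's standard urllib.robotparser module, and the function name build_plan and the constant names are hypothetical.

```python
from urllib.robotparser import RobotFileParser

USER_AGENT = "411bz-AuthorityCrawler"
MIN_DELAY_SECONDS = 1.0      # at most 1 request per second per domain
MAX_PAGES_PER_SESSION = 50   # per-domain page cap for one session


def build_plan(robots_txt, candidate_urls):
    """Return (allowed_urls, delay_seconds) for one session on one domain."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())

    # Use the site's Crawl-delay when it is stricter than our own limit.
    crawl_delay = parser.crawl_delay(USER_AGENT)
    delay = max(MIN_DELAY_SECONDS, float(crawl_delay or 0))

    # Drop every URL that a Disallow rule covers, then apply the page cap.
    allowed = [u for u in candidate_urls if parser.can_fetch(USER_AGENT, u)]
    return allowed[:MAX_PAGES_PER_SESSION], delay
```

A caller would then fetch each allowed URL in turn, sleeping for delay seconds between requests.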

What We Extract

From publicly accessible HTML, we extract structural and semantic signals:

Heading structure (H1-H6 hierarchy)
JSON-LD structured data types
FAQ and Q&A pattern presence
Definition clarity patterns
List and step instruction density
Entity reinforcement consistency
Topic cluster strength
External citation density
AI discovery signals (llms.txt, robots.txt AI directives)

We do not store raw HTML content. We extract numerical scores and structural metadata only.
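
To illustrate what "structural metadata only" can mean in practice, here is a hedged sketch covering two of the signals listed above: heading hierarchy and JSON-LD types. It is an example built on Python's standard html.parser and json modules, not a description of our actual extraction pipeline, and the names SignalExtractor and extract_signals are hypothetical.

```python
import json
from html.parser import HTMLParser


class SignalExtractor(HTMLParser):
    """Collects heading levels and JSON-LD @type values; keeps no page text."""

    def __init__(self):
        super().__init__()
        self.heading_levels = []   # one integer (1-6) per heading, in order
        self.jsonld_types = []
        self._in_jsonld = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if len(tag) == 2 and tag[0] == "h" and tag[1] in "123456":
            self.heading_levels.append(int(tag[1]))
        elif tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        # Only JSON-LD script bodies are buffered, and only until parsed.
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                doc = json.loads("".join(self._buffer))
            except json.JSONDecodeError:
                return  # malformed JSON-LD is skipped
            for item in doc if isinstance(doc, list) else [doc]:
                if isinstance(item, dict) and isinstance(item.get("@type"), str):
                    self.jsonld_types.append(item["@type"])


def extract_signals(html):
    parser = SignalExtractor()
    parser.feed(html)
    parser.close()
    return {"heading_levels": parser.heading_levels,
            "jsonld_types": parser.jsonld_types}
```

The returned dictionary holds only heading levels and type names, so nothing of the page's text survives the extraction step.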

How to Opt Out

To exclude your site from our crawler:

Add the following to your robots.txt:
    User-agent: 411bz-AuthorityCrawler
    Disallow: /
Or email us at crawler@411bz.com

We process opt-out requests within 24 hours and permanently exclude opted-out domains.
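
If you want to check that your robots.txt rule has the intended effect before deploying it, a short local test is enough. The sketch below uses Python's standard urllib.robotparser module; the helper name is_opted_out is hypothetical, and example.com stands in for your own domain.

```python
from urllib.robotparser import RobotFileParser

# The opt-out rule from this section, one directive per line.
OPT_OUT_RULE = """\
User-agent: 411bz-AuthorityCrawler
Disallow: /
"""


def is_opted_out(robots_txt, url):
    """Return True if robots_txt blocks the 411bz crawler from url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch("411bz-AuthorityCrawler", url)


print(is_opted_out(OPT_OUT_RULE, "https://example.com/any/page"))
```

Passing the full text of your own robots.txt in place of OPT_OUT_RULE checks the file as a whole, including any rules for other user agents.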

Data Use

Extracted signals feed into the 411bz Authority Economics Engine to compute:

AI Cite-Ability Index (ACI) — LLM extraction readiness
Authority AI Index (AAI) — Composite authority positioning
Vertical competitive benchmarks
Reinforcement recommendations

We never sell raw crawl data. We publish only aggregate indices and anonymized vertical benchmarks.

Contact

For questions about our crawling practices:

Email: crawler@411bz.com
Web: 411bz.ai

Last updated: February 2026. 411bz Authority Economics Platform.