Off-Site Authority: Digital PR and Earned Media

82-89% of AI citations come from earned media — third-party sources — not brand-owned content. That single statistic reframes the entire GEO problem. No matter how well-structured your own site is, if you have not built a presence in the sources that AI systems trust, you will be cited rarely. This section covers the three highest-leverage off-site authority mechanisms: earned media placement, original research as citation magnets, and Reddit community strategy.

How AI Systems Evaluate Off-Site Trust

AI systems cannot follow a link graph in real time the way Google's PageRank crawler does, but they are trained on data that reflects the link graph, entity relationships, and co-occurrence patterns across billions of web pages. The trust model they apply to sources at citation time is built on four evaluation dimensions:

Entity coherence: Does your brand/product appear consistently across multiple authoritative sources with the same name, description, and category? An entity that is described inconsistently — "Acme API" on your site, "AcmeAPI" on TechCrunch, "Acme API Platform" on Forbes — is a weaker entity signal than a consistent name and description across all mentions.

Topical depth: Is your entity consistently associated with a specific topic area across multiple sources? A developer tools company that appears in developer tools articles on GitHub, Stack Overflow, Dev.to, and technical publications has stronger topical authority than one that appears in general business press.

Off-site co-occurrence: Do authoritative sources in your topic area mention your entity in the same context as other trusted entities? Co-occurrence with established brands is a trust proxy.

Content for AI extraction: Third-party pages that mention you in structured, citable ways — with specific claims, statistics, and comparisons — are more valuable than passing mentions.

Earned Media: Tier-1 Placement

A single placement in a Tier-1 publication — Forbes, TechCrunch, VentureBeat, The Verge, Wall Street Journal, Wired — generates persistent AI citations for months to years after publication. The mechanism is dual: these publications are in the training data of virtually all major AI models, and they are in the inference index of every major AI search platform. An article that mentions your company and describes what it does becomes a durable citation source that AI systems return to repeatedly.

BrightEdge research found that 40% of AI Overview citations come from pages not in Google's top 10 — an indication that structural authority at the content level can compensate for lower domain rankings. But Tier-1 earned media provides both structural authority and domain authority, making it the highest-leverage combination.

The PR strategy that AI-optimizes earned media:

  1. Make the claim citable: Every press release, pitch, and contributed article should include specific statistics, product metrics, and dated claims. "Acme API processed 12 billion requests in Q1 2026" is citable; "Acme API is growing fast" is not.

  2. Include entity-defining statements: Ensure every article includes a complete factual statement that defines your entity — name, category, founding, and scale. This is the sentence AI systems extract for entity recognition.

  3. Use consistent brand naming: Match your site's brand name exactly across every external placement. Variations fragment your entity signal.

  4. Target publications in your vertical: Domain-specific authority is weighted heavily. For a developer tools company, coverage in The New Stack, InfoQ, or Hacker News is more topically relevant than equivalent coverage in a general business publication.

  5. Brief is fine: A 200-word mention in TechCrunch's startup roundup that includes a specific data point is more AI-valuable than a 2000-word puff piece that says nothing specific.

Original Research as Citation Magnets

Original research is the highest ROI citation-building investment, and the mechanism is straightforward: AI systems cite specific data points, and if you are the original source of a data point, every article that cites your data includes a link to you, and AI systems cite you as the primary source.

Statistics and data points increase AI citations by 40% (from the Princeton GEO study). When you create the statistic, you receive that 40% lift plus the compounding effect of every secondary citation across the web.

Research formats with the highest citation yield:

Annual surveys: Developer experience surveys, tool adoption reports, benchmark studies. The annual cadence means you publish an updated version each year that re-establishes citation freshness. Stack Overflow's Developer Survey, JetBrains' State of Developer Ecosystem, and Cloudflare's Year in Review are extreme examples of this pattern — they generate AI citations across thousands of responses year after year.

Benchmark reports: Performance comparisons, cost analyses, accuracy benchmarks. These are especially high-value in developer tools because quantitative comparisons are exactly what engineers ask AI systems about ("which is faster, Bun or Node.js for this use case?").

Case studies with specific metrics: "How we reduced API latency by 40% using Acme's routing layer" is more citable than "How Acme helped us scale." Include before/after numbers, timeline, and infrastructure context.

For each piece of original research:

  • Publish the primary data at a stable URL that you commit to maintaining
  • Submit the data to relevant data aggregators and industry newsletters
  • Pitch the findings to Tier-1 publications before or concurrent with publication
  • Include methodology notes so secondary citers can describe the data accurately
  • Update the research annually and maintain the same canonical URL

Reddit: The AI Citation Network Hidden in Plain Sight

Reddit's position in the AI citation landscape is disproportionate to how most marketing teams treat it. The data:

  • Reddit citations in AI search grew 450% between March and June 2025.
  • Reddit appears in 68% of AI-generated responses across major platforms.
  • Reddit is currently the #2 most visible website in Google search, behind only Wikipedia.
  • 85% of brand mentions in AI answers come from third-party sources including Reddit discussions.
  • Reddit's $60M data licensing deal with Google for AI training data ensures Reddit content is deeply embedded in multiple AI systems' parametric knowledge.

The underlying reason is structural: Reddit threads are exactly the format that AI systems find credible for conversational queries. A thread asking "which API gateway should I use for rate limiting?" with a 400-word response from a developer with 8 years of experience, citing specific benchmarks and explaining tradeoffs, is a higher-trust source for an AI answering that question than a company's own marketing page.

The 90/10 Reddit Strategy

Authentic participation drives AI citations; transparent self-promotion triggers downvotes and reduces the signal value of any mentions. The operational framework:

90%: Authentic technical participation

  • Answer questions in your technical domain with detailed, specific responses
  • Contribute original observations, not restated documentation
  • Long-form comments (300+ words) are cited 3x more often than short replies — provide depth
  • Include specific data, version numbers, and edge cases that demonstrate genuine expertise
  • Participate in threads where your product is not mentioned, to build creditor status in the community independent of promotional intent

10%: Strategic brand mentions

  • When your product genuinely solves the problem being discussed, mention it with full disclosure of your affiliation
  • Include specific technical comparisons: "we built X because Y and Z alternatives had P problem" — entity-rich posts with specific product names are cited at 3x the rate of vague comparisons
  • Never respond to a thread with only a brand mention; it must be embedded in a substantive technical response

Documented outcomes from this approach: A Writesonic experiment documented that 6 weeks of authentic technical participation in relevant subreddits, following the 90/10 rule, resulted in 41% of relevant AI-generated answers mentioning the brand — a 240% increase from baseline. The citations came from AI systems extracting the Reddit discussions where the brand was mentioned in a technically credible context.

Subreddit Targeting

For developer-facing products, the highest-value subreddits for AI citation are:

  • r/programming, r/webdev, r/devops: High volume, high visibility, heavily indexed
  • Topic-specific subreddits (r/golang, r/reactjs, r/kubernetes): Topical authority in vertical
  • r/MachineLearning, r/LocalLLaMA: High AI citation rates for ML/AI tools
  • r/sideprojects, r/entrepreneur: Less competitive for citation; useful for early brand mentions

Write contributions from a personal developer account with genuine history in the community, not a brand account. Personal accounts with real contribution history are more trusted by both the community and the AI systems that index the discussions.

The Compounding Dynamic

Off-site authority compounds in ways that on-site optimization cannot. Each Tier-1 placement generates AI citations independently of your site's technical optimization. Each piece of original research generates secondary citations that attribute the data to you. Each Reddit contribution adds to a body of third-party discussion that AI systems extract from when your brand comes up.

The compounding happens because AI systems update their inference indexes more frequently than they retrain their base models. New earned media placements and Reddit contributions enter the retrieval index within days to weeks. This means that an active earned media program creates a continuously refreshing off-site citation base, not a static one.

The floor investment for a meaningful off-site authority program: one original research piece per quarter, one Tier-1 press placement per quarter, and consistent technical participation in 2-3 relevant subreddits from a genuine developer contributor. This is not a dedicated team; it is structured into the existing work of technical founders, developer relations, and engineering blog contributors. The ROI at 6 months is measurable through AI citation tracking (Chapter 6); at 12 months, the compounding effect becomes the primary driver of AI referral traffic growth.