- Backlinks are not a direct ranking factor inside an LLM. There is no link graph and no PageRank in the model, so a pure, non-retrieval answer is almost unaffected by links.
- They influence AI visibility through the retrieval layer, where live grounding pulls from an index that still ranks pages partly on links, and barely touch the training layer.
- The effect is threshold-based. Authority Score correlated with AI mentions at Pearson 0.65, but incremental links below a minimum authority level return almost nothing.
- Brand mentions now rival raw links. Brand search volume was the strongest single predictor of LLM citations at 0.334, and nofollow links perform almost like follow links.
Table of Contents
In Semrush and Kevin Indig’s October 2025 analysis of 1,000 domains across five AI surfaces, a domain’s Authority Score correlated with how often it was mentioned in AI answers at a Pearson coefficient of 0.65. Raw backlink volume correlated far more weakly. That gap is the whole argument: link quality and aggregate authority move AI visibility, link quantity mostly does not.
The question matters now because retrieval mechanics changed twice in twelve months. Google rolled Gemini 3 in as the default model behind AI Overviews in late January 2026, and the share of AI Overview citations that also rank in the organic top 10 fell from 76% in Ahrefs’ July 2025 study to 38% in its early 2026 study of 863,000 keywords and roughly 4 million AI Overview URLs. The pool that backlinks feed is widening and loosening at once.
Most marketers misread this in two directions. One camp declares backlinks dead because the model “just reads text,” ignoring that every major AI search product retrieves from an index where links still rank pages. The other keeps buying links by the hundred, ignoring that data shows almost no return below an authority threshold. This article sets out what a ranking factor means when there is no ranking, what the correlation studies measured, and a named framework, the Authority Threshold, for deciding how much link investment is worth it and when it stops paying.
Get your website’s conversion score in minutes
- Instant CRO performance score
- Friction and intent issues detected automatically
- Free report with clear next steps
What counts as a ranking factor when there is no ranking
An LLM does not rank documents. It predicts tokens. Before asking whether backlinks rank you, separate the two distinct machines that decide whether an AI tool surfaces your brand, because backlinks behave completely differently in each.
The first is the parametric layer: the model’s frozen training weights, what answers when search is off. Nothing here is a ranking. The model reproduces patterns from its training corpus, so a brand that appears frequently and consistently across it is likelier to be generated as an answer. Backlinks barely touch this layer. What matters is corpus presence: how often your brand, products, and positioning appear as text the model trained on, linked or not. To a tokenizer, a hyperlink and a plain-text mention look almost identical.
The second is the retrieval layer: retrieval-augmented generation, where the model queries a search index, reads the returned pages, and grounds its answer in them. This runs in Google AI Overviews, Perplexity, ChatGPT with search, Copilot, and Gemini. Here a ranking exists: the candidate set comes from a conventional index, and in that index backlinks remain one of the strongest ordering signals. Google’s John Mueller said at Google Search Live in Zurich in December 2025 that there is “no such thing as GEO or AEO without doing SEO fundamentals,” and the reason is mechanical: the AI reads what the index ranks, and the index ranks partly on links.
Backlinks influence AI visibility almost entirely through the retrieval layer, and almost not at all through the parametric layer. That split explains nearly every contradictory study here: one measuring pure ChatGPT answers finds links matter less, one measuring AI Overviews or Perplexity finds they matter more. Different machines.
For the upstream context on how AI assistants are changing query behavior, see Pathmonk on whether ChatGPT is taking traffic from Google, what happens to SEO with SearchGPT, and how your company ranks on ChatGPT and Perplexity.
What the correlation data actually says
Opinion is cheap here. The useful inputs are three large correlation studies from late 2025, and they agree more than the headlines suggest.
The anchor is the Semrush and Kevin Indig study of 1,000 randomly selected domains, tracked across ChatGPT, ChatGPT with search, Gemini, Google AI Overviews, and Perplexity. The numbers that matter:
- Authority Score versus AI mentions: Pearson 0.65, a strong relationship. High-quality backlink profiles appear far more often in AI answers.
- Authority Score versus AI Share of Voice: Pearson 0.23 but Spearman 0.36. That gap is the tell. A Spearman well above the Pearson means the relationship is threshold-shaped, not smooth.
- Follow versus nofollow links: nearly identical correlations with AI mentions (Pearson 0.334 for follow, 0.340 for nofollow). AI surfaces do not honor the nofollow attribute the way classic PageRank logic assumes.
- Image links versus text links: image-based backlinks correlated more strongly with AI visibility (Pearson 0.415) than text links (0.334), especially for domains that already held authority.
A separate Semrush analysis of 800-plus websites across 11 industries, reported by Search Engine Land in October 2025, found organic keyword coverage (Spearman 0.41) tracked AI visibility slightly more closely than total backlinks (0.37). It also found citation concentration no link plan easily overcomes: Reddit alone drew roughly 66,000 AI mentions across the 11 sectors.
Aggregate domain authority predicts AI citations moderately well, raw link counts predict them poorly, and the effect only switches on past a threshold. That last clause is where most strategies break, so it gets its own section.
The overlap with conventional technical and on-page SEO is obvious, as is the link to why teams keep losing organic traffic. How to attribute and measure conversion from this traffic is a separate problem, covered later.
The Authority Threshold: links are a gate, not a dial
Here is the framework. The Authority Threshold is the level of aggregate domain authority below which backlink investment produces almost no AI visibility, and above which authority strongly predicts how often an AI surface cites you. Backlinks behave like a gate you clear, not a dial you turn up gradually.
The evidence is the Pearson-Spearman gap in the Semrush data. When Spearman (0.36) sits far above Pearson (0.23) for authority-to-visibility, nothing happens for a long stretch and then the relationship strengthens sharply past a level. Indig put it plainly: a backlink profile “needs to hit a minimum threshold,” and linear growth below it should not be expected to pay off. Semrush’s tiered breakdown showed the same shape: low-authority domains collected a handful of AI mentions, top-tier domains dozens.
This reframes the budget question. The standard instinct treats links as incremental, but for AI visibility that is wrong below the threshold and only partly right above it.
- Below the threshold, volume link building to chase citations is close to wasted. The money does more on content depth, entity consistency, and brand mentions.
- The fastest way across is concentration: a few links from genuinely high-authority, topically aligned domains, plus an expanding set of unique referring domains. Source diversity beats repetition.
- Above the threshold, authority compounds, and each additional high-authority link rises in value rather than flattening.
Two caveats. The threshold is necessary, not sufficient: in verticals dominated by Reddit, YouTube, or a category incumbent, you can clear it and still lose the citation. And reaching it through digital PR is slow, often six to twelve months for a new domain, which is why brand-mention building delivers faster early returns.
Backlinks still gate the retrieval pool, and that pool is loosening
The strongest case for backlinks is mechanical: the retrieval layer pulls candidates from an index where links decide rank, so backlinks control whether your page is eligible to be read at all. That gate is widening fast.
The overlap between organic rankings and AI citations is real but in steep decline. Ahrefs found 76% of AI Overview citations came from organic top-10 pages in July 2025, with a median cited position of 4. By early 2026, after a detection change and the Gemini 3 rollout, that fell to 38%, the rest split almost evenly between positions 11 to 100 (about 31%) and beyond 100 (about 31%). BrightEdge’s 16-month study reported 54.5% overlap as of September 2025, up from 32.3%, with YMYL verticals like healthcare and insurance at 68% to 75% and e-commerce at almost none. Originality.AI put the figure near 52% in November 2025.
The studies disagree because they parse citations differently and cover different verticals, which is the insight. The link between traditional ranking authority and AI citation is strong, contested, and weakening, so backlinks are becoming a necessary entry condition rather than a sufficient one. The driver is query fan-out: Google decomposes one question into many sub-queries, then assembles an answer from pages that perform across that cluster, not just the page ranking first. A page can be cited for a sub-query it never targeted, and a top-ranked page skipped for answering only the narrow query.
For YMYL content, links still matter intensely: trust thresholds are high and Google leans on already-ranking content. For e-commerce, overlap is near zero, so a jewelry or apparel brand should not expect link authority alone to buy AI Overview presence. This vertical split is one of the clearest operational signals in the data, and it sharpens on mobile AI search.
This is why ranking position no longer maps cleanly to revenue. Pathmonk has covered why SERP rank fails to generate sales, how to show up in an AI answer without ranking, and how to rank on SearchGPT and Perplexity. The same dynamic is reshaping whether keyword research still pays and how AI is affecting SEO overall.
What outranks raw links now: brand mentions and entity authority
If you can fund one signal, build unlinked brand mentions before links.
The Digital Bloom’s 2025 synthesis, drawing on more than 680 million citations, reported brand search volume as the single strongest predictor of LLM citations at 0.334, ahead of traditional backlinks. The mechanism is the parametric layer: models build their sense of which brands are credible from how often and how consistently brands appear across the training corpus, and most appearances are unlinked. A Reddit thread, podcast transcript, or YouTube description feeds that signal with no hyperlink at all, which is why nofollow links performed almost identically to follow links in the Semrush study. The link attribute is close to irrelevant; the textual association registers.
Two content-level levers showed large effects. Adding original statistics to a page raised its AI visibility by roughly 22%, direct quotations by roughly 37%. These are extraction signals: AI surfaces preferentially cite content that hands them a clean, attributable fact or quote. FAQ-structured content was cited at higher rates than the same information buried in prose.
The shift is from accumulating link equity to manufacturing citable evidence and consistent entity signals, because that is what both machines reward and links only feed one. Entity consistency means your brand name, category, founders, and core claims appear the same way across your site, structured data, third-party profiles, and the places scraped into training corpora. Inconsistency dilutes the parametric signal in a way no link repairs.
Image links deserve a note. Once a domain has authority, image-based backlinks correlated more strongly with AI visibility than text links, because original charts get embedded and credited across many sites, generating both links and interpretive context. Visual assets build link authority and entity signal at once.
For content teams, the playbook overlaps with product-led SEO for B2B, optimizing for zero-click searches, and using generative AI in B2B. Given YouTube’s heavy presence in non-ranking citations, YouTube SEO belongs in an AI-visibility strategy, and so does feeding the parametric layer with first-party intent data.
The link strategy that moves AI visibility, and where it breaks
Translate this into allocation: clear the Authority Threshold efficiently, then spend the rest on signals that compound faster than links.
A defensible priority order for a mid-authority B2B site:
- Concentrate link acquisition on high-authority, topically relevant domains. A handful of links from industry publications, recognized blogs, and trusted directories moves authority more than dozens of weak placements. Expand unique referring domains rather than collecting repeats.
- Build unlinked brand mentions deliberately. Pursue podcasts, expert commentary, data-led PR, and presence in the forums and communities that feed training corpora. A high-authority mention without a link is a win, not a near-miss.
- Engineer extraction signals into content. Add original statistics, quote named sources, and structure answers as discrete liftable blocks, leading each section with a direct answer in the first 40 to 60 words.
- Produce embeddable visual assets. Charts built on original data earn image links and entity context at once, and age slower than text.
- Keep nofollow placements fresh. Since AI surfaces weight nofollow mentions almost like follow links, refreshing data and screenshots on contributed articles keeps them re-citable.
Where it breaks and what it costs:
- Below-threshold domains see no return for months. If your Authority Score sits low, steps 2 and 3 outperform step 1 for one to two quarters; front-loading link spend wastes budget.
- High-concentration verticals cap your ceiling. In categories dominated by Reddit, YouTube, or a single incumbent, you can do everything right and still lose most citations to platform-preferred sources. Win on those platforms instead of out-linking them.
- Manufactured links and PBNs are riskier than before. The content that earns AI citations is scrutinized for trust signals, so manipulative link patterns can poison entity trust across both machines at once. Mueller’s warning about short-lived tricks applies directly.
- It does not fix conversion. Authority gets you cited and clicked; it does nothing about whether that visitor converts, a separate and usually larger lever.
Two notes: personalization that runs post-arrival never touches crawler-facing content, so you can run an A/B test without affecting SEO, and the move toward agentic AI in CRO puts the conversion layer at the center of where AI meets conversion optimization.
Is your website ready for LLMs, or only ready to be cited?
Everything above is about one question: can an LLM find and cite you. Being indexed in the retrieval layer and named in the answer are real wins, but they are visibility wins. A separate readiness question decides whether any of it becomes revenue. Once a buyer is inside ChatGPT, Claude, or Perplexity, or an AI agent is acting on their behalf, can your website actually let them convert? Citations get you into the answer; they do nothing about whether the buyer can act on it.
This is what Pathmonk’s Website MCP is built for. It exposes your brand and your conversion actions to AI tools through the Model Context Protocol, so a buyer researching inside ChatGPT, Claude, or Perplexity can book a demo, sign up, or buy directly inside the conversation instead of being handed a homepage URL to navigate alone. Your site stops being a page an LLM only reads about and becomes a set of actions an LLM agent can complete.
Being cited and being transactable are two different levels of LLM readiness, and the second is where the revenue is. The math is blunt: if your authority work earns the citation but the agent reaches a contact form it cannot fill or a checkout it cannot start, that citation converts at zero. Expose the same action through Website MCP and the high-intent moment closes inside the conversation, which matters most for the visitors who would not act on a homepage anyway and for the rising share of sessions driven by AI agents rather than humans. Ranking and indexing get you seen; Website MCP is what makes your website ready for LLMs to act on.
Let AI assistants book, buy, and sign up on your behalf
- Your website becomes transactable inside ChatGPT, Claude, and Perplexity
- Buyers convert without ever leaving the AI conversation



LAW GROUP
- Practice
- About
Employment Law Representation You Can Rely On
Protecting your rights. Securing favorable outcomes.
FAQs on backlinks for LLMs
Do LLMs use backlinks the way Google uses PageRank?
No. LLMs have no link graph in their training weights, so there is no PageRank-style calculation inside the model. Backlinks influence AI answers only through the retrieval layer, where live results come from an index that still ranks pages partly on links. In a pure, non-retrieval answer, links have almost no effect.
Does a nofollow link help with AI visibility?
In the Semrush and Kevin Indig study, nofollow links correlated with AI mentions almost identically to follow links (Pearson 0.340 versus 0.334), because AI surfaces weight the textual association, not the link attribute. Gemini and ChatGPT leaned more on nofollow sources like Reddit and Wikipedia; AI Overviews and Perplexity leaned toward follow links.
Are brand mentions more important than backlinks for getting cited by AI?
For the parametric layer, yes. Brand search volume was the strongest single predictor of LLM citations in The Digital Bloom’s 2025 analysis at a 0.334 correlation, ahead of raw backlinks. Unlinked mentions feed the model’s sense of which brands are credible. Backlinks still matter for the retrieval layer, so build both, with mentions usually paying off faster.
Why do different studies report wildly different overlap between rankings and AI citations?
They parse citations differently, cover different verticals, and ran at different points during a fast-moving change. Ahrefs reported 76% overlap in July 2025 and 38% in early 2026; BrightEdge reported 54.5%; Originality.AI about 52%. The trend across all of them is downward, driven by query fan-out and the Gemini 3 rollout, so rankings are a weakening proxy for AI citation.
Is the link between backlinks and AI visibility the same across industries?
No, and the spread is large. BrightEdge found YMYL verticals like healthcare and insurance at 68% to 75% overlap between organic rankings and AI citations, while e-commerce showed almost none. A finance or health brand should expect link authority to matter heavily; a retail brand should lean on brand mentions, structured product data, and platform presence.
How many backlinks do I need before AI visibility improves?
There is no fixed count, because the effect is threshold-based on aggregate authority, not volume. Incremental links below a certain authority level produce little to no AI visibility. The faster path across is a few links from high-authority, topically relevant domains plus an expanding set of unique referring domains, not many weak links.
Do image links really matter more than text links for AI?
For domains that already hold authority, yes. The Semrush study found image-based backlinks correlated more strongly with AI visibility (Pearson 0.415) than text links (0.334), strongest on Perplexity and ChatGPT with search. Original charts get embedded and credited widely, generating links and entity context at once. The effect is weak for low-authority domains.
Can I get cited by AI without ranking on page one?
Yes, and increasingly so. With only 38% of AI Overview citations coming from organic top-10 pages in early 2026, more than 60% now come from outside page one. Query fan-out lets Google cite pages that answer a sub-query well even if they do not rank for the literal query. Comprehensive topical coverage and extractable, structured answers can earn citations a single top-10 position cannot.
Does building authority for AI visibility improve conversion too?
No. Authority and citations affect whether you get found and clicked. They do nothing for whether the visitor converts, which is a separate and usually larger lever. Treating a citation win as a revenue win skips the conversion step, where intent-matched on-site experience does the work no backlink can.
Key takeaways
- Backlinks are not a direct LLM ranking factor. They influence AI visibility through the retrieval layer, where live grounding pulls from an index that still ranks on links, and barely touch the parametric layer.
- The effect is threshold-based. Below a minimum authority level, incremental link building produces little AI visibility; above it, authority strongly predicts citation frequency.
- Link quality and unique referring domains beat raw volume. Authority Score correlated with AI mentions at Pearson 0.65; raw backlink count far more weakly.
- Brand search volume was the strongest single predictor of LLM citations (0.334), ahead of backlinks. Unlinked mentions feed the parametric layer, and nofollow links perform almost like follow links.
- The ranking-to-citation overlap is dropping fast, from 76% to 38% in Ahrefs’ data, because of query fan-out, making rankings a weakening proxy for AI citation.
- Industry matters. YMYL verticals show 68% to 75% overlap; e-commerce shows almost none.
- Extraction signals move the needle: original statistics raised AI visibility about 22%, quotations about 37%, and FAQ structure outperformed buried prose.
- Authority gets you cited; it does not get you converted. The conversion lever on AI-referred traffic is usually larger than the visibility lever and needs real-time, intent-matched experience.