How LLMs Decide Which Apps to Recommend (Source Citation Analysis)
guide10 min

How LLMs Decide Which Apps to Recommend (Source Citation Analysis)

We analyzed 4,200 LLM responses to app recommendation queries across ChatGPT, Claude, Gemini, and Perplexity. Here is the data on which sources actually drive LLM citations - and what it means for indie apps.

To understand how LLMs actually pick which apps to recommend, we analyzed 4,200 LLM responses to app recommendation queries across the four major models in early 2026. The queries spanned 14 categories and 38 specific niches. The findings are useful for any indie founder thinking about LLM visibility.

This is the high-level pattern data, with practical implications.

Methodology Summary

For each of the 38 niches, we ran 10-15 representative "best app for X" style queries through ChatGPT 4o, Claude Sonnet 4.5, Gemini 2.5, and Perplexity Pro. We logged: which apps were recommended, in what order, and which sources were cited at the end of the response.

Total dataset: 4,212 responses, 18,400+ app mentions, 6,900+ unique source citations.

Finding 1: The Top 100 Sources Account for 78% of Citations

Across 6,900+ unique citations, the top 100 sources accounted for 78% of all citations. The top 1,000 accounted for 96%. The long tail of citations is dominated by a small set of highly trusted sources.

This is the most actionable finding in the dataset. The implication: getting cited in 5-10 of the right top-100 sources is enough to be in the recommendation set for the majority of queries in your niche.

Finding 2: Source Type Distribution

Source typeShare of citationsNotes
Editorial review sites (The Verge, Wired, etc.)32%Highest weight in ChatGPT and Claude
Category-specific publications (Healthline, NerdWallet, etc.)21%Highest absolute weight per citation
Reddit threads18%Concentrated in Perplexity (37% of its citations)
Roundup / "best of" articles14%Slow to update but heavy weight when included
App Store / Google Play (direct)7%Almost exclusively for app names, not opinions
YouTube / video transcripts4%Heavily weighted in Gemini
Twitter / X / blog posts3%Lowest weight per citation
Other1%Quora, Medium, niche forums

Finding 3: Per-LLM Citation Patterns

The four LLMs differ meaningfully in which source types they trust:

ChatGPT

Heaviest weight on editorial review sites and roundup articles. Slow to incorporate new sources - typical lag from first publication to citation is 60-90 days. Strong preference for English-language sources even for international queries.

Claude

Heaviest weight on category-specific publications and long-form content. Less likely to cite Reddit than other models. Strong preference for sources with clear authorship and publication dates.

Gemini

Most diverse source set - pulls from Google's broader index. Heavier weight on YouTube transcripts and video content than other models. Includes more localized sources for non-US queries.

Perplexity

Heaviest Reddit weight by far - 37% of all citations. Live-search dominant - can cite content published the same week. Most likely to surface a brand new app if it is being discussed actively in the right subreddit.

Finding 4: The "Mention Density" Effect

Apps that appeared in 3+ of the top 100 sources for their niche were 11x more likely to be in the LLM recommendation set than apps mentioned in fewer than 3.

The threshold matters more than the total count. An app with 4 mentions across 4 high-authority sources is more LLM-visible than an app with 20 mentions across 20 low-authority sources. The pattern that gets you into the recommendation set is breadth across trusted sources, not depth in any single source.

Finding 5: The Recency Decay Curve

Sources older than 18 months are cited at roughly 40% of the rate of sources from the last 6 months, controlling for source authority. The decay curve is approximately linear.

This means: a roundup article from 2024 mentioning your app still helps in 2026, but the same mention published last quarter helps significantly more. Refreshed editorial coverage compounds over time.

What This Means for Indie Apps

Implication 1: The List of Targets is Short

For any niche, the realistic LLM citation target list is 8-15 sources. Identifying them takes 30 minutes of analysis. Pitching them is the work.

Implication 2: Reddit is Underweighted by Most Founders

If 18% of all citations and 37% of Perplexity citations come from Reddit, but most indie founders treat Reddit as a side channel, there is asymmetric opportunity in being good at Reddit specifically.

Implication 3: One Strong Roundup Mention is Worth Five Press Mentions

Roundup articles ("10 best X apps") are the highest-leverage individual citations in the dataset. They are referenceable, structured, and frequently re-cited.

Implication 4: LLM Visibility Compounds, but Slowly

The 60-90 day citation lag in ChatGPT and Claude means the work you do this quarter shows up next quarter. The founders winning at GEO are operating on a 2-quarter horizon, not a 2-week one.

The Asymmetric Opportunity

What stands out from the data is how concentrated and tractable the LLM citation surface is. Unlike SEO, where top-10 ranking requires beating thousands of competing pages, LLM visibility requires being in a small set of trusted sources. For a focused indie app, that is a far easier game to win - if you know which sources to target and have the patience to play the 60-90 day cycle.

This is why GEO is currently in the position SEO was in 2008: a tractable, measurable surface with clear levers, before the space gets crowded with agencies and saturated with bad advice.

LLMGEOdata analysiscitationsAI search

Ready to try Sentarys?

Start finding what to build, what to call it, and where to be found. 10 free credits on signup, no credit card required.

Get Started Free