White paper

AI Content & Search

Jose Luis Paredes, PhD, Data Specialist
Gregory Druck, PhD, Chief AI Officer
Ethan Smith, CEO

Motivation

We are frequently asked about using AI to automate organic content creation. Companies are excited about the potential to dramatically reduce content creation costs and scale content more efficiently. There is also a perception that everyone is already using AI to create content, and that Google is now flooded with AI-generated content.

On the other hand, there have been widely circulated stories of sites that initially succeeded with AI-generated content and soon after lost most of their traffic, and Google continues to emphasize expertise and personal experience as key indicators of content quality.

In this white paper, we first investigate how much AI content currently appears in Google search results. Note that this paper focuses on AI-generated content in organic search results, not on the AI Overviews generated by Google. We then analyze how AI-generated content performs relative to human-written content. We find that purely AI-generated content makes up about 3% of organic search results today and generally ranks lower than human-written content.

Our results suggest exercising caution about using a purely AI-generated content strategy.

Experiment Setup

1. Keyword Selection

We select an initial list of 2200 keywords evenly distributed in the following categories:

  • Tech
  • Productivity
  • News
  • Food
  • Finance
  • Entertainment
  • Education
  • Crypto
  • Commerce
  • Local

More specifically, we select the top non-branded keywords for the top five domains in each category. We then de-duplicate the keywords, filter out nonsense or irrelevant keywords, and use that as our base list per category. For each category, we randomly select 220 keywords.
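The de-duplication and per-category sampling described above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the function name, the input shape (a dict of candidate keywords per category), and the fixed seed are assumptions, and the manual step of filtering out nonsense or irrelevant keywords is omitted.

```python
import random


def build_keyword_sample(candidates_by_category, per_category=220, seed=0):
    """De-duplicate each category's candidates, then sample per_category of them.

    candidates_by_category is a hypothetical input: {category: [keyword, ...]}.
    """
    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = {}
    for category, candidates in candidates_by_category.items():
        # Normalise case and whitespace, then de-duplicate while keeping order.
        cleaned = (kw.strip().lower() for kw in candidates if kw.strip())
        unique = list(dict.fromkeys(cleaned))
        # Sample without replacement; take everything if the base list is short.
        k = min(per_category, len(unique))
        sample[category] = rng.sample(unique, k)
    return sample
```

With ten categories and 220 keywords each, this yields the 2200 keywords mentioned above.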

2. URL Selection and Filtering

We start with the top 20 organic results for each keyword. AI detectors work best on long-form, editorial content, so we limit our analysis to articles and listicles, excluding product, category, landing, video, and other pages. To reduce costs, we also remove PDFs and very long pages.

The AI detector could not download the content of some pages, for example, in cases where the website blocks access. These pages are also excluded from the analysis.

After filtering and processing with the AI detector, we have 20280 URLs.
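The filtering rules in this step can be sketched as a simple predicate over search results. The record fields (`url`, `position`, `page_type`, `word_count`), the page-type labels, and the `max_words` cutoff are all hypothetical; the paper does not state how page types were assigned or what counts as a very long page.

```python
from urllib.parse import urlparse

# Page types kept for analysis; the label strings are an assumption.
ALLOWED_PAGE_TYPES = {"article", "listicle"}


def filter_results(results, max_position=20, max_words=5000):
    """Keep top-20 articles and listicles, dropping PDFs and very long pages.

    max_words is an illustrative value, not the cutoff used in the study.
    """
    kept = []
    for result in results:
        if result["position"] > max_position:
            continue  # outside the top 20 organic results
        if result["page_type"] not in ALLOWED_PAGE_TYPES:
            continue  # product, category, landing, video, and other pages
        if urlparse(result["url"]).path.lower().endswith(".pdf"):
            continue  # PDFs are removed to reduce cost
        if result["word_count"] > max_words:
            continue  # very long pages are removed to reduce cost
        kept.append(result)
    return kept
```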

3. AI Detection

We use Originality.ai as the AI detection tool. It returns a score between 0 and 1 indicating the likelihood that the page text was produced by a generative AI tool (such as ChatGPT, GPT-4, Gemini Advanced, or Llama 3, among others) rather than by a human writer. Numerous studies have shown that Originality.ai is one of the most accurate AI-content detectors available, with an accuracy of over 90% on multiple data sets [1]. Furthermore, several top digital content creators, news media, publishers, and writing agencies rely on Originality.ai as their primary AI-content detector.

More specifically, Originality.ai returns a global score for the entire page and per-paragraph scores. Let ai_score(k) and wc(k) be the AI score and the word count of the kth paragraph of the page content, respectively. Originality.ai computes the global ai_score as the average of the paragraph-level scores ai_score(k) for k = 1, 2, ..., P, where P is the number of paragraphs in the page of interest.
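For reference, the global score as described here is an unweighted mean of the paragraph scores. A minimal sketch, assuming the paragraph scores are already available as a list of floats (the function name is illustrative and is not part of the Originality.ai API):

```python
from statistics import mean


def global_ai_score(paragraph_scores):
    """Unweighted mean of the per-paragraph AI scores ai_score(1..P)."""
    if not paragraph_scores:
        raise ValueError("page has no paragraphs")
    return mean(paragraph_scores)
```

Because every paragraph counts equally in this average, a short paragraph influences the global score as much as a long one.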

Since our analysis relies heavily on the accuracy of the AI-content detector, we take a conservative approach: we keep only the URLs for which the detector outputs a high-confidence score for most paragraphs, and exclude URLs with uncertain or ambiguous results. First, we classify each paragraph into one of the following three groups:

  1. AI-generated paragraph if ai_score(k) ≥ 0.85
  2. Human-created paragraph if ai_score(k) < 0.15
  3. Uncertain paragraph otherwise
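The three-way rule above translates directly into code. In this sketch the function name and the label strings are illustrative; the thresholds are the ones listed.

```python
AI_THRESHOLD = 0.85     # at or above this score: AI-generated
HUMAN_THRESHOLD = 0.15  # strictly below this score: human-created


def classify_paragraph(ai_score):
    """Map a paragraph-level AI score to 'ai', 'human', or 'uncertain'."""
    if ai_score >= AI_THRESHOLD:
        return "ai"
    if ai_score < HUMAN_THRESHOLD:
        return "human"
    return "uncertain"
```

Note that the boundaries are asymmetric: a score of exactly 0.85 counts as AI-generated, while a score of exactly 0.15 falls in the uncertain group.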

Selecting the threshold values (0.85 and 0.15) is a trade-off between increasing the detector's precision and avoiding filtering out many URLs. We then compute the percentage of content for each content type as follows.

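Percentage of AI-generated content:

\[
\text{pct}_{\text{AI}} = 100 \times \frac{\sum_{k \in \Omega} wc(k)}{\sum_{k=1}^{P} wc(k)}
\]

where Ω denotes the subset of AI-generated paragraph indices, wc(k) is the number of words in the kth paragraph, and P is the number of paragraphs in the page.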

The percentages of human-created and uncertain content are computed in the same way. Notice that the longer the paragraph, the more it contributes to the page-level prediction.
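The word-count weighting can be illustrated with a short sketch. It assumes each page is given as a list of (ai_score, word_count) pairs and that the denominator is the total word count over all paragraphs; the function name and label strings are illustrative.

```python
def content_percentages(paragraphs, ai_threshold=0.85, human_threshold=0.15):
    """Return the word-weighted percentage of AI, human, and uncertain content.

    paragraphs: list of (ai_score, word_count) pairs, one per paragraph.
    """
    words = {"ai": 0, "human": 0, "uncertain": 0}
    for ai_score, word_count in paragraphs:
        if ai_score >= ai_threshold:
            label = "ai"
        elif ai_score < human_threshold:
            label = "human"
        else:
            label = "uncertain"
        words[label] += word_count  # longer paragraphs contribute more
    total = sum(words.values())
    if total == 0:
        raise ValueError("page has no words")
    return {label: 100.0 * count / total for label, count in words.items()}


# Three paragraphs: one AI (100 words), one human (300), one uncertain (100).
example = content_percentages([(0.95, 100), (0.05, 300), (0.50, 100)])
print(example)  # {'ai': 20.0, 'human': 60.0, 'uncertain': 20.0}
```

In the example, the page is 60% human-created by word count even though only one of its three paragraphs is classified as human.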

Finally, we remove pages containing a substantial proportion of ambiguous content. Doing so ensures that our final URL set contains only URLs where the AI-content detector yields high confidence across multiple paragraphs. We found that capping the percentage of uncertain content at 30% yields a more reliable data set for further study. This last filtering stage removes about 40% of the URLs processed by the AI detector, leaving 11994 URLs.
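A sketch of this final filter, together with a quick check of the reported counts. The page records and the `uncertain_pct` field are hypothetical, and treating a page with exactly 30% uncertain content as kept is an assumption, since the text does not say which side of the cap is inclusive.

```python
MAX_UNCERTAIN_PCT = 30.0  # cap on uncertain content stated in the text


def keep_confident_pages(pages, max_uncertain_pct=MAX_UNCERTAIN_PCT):
    """Keep pages whose share of uncertain content does not exceed the cap."""
    return [page for page in pages if page["uncertain_pct"] <= max_uncertain_pct]


# Consistency check on the URL counts reported in the text.
processed, remaining = 20280, 11994
removed_share = 100.0 * (processed - remaining) / processed
print(f"{removed_share:.1f}% of processed URLs removed")  # 40.9% of processed URLs removed
```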

4. AI Content Taxonomy

We divide the URLs into categories based on the proportion of AI-generated content. To be more specific, we categorize URLs into four types:

  • Human-created: < 10% AI content
  • AI-generated: ≥ 90% AI content
  • Mixed: low AI content: 10-50% AI content
  • Mixed: high AI content: 50-90% AI content

Note that AI-generated content that a human subsequently edits might lie in either Mixed category.
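The taxonomy can be expressed as a function of the page-level percentage of AI content. This is a sketch: the function name is illustrative, and assigning a page with exactly 50% AI content to the high-AI mixed category is an assumption, because the listed ranges share that boundary.

```python
def page_category(ai_pct):
    """Map a page's percentage of AI content to one of the four content types."""
    if ai_pct < 10:
        return "Human-created"
    if ai_pct >= 90:
        return "AI-generated"
    if ai_pct < 50:
        return "Mixed: low AI content"
    # Assumption: exactly 50% is placed in the high-AI mixed category.
    return "Mixed: high AI content"
```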

Results

How Much AI Content Appears in Search?

Contrary to the claim that AI content is flooding the web, only a small percentage of the results on the first two SERPs are AI-generated.

Purely AI-generated content makes up about 3% of pages, while pages with more than 50% AI content make up 12%. URLs with less than 50% AI content dominate the first 20 positions of the search results, making up the remaining 88% of the URLs used in our study.

Some categories, like Food, contain minimal purely AI-generated content, with less than 1% of the URLs. Across most categories, around 3% of the URLs are AI-generated, although Commerce stands out with 8% AI-generated. Moreover, in categories like Crypto, Commerce, Finance, and Local, roughly 20% of the URLs contain more than 50% AI-generated content.

Evaluating Rank Performance for AI-Generated Content

How well does AI-generated content rank? Is there a correlation between the quantity of AI-generated content and rank?

It’s worth noting that ranking depends on many variables, and isolating the effect of a particular variable on the rank is challenging.

With that caveat, we compare the ranks of each content type. More specifically, for each keyword, we select the top-ranking page for each AI content category. For example, for a particular keyword, the top-ranking article page confidently classified as human-created may appear at position 3, while the top-ranking article page confidently classified as AI-generated may appear at position 5. Then, we compute the average and the median of those best positions over all keywords.
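The best-position comparison can be sketched as follows. The input shape (a list of (keyword, position, category) tuples) and the function name are assumptions. The sketch also assumes that a keyword with no page in a given category is simply left out of that category's statistics, which the text does not specify.

```python
from collections import defaultdict
from statistics import mean, median


def best_position_stats(results):
    """Average and median of the best (lowest) position per keyword, by category.

    results: iterable of (keyword, position, category) tuples.
    """
    best = defaultdict(dict)  # category -> {keyword: best position}
    for keyword, position, category in results:
        current = best[category].get(keyword)
        if current is None or position < current:
            best[category][keyword] = position
    stats = {}
    for category, by_keyword in best.items():
        positions = list(by_keyword.values())
        stats[category] = {
            "mean": mean(positions),
            "median": median(positions),
            "keywords": len(positions),
        }
    return stats
```

For instance, if a keyword has human-created pages at positions 3 and 7 and an AI-generated page at position 5, only positions 3 and 5 enter the statistics for their respective categories.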

Our results suggest that pages with more AI-generated content rank lower.

The average and median best positions increase (i.e., pages rank lower) as AI content increases. For human-created content, the best position falls within the first five positions 50% of the time, while for AI-generated content, the best position falls on the second SERP 50% of the time. The results suggest that human-written content outranks AI content.

Conclusion

Our findings suggest that AI-generated content comprises a small percentage of search results and ranks lower than human-created content, and that average rank worsens as the proportion of AI-generated content increases. The AI landscape is evolving rapidly, but at the moment, our results caution against a purely AI-generated content strategy. While AI can reduce costs, when comparing the ROI of different content creation strategies, it is important to note that human-written content performs better in search.

Things are changing quickly — AI is improving and Google algorithms are adapting — so we will continue to study the performance of AI content over time.
