The science behind synthetic data

Synthetic respondents have moved from speculation to standard practice. Peer-reviewed studies in Political Analysis, the Journal of Marketing, and Psychology & Marketing, industry replications by EY and Qualtrics, and coverage in Harvard Business Review and MIT Sloan Management Review show that calibrated synthetic data now matches, and in some cases exceeds, traditional human-only research.

95%

EY brand-survey replication correlation

90%

of human test-retest reliability (arXiv 2025)

77%

of human-analyst themes recovered (Journal of Marketing 2025)

THE HEADLINE FINDING

EY replicated its CEO brand survey with 1,000 synthetic personas

95%

correlation with the original survey

In a double-blind test, professional services firm EY took its annual Global Brand Survey—aimed at CEOs of US companies with $1B+ in revenue—and ran it twice: once through traditional fielding, once through 1,000 synthetic personas built by Aaru.

The synthetic survey returned a 95% correlation with the real one. EY also recreated its annual Global Wealth Research Report in a single day, with a median correlation above 90% to the original six-month study.

— Toni Clayton-Hine, EY CMO. Reported in Solomon Partners (Sept 2025).

Read the case study →

PEER-REVIEWED RESEARCH

The academic case for synthetic respondents

Three papers from leading peer-reviewed journals and one arXiv preprint make the case that calibrated synthetic data reproduces human survey responses with rigour.

Journal of Marketing

Arora, Chakraborty & Nishimura · 2025 · Vol. 89(2)

AI–Human Hybrids for Marketing Research

The AI–human hybrid generates information-rich, coherent data that surpasses human-only data in depth and insightfulness, and matches human performance in theme generation. The LLM hybrid recovered 77% of the themes identified by human analysts.

DOI: 10.1177/00222429241276529 →

arXiv

Maier et al. · October 2025 · arXiv:2510.08338

LLMs Reproduce Human Purchase Intent via Semantic Similarity

Tested against 9,300 human responses across 57 personal-care surveys, the Semantic Similarity Rating method achieved 90% of human test-retest reliability. Distributional similarity to real data exceeded 0.85 (Kolmogorov–Smirnov).

Read on arXiv →
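For readers who want to see what a Kolmogorov–Smirnov style similarity looks like on rating-scale data, here is a minimal sketch. It assumes the similarity is defined as 1 minus the largest gap between the two cumulative distributions; the summary above does not spell out the definition, so treat that as an assumption. The function name `ks_similarity`, the 1 to 5 scale, and the sample ratings are all hypothetical illustrations, not data from the paper.

```python
from collections import Counter
from itertools import accumulate


def ks_similarity(human, synthetic, scale=(1, 2, 3, 4, 5)):
    """Return 1 - KS distance between two non-empty samples of ratings.

    Assumption: "distributional similarity" means 1 minus the maximum
    absolute difference between the two empirical CDFs on the scale.
    """
    def cdf(sample):
        counts = Counter(sample)
        n = len(sample)
        # Cumulative share of responses at or below each scale point.
        return list(accumulate(counts.get(point, 0) / n for point in scale))

    distance = max(abs(h - s) for h, s in zip(cdf(human), cdf(synthetic)))
    return 1.0 - distance


# Invented ratings for illustration only.
human_ratings = [1, 2, 3, 3, 4, 4, 4, 5, 5, 5]
synthetic_ratings = [2, 2, 3, 3, 4, 4, 4, 4, 5, 5]
print(round(ks_similarity(human_ratings, synthetic_ratings), 2))
```

A value of 1.0 means the two rating distributions are identical; lower values mean the synthetic sample piles up at different points on the scale than the human one.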

Political Analysis

Argyle et al. · 2023 · Cambridge University Press

Out of One, Many: Using Language Models to Simulate Human Samples

The foundational “silicon samples” paper. GPT-3 conditioned on sociodemographic backstories accurately emulates response distributions across human subgroups, successfully replicating real survey results across diverse populations.

DOI: 10.1017/pan.2023.2 →

Psychology & Marketing

Sarstedt, Adler, Rau & Schmitt · 2024 · Vol. 41(6)

Using LLMs to Generate Silicon Samples in Consumer & Marketing Research

Establishes formal academic guidelines for silicon sampling. Concludes synthetic samples hold particular promise in upstream parts of the research process: qualitative pretesting, pilot studies, and hypothesis generation.

DOI: 10.1002/mar.21982 →

INDEPENDENT INDUSTRY REPLICATION

Replications by category leaders

Qualtrics × Greenbook

0.07 SD

Cohen's d difference between calibrated synthetic respondents and real humans across 11 identical survey questions. The takeaway: calibration is what closes the gap. McLean (Feb 2026) replication of Paxton & Yang.

Read the replication study →
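To make the 0.07 SD figure concrete: Cohen's d is the difference between two group means divided by their pooled standard deviation. The sketch below uses the standard pooled-variance formula. The sample answers are invented for illustration, and how the replication aggregated d across its 11 questions is not stated here, so the per-question calculation is only an assumption about the building block.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Standardised mean difference using the pooled sample variance."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


# Invented answers to one survey question (hypothetical data).
human_answers = [3, 4, 4, 5, 5, 6, 6, 7]
synthetic_answers = [3, 4, 5, 5, 5, 6, 6, 7]
print(round(abs(cohens_d(human_answers, synthetic_answers)), 2))
```

By the usual rule of thumb, a d of 0.2 is already considered a small effect, so a gap of 0.07 standard deviations sits well below that threshold.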

PyMC Labs

90%

Alignment with human survey data and 85% distributional similarity across concept and pricing studies, with carefully calibrated synthetic consumers.

Read the analysis →

Dollar Shave Club & Gabb

~10×

Faster research timelines. Dollar Shave Club: month-long studies in weeks. Gabb: weeks of work in hours, with rank-order alignment between synthetic and human respondents.

Read the case studies →

METHODOLOGY IN PRACTICE

The rigour comes from the design, not just the model

Yatabase Fast Research uses a sequential exploratory mixed-methods design (Creswell & Plano Clark) — the textbook academic protocol for combining qualitative and quantitative phases. Every Fast Research run produces three methodologically distinct generations, all anchored to a peer-reviewed theoretical framework.

Synthetic personas

Generated against your context and scored a priori on the framework's constructs. What the archetype would care about.

→

Qualitative interviews

Open-ended conversations with each persona. Themes are extracted — both framework-aligned and emergent. What synthetic respondents say.

→

Confirmatory survey

Designed from the framework and the themes that surfaced in phase 2. How synthetic respondents score when measured.

The convergence report triangulates all three. Where they agree, you get a confirmed finding. Where they diverge, you've surfaced a research-worthy tension — exactly what real mixed-methods research is designed to do.
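The triangulation idea can be sketched in a few lines. This is not Yatabase's actual scoring logic, which is not described here. It is a hypothetical rule that assumes each method yields a score per construct on a common 0 to 1 scale and labels the construct by how far the three scores spread apart. The function name, the thresholds, and the example scores are all invented for illustration.

```python
def classify_convergence(persona, interview, survey,
                         nuanced_at=0.15, tension_at=0.35):
    """Label one construct from three method scores on a 0-1 scale.

    Hypothetical rule: the wider the spread between the three methods,
    the weaker the convergence. Thresholds are illustrative assumptions.
    """
    spread = max(persona, interview, survey) - min(persona, interview, survey)
    if spread >= tension_at:
        return "tension flagged"
    if spread >= nuanced_at:
        return "nuanced"
    return "confirmed"


# Invented scores: (persona prior, interview evidence, survey mean / 10).
example_scores = {
    "construct A": (0.80, 0.85, 0.82),
    "construct B": (0.60, 0.75, 0.85),
    "construct C": (0.30, 0.20, 0.73),
}

for name, scores in example_scores.items():
    print(name, "->", classify_convergence(*scores))
```

The design choice worth noting is that disagreement is kept as an output rather than averaged away: a wide spread becomes a flagged tension instead of a diluted mean.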

A LIVE FINDING FROM THE PIPELINE

The intent–action gap in remote-worker migration

A Yatabase Fast Research run studied digital nomads considering coworking in Ubud, anchored to the Push–Pull–Mooring Model of migration. Five constructs were triangulated across three methods.

Confirmed

Push factors, Pull factors

Nuanced

Mooring factors, Migration intention

Tension flagged

Migration behaviour

The tension is the finding

The survey said behavioural commitment was high (mean 7.3/10). The interviews disagreed sharply (low evidence). The personas didn't predict it either. The pipeline surfaced a classic intent–action gap — a phenomenon migration researchers spend whole careers studying — through pure methodological disagreement, without being prompted to look for it.

That's what triangulation across genuinely independent generations produces. It's the structural rebuttal to "you just asked an AI."

Plus: 4 of 4 interview-surfaced themes validated quantitatively

Wellness Integration

100% strong agreement · 9.5/10

Nature-Connected Workspaces

100% strong agreement · 9.6/10

Professional Growth Opportunities

92% strong agreement · 8.7/10

Cultural Immersion

81% strong agreement · 8.2/10

Themes the interviews surfaced (outside the framework) were tested in the survey designed afterwards. Every one held up at scale — the sequential exploratory design earning its keep.

MAJOR-PUBLICATION COVERAGE

Where the conversation is happening

Harvard Business Review

“LLMs, used carefully, can function as synthetic focus groups—producing insights in a fraction of the time and cost.”

Brand, Israeli & Ngwe · July 2025

MIT Sloan Management Review

“LLM-generated synthetic respondents—digital twins—enable rapid concept testing and AI-moderated interviews for qualitative research at scale.”

2025

Nature

“Synthetic data could be better than real data.”

2023

Marketing Week

“Synthetic data is as good as real—next comes synthetic strategy.”

Mark Ritson

See the science in action

Generate personas, run a survey or interview, and see what synthetic respondents reveal — in minutes.
