When a government tests a housing or minimum wage policy on a simulated population,
the quality of those predictions depends on one thing: whether the simulated people
match the real ones. We argue that without empirical demographic grounding,
current LLM-based simulations of public opinion introduce systematic, measurable
distortions into who is represented and whose preferences are modeled — distortions
that are directional, not random, and that compound existing biases in LLM-internal
opinion representations.
CivicSim provides three empirical demonstrations of these distortions and proposes
a corrective framework. Every demographic modeling decision is treated as an
empirical question, not a design preference.