Computational Social Science

Uniting computer science, statistics, and social science to solve real-life problems through mass collaboration on path-breaking transparent research in partnership with industry, government, and civil society.


Virtual Lab

Building a state-of-the-art infrastructure for high-throughput experiments by large teams in open science format. 

Media and Democracy

Exploring how new media companies are changing the way we receive information and shaping the views we form about the world.

Using cutting-edge statistical techniques to analyze police-civilian interactions, measure racial bias in policing, evaluate policing policy reforms, and improve the performance of policing agencies.

Digital Organization

Using large-scale data on social interactions to predict and understand individual and group behavior, and to design new digital tools for more effective collaboration and collective action across organizational, spatial, and social boundaries.

Projects and Publications

Virtual Lab

Experimentally-based social science today is moving slower and finding less than it could be. Although recent advances in digital technologies and crowdsourcing services allow individual experiments to be deployed and run faster than in traditional physical labs, a majority of experiments still focus on one-off results that do not generalize easily to real-world contexts or even to other variations of the same experiment. To achieve replicable, generalizable, scalable and ultimately useful social science a fundamental rethinking of the model of virtual-laboratory style experiments is required. Not only is it possible to design and run experiments that are radically different in scale and scope than was possible in an era of physical labs; this ability allows us to ask fundamentally different types of questions than have been asked historically of lab studies. However, taking full advantage of this new and exciting potential will require four major changes to the infrastructure, methodology, and culture of experimental science: (1) significant investments in software design and participant recruitment; (2) innovations in experimental design and analysis of experimental data, (3) adoption of new models of collaboration, and (4) a new understanding of the relationship between theory and experiment. The Virtual Lab project pursues this ambitious path to facilitate a new class of scientific advances in our understanding of social phenomena.

Project Ratio

Since the 2016 US presidential election, the deliberate spread of misinformation online, and on social media in particular, has generated extraordinary concern, in large part because of its potential effects on public opinion, political polarization, and ultimately democratic decision making. However, proper understanding of misinformation and its effects requires a much broader view of the problem, encompassing biased and misleading–but not necessarily factually incorrect–information that is routinely produced or amplified by mainstream news organizations. Project Ratio measures the origins, nature, and prevalence of misinformation, broadly construed, as well as its impact on democracy.

A Dynamic Model of Speech for the Social Sciences

Speech and dialogue are the heart of politics: Nearly every political institution in the world involves verbal communication. Yet vast literatures on political communication focus almost exclusively on what words were spoken, entirely ignoring how they were delivered—auditory cues that convey emotion, signal positions, and establish reputation. We develop a model that opens this untapped information to principled statistical inquiry: the model of audio and speech structure (MASS). Our approach models political speech as a stochastic process shaped by fixed and time-varying covariates, including the history of the conversation itself. In an application to Supreme Court oral arguments, we demonstrate how vocal delivery signals crucial information— skepticism of legal arguments—that is indecipherable to text models. Results show that justices do not use questioning to strategically manipulate their peers, but rather engage in genuine fact-finding efforts. Our easy-to-use R package implements the model and many more tools for audio analysis.

Automated Analysis of Police Body-Worn Camera Footage

Why have body-worn cameras failed to live up to their hype as transformative tools for police accountability? In conversations with police commanders, we consistently hear that agencies lack the capacity to manually review hundreds or even thousands of hours of footage, nearly all of which is deleted before review due to the financial strain of video storage. We are working to develop automated techniques for analyzing massive-scale footage from police body-worn cameras (BWCs). With university seed funding, our team of computer-vision, speech analysis, and social science experts is annotating videos to train artificial intelligence algorithms, evaluating state-of-the-art analytic tools, and communicating closely with police departments to develop practical tools for police supervisors.

Do Online Video Recommendation Algorithms Increase Polarization?

Based largely on vivid anecdotes, a media narrative has taken hold ascribing worrisome power to hidden algorithms. Academic researchers have struggled to add evidence to this debate due to both data limitations and serious research design challenges. We undertake a novel experimental approach that seeks to resolve the impasse on this critical question by randomizing the recommendations themselves. Our team has built an interface that serves sequences of videos using the recommendation tree from YouTube’s API. We create our own variations on the existing algorithm by injecting differing proportions of cross-cutting video content (algorithmically matched according to several criteria) to randomly assigned groups of subjects. This approach enables an attractive combination of experimental control and external validity without the need for reverse-engineering YouTube’s blackbox recommendations. Subjects in our study will choose to watch a sequence of short videos and follow recommendations while the interface tracks watch time and other measures of engagement. After the viewing session, participants will be surveyed on issue opinions, affective polarization, and perceptions of media hostility to their preferred viewpoint. If online video recommendation algorithms drive affective polarization, our design will provide rigorous evidence of the extent and magnitude of the effect.

Knox, Dean, and Jonathan Mummolo. “Toward a General Causal Framework for the Study of Racial Bias in Policing.” Journal of Political Institutions and Political Economy 1, no. 3 (August 2020): 341–78.

Lazer, David MJ, Alex Pentland, Duncan J. Watts, et al. “Computational social science: Obstacles and opportunities.” Science 369, no. 6507 (August 2020): 1060-1062.

Heck, Patrick R., Christopher F. Chabris, Duncan J. Watts, and Michelle N. Meyer. “Objecting to experiments even while approving of the policies or treatments they compare.” Proceedings of the National Academy of Sciences 117, no. 32 (July 2020): 18948-18950.

Almaatouq, Abdullah, Joshua Becker, James P. Houghton, Nicolas Paton, Duncan J. Watts, and Mark E. Whiting. “Empirica: a virtual lab for high-throughput macro-level experiments.” arXiv preprint arXiv:2006.11398 (June 2020).

Salganik, Matthew J., Ian Lundberg, Alexander T. Kindel, Caitlin E. Ahearn, et al. “Measuring the predictability of life outcomes with a scientific mass collaboration.” Proceedings of the National Academy of Sciences 117, no. 15 (April 2020): 8398-8403.

Allen, Jennifer, Baird Howland, Markus Mobius, David Rothschild, and Duncan J. Watts. “Evaluating the fake news problem at the scale of the information ecosystem.” Science Advances 6, no. 14 (April 2020): eaay3539

Knox, Dean, Will Lowe, and Jonathan Mummolo. “Administrative Records Mask Racially Biased Policing.” American Political Science Review 114, no. 3 (January 2020): 619–37.

In the News

How Misinformation Hurts Democracy

David M. Rothschild, a Wharton graduate and economist with Microsoft Research, speaks with Wharton Business Daily on SiriusXM about the impact of misinformation on democracy.

Dean Knox on Wharton Business Daily

Dean Knox, Wharton Assistant Professor of Operations, Information and Decisions, joins Wharton Business Daily host Dan Loney to discuss how systemic racism manifests in policing.

How Data Science Can Win the Debate on Police Reform

Wharton’s Dean Knox and Princeton’s Jonathan Mummolo discuss their research on racial bias in policing.

The Team

Computational Social Science encompasses two collaborative research teams with shared interests and interrelated research agendas, lead by Professors Duncan Watts and Dean Knox.


Professor Duncan Watts - Analytics at Wharton Faculty Fellow

Duncan Watts
Stevens University Professor & twenty-third Penn Integrates Knowledge Professor

Professor Dean Knox - Analytics at Wharton Faculty Fellow

Dean Knox
Assistant Professor
Operations, Information and Decisions

Affiliated Scholars

Abdullah Almaatouq
Assistant Professor, Information Technology
Massachusetts Institute of Technology

Jonathan Mummolo
Assistant Professor, Politics and Public Affairs
Princeton University

David M. Rothschild
Microsoft Research


Valery Yakubovich
Executive Director

Rachel Mariman
Research Project Manager

Homa Hosseinmardi
Research Scientist

Mark Whiting
Postdoctoral Researcher

James Houghton
Postdoctoral Researcher