Does AI Help Humans Make Better Decisions? A Statistical Evaluation Framework for Experimental and Observational Studies.

4/24/25 | 4:15pm | E62-276

Kosuke Imai

Professor of Government and of Statistics
Harvard University

Abstract: The use of Artificial Intelligence (AI), or more generally data-driven algorithms, has become ubiquitous in today’s society. Yet, in many cases and especially when stakes are high, humans still make final decisions. The critical question, therefore, is whether AI helps humans make better decisions compared to a human-alone or AI-alone system. We introduce a new methodological framework to empirically answer this question with a minimal set of assumptions. We measure a decision maker’s ability to make correct decisions using standard classification metrics based on the baseline potential outcome. We consider a single-blinded and unconfounded treatment assignment, where the provision of AI-generated recommendations is assumed to be randomized across cases with humans making final decisions. Under this study design, we show how to compare the performance of three alternative decision-making systems — human-alone, human-with-AI, and AI-alone. Importantly, the AI-alone system includes any individualized treatment assignment, including those that are not used in the original study. We also show when AI recommendations should be provided to a human-decision maker, and when one should follow such recommendations. We apply the proposed methodology to our own randomized controlled trial evaluating a pretrial risk assessment instrument. We find that the risk assessment recommendations do not improve the classification accuracy of a judge’s decision to impose cash bail. Furthermore, we find that replacing a human judge with algorithms — the risk assessment score and a large language model in particular — leads to a worse classification performance.

Bio: Kosuke Imai is a professor in the Department of Government and the Department of Statistics at Harvard University. He is also an affiliate of the Institute for Quantitative Social Science. Before moving to Harvard in 2018, Imai taught at Princeton University for 15 years where he was the founding director of the Program in Statistics and Machine Learning. Imai specializes in the development of statistical methods and machine learning algorithms and their applications to social science research. His areas of expertise include causal inference, computational social science, and survey methodology. Imai leads the Algorithm-Assisted Redistricting Methodology (ALARM) Project, serving as an expert witness for several high-profile legislative redistricting cases. Outside of Harvard, Imai served as the President of the Society for Political Methodology from 2017 to 2019. His research has been supported by the Guggenheim Fellowship and grants from the National Science Foundation, Sloan Foundation, and other agencies and organizations.