Peter Cohen

PhD Student

Operations Research Center, Massachusetts Institute of Technology

About

I am a PhD candidate at MIT’s Operations Research Center.

I work with Colin Fogarty on problems in causal inference. My earlier work centered around observational studies and the associated sensitivity analyses. More recently I have been working on inference procedures for nonparametric Behrens–Fisher-style problems which leverage bootstrapping.

Before coming to MIT, I graduated from Bowdoin College in 2017 with a B.A. in Mathematics. Prior to starting statistics and OR, I did some research in chemistry, number theory, and random matrix theory.

Interests

Nonparametric Statistics
Resampling and Permutation Procedures
Causal Inference
Machine Learning and Statistics

Education

PhD Candidate in Operations Research, 2022

Massachusetts Institute of Technology
B.A. in Mathematics, 2017

Bowdoin College

Publications

"No-harm Calibration for Generalized Oaxaca-Blinder Estimators " P. Cohen and C.B. Fogarty (Submitted).
"Gaussian Prepivoting for Finite Population Causal Inference " P. Cohen and C.B. Fogarty (JRSS (Series B)).
"Multivariate One-Sided Testing in Matched Observational Studies as an Adversarial Game" P. Cohen, M.A. Olson, and C.B. Fogarty (Biometrika).
"On Within-Perfectness and Near-Perfectness" P. Cohen, K. Cordwell, A. Epstein, C.H. Kwan, A. Lott, and S.J. Miller.
"Random Matrix Ensembles with Split Limiting Behavior" P. Burkhardt, P. Cohen, J. Dewitt, M. Hlavacek, S. J. Miller, C. Sprunger, Y. N. Truong Vu, R. Van Peski, and K. Yang.
"An Analytic Heuristic for Multiplicity Computation for Zaremba's Conjecture" P. Cohen
"Sticking to (First) Principles: Quantum Molecular Dynamics and Bayesian Probabilistic Methods to Simulate Aquatic Pollutant Absorption Spectra" K. Trerayapiwat, N. Ricke, P. Cohen, A. Poblete, H. Rudel, and S. N. Eustis
"Effect of Exercise on Heart-rate Response to Mental Stress in Teenagers" with A. Costin, N. Costin, P. Cohen, C. Eisenach, and F. Marchlinski

Teaching

15.075 Statistical Thinking and Data Analysis: Spring 2018 TA
Teaching assistant for an undergraduate course which aims to provide students with a theoretical understanding of fundamental techniques in statistics and data science, including linear regression and hypothesis testing, as well as a toolkit for practical implementation of statistical techniques.

Project Details

High Power Multivariate Testing with Directional Control
Hypothesizing elaborate cause-effect relationships is a dangerous game. On one hand, if data supports an elaborate relationship, then the underlying model is well supported. However, elaborate relationships often invovle testing several different outcomes. For instance, to claim that an economic intervention is effective, examining its impact through several metrics helps increase credibility. When testing multiple outcomes, corretions for multiple comparisons are necessary to avoid making errors at a high rate; these corrections often dramatically reduce the power of statistical tests. Working with Colin Fogarty and Matt Olson, we have approached the problem of testing several one-sided hypotheses simultaneously with high power in the context of observational studies. Our results are available in Biometrika, and a pre-print is also available on arXiv. Code to implement the methods in the paper is available here.
Gaussian Prepivoting
In finite population causal inference there are two central null hypotheses: Fisher's sharp null and Neyman's weak null. Fisher's sharp null stipulates that the treatment has no impact on any of the study participants whereas Neyman's weak null states that the treatment effect is zero on average for those involved in the study. The rich field of randomization testing applies well to tests of Fisher's sharp null, but can - at times - provide anti-conservative inference under Neyman's weak null. On a case-by-case basis some common test statistics have been modified so that randomization tests can be used to test both nulls with valid Type I error rate control in the large-sample limit. Furthermore, these modifications retain the extactness of randomization testing when examining only Fisher's sharp null. Working with Colin Fogarty, we have been able to construct a general procedure which modifies a given test statistic by composing it with a suitable cumulative distribution function in order to build a new statistic which is amenable to randomization inference under both Fisher's and Neyman's nulls. We show that this procedure is broadly applicable by providing a general characterization of the class of statistics for which it may be used. Important examples include the difference in means for rerandomized designs and regression adjusted estimators in CREs. The paper is available from the Journal of the Royal Statistical Society (Series B), and a pre-print of the paper can be found on arXiv. Some slides from a talk I gave on this are here.

Upcoming Talks

Nothing planned.

Contact

plcohen@mit.edu
1 Amherst St, Cambridge, MA 02142