I am a post-doc at MIT CSAIL, working with Prof. Costantinos Daskalakis and Prof. Antonio Torralba.

Prior to that, I spent 4 wonderful years at UT Austin, working with Prof. Alexandros Dimakis.

Before starting my Ph.D., I received my undergraduate degree in ECE from the National Technical University of Athens (NTUA).

My research focuses on generative modeling, with a particular emphasis on learning from low-quality data sources—including corrupted, synthetic, and out-of-distribution samples. Broadly, I aim to squeeze as much signal as possible from whatever data is available.

News

Internships:

Talks

The most representative talk of my latest research is the one given at Columbia Engineering, as part of the Workshop on Emerging Trends in AI. Please watch the talk here.

For a full list of the talks that I have given over the years, see below.

MIT, ML Tea Seminar Talk
2025
Mila - Quebec AI Institute
2025
Ben-Gurion University
2025
Runway ML
2025
Applied Inverse Problems Conference (AIP)
2025
NTU Singapore
2025
Simons Institute Berkeley
Youtube Video
2025
Columbia University
Youtube Video
2025
Biomedical and Astronomical Signal Processing (BASP) Conference
2025
Harvard University
2024
Google DeepMind, London Office
2024
Grundfest Lecture series (UCLA + Caltech)
Youtube Video
2024
Learning on Graphs and Geometry (LoGG) Reading Group
Youtube Video
2024
Aalto University
2024
University of Wisconsin-Madison, MLOPT Idea Seminar
Youtube Video
2023
UT Austin, GenAI IFML Workshop
2023
EleutherAI Diffusion Reading Group
Youtube Video
2023
Uppsala University
2023
Archimedes Research Unit
2023
NeurIPS Workshop Oral Presentation
2022
Rice University, Imaging and Vision Seminar
2022

Publications

For a more comprehensive list of publications, please visit my Google Scholar page.

DataComp: In search of the next generation of multimodal datasets

Published as an Oral in NeurIPS 2023 [Paper] [Code]

Citation: Samir Yitzhak Gadre, Gabriel Ilharco, Alex Fang, Jonathan Hayase, Georgios Smyrnis, Thao Nguyen, Ryan Marten, Mitchell Wortsman, Dhruba Ghosh, Jieyu Zhang, Eyal Orgad, Rahim Entezari, Giannis Daras, Sarah Pratt, Vivek Ramanujan, Yonatan Bitton, Kalyani Marathe, Stephen Mussmann, Richard Vencu, Mehdi Cherti, Ranjay Krishna, Pang Wei Koh, Olga Saukh, Alexander Ratner, Shuran Song, Hannaneh Hajishirzi, Ali Farhadi, Romain Beaumont, Sewoong Oh, Alex Dimakis, Jenia Jitsev, Yair Carmon, Vaishaal Shankar, Ludwig Schmidt, "DataComp: In search of the next generation of multimodal datasets", NeurIPS 2023