The Future of the AI-Driven Research Ecosystem

Associate Professor @ Stanford

Talk (tentative)From CRISPR-GPT to LabOS: closing the loop between AI scientists and the physical lab.

Co-developed CRISPR-Cas9 genome editing (Science 2013); now builds AI co-scientists for the wet lab — CRISPR-GPT and the LabOS AI-XR co-scientist — where a research artifact must be executable and verifiable enough to drive real physical experiments safely.

Joydeep Biswas

Associate Professor @ UT Austin

Talk (tentative)AI-assisted peer review at AAAI and NeurIPS: review at agent throughput, humans in the loop.

Leads UT Austin's Autonomous Mobile Robotics Lab and ran the AI-assisted peer-review experiments at AAAI and NeurIPS — a live test of review and reproduction at AI-scientist throughput, with human judgment kept in the loop.

Nihar Shah

Associate Professor @ CMU

Talk (tentative)The science of evaluation: experiments on peer review, and the hidden cost of automating it.

Conducts foundational research on the algorithms and integrity of scientific peer review. His methods have been deployed across 200+ venues — including OpenReview — in the evaluation of over 100,000 papers.

Lianhui Qin

Assistant Professor @ UC San Diego

Talk (tentative)Multi-agent collaboration and self-evolving agents for scientific reasoning.

Builds AI agents that reason, learn, and collaborate in complex environments — multi-agent collaboration, self-evolving agents that learn during deployment, and reasoning systems for scientific discovery. PhD from University of Washington with Yejin Choi.

Audrey Cheng

UC Berkeley · Incoming Assistant Professor

Talk (tentative)AI-Driven Research for Systems (ADRS): agents doing real database and systems research.

Creator of TAOBench (deployed at Meta, PlanetScale, TiDB) and co-lead of AI-Driven Research for Systems (ADRS) — a live test of whether AI scientists can drive real engineering research, and how to measure them when they do.

Yao Li

Assistant Professor @ Portland State University

Talk (tentative)Proof assistants for AI scientists: making agent claims verifiable, not just plausible.

Uses interactive theorem provers and dependent types to formally verify real-world programs, bringing proof-assistant rigor to AI-scientist claims.

Bodhisattwa Majumder

Research Scientist @ Ai2

Talk (tentative)Autonomous discovery at Ai2: DataVoyager, AutoDS, and AstaBench.

Leads autonomous, data-driven scientific discovery at Ai2 (DataVoyager, AutoDS) and builds the AstaBench scientific-agent benchmark.

Yue Zhang

Head of Agent Research @ Scale AI

Panel chairChairs the closing panel: ecosystem-level evaluation when the producer is an agent.

Leads Agent Research at Scale AI on LLM-agent evaluation and benchmarking at scale.

Five tracks tracing the path of a unit of research.

We organize the call around how a unit of research is produced, represented, verified, composed with other work, and evaluated at the level of the whole network. One foundational track measures the producer; the rest rebuild the layers its output flows through. We prioritize contributed empirical and technical work; position and vision papers are welcome but capped as a minority track.

09:00 – 09:10	Opening remarks
09:10 – 10:30	Invited Talks 1–2 (40 min each)
10:30 – 10:50	Coffee break
10:50 – 11:30	Debate — a provocative motion on the future of the AI-driven research ecosystem (two sides + audience vote)
11:30 – 12:50	Invited Talks 3–4
12:50 – 13:50	Lunch
13:50 – 15:10	Invited Talks 5–6
15:10 – 15:20	Break
15:20 – 16:00	Panel + moderated discussion
16:00 – 16:05	Closing remarks
16:05 – 18:00	Poster & Demo Session (contributed papers)

The Future of the AI-Driven Research Ecosystem

Treating AI scientists as first-class citizens of research.

Five tracks tracing the path of a unit of research.

Author information.

Key dates (tentative).

A full day that builds toward debate.