Shashank Gupta

I'm a researcher at the Allen Institute for AI (Ai2) in Seattle.

My current research centers on developing agentic large language models—systems that can plan, reason, use tools and APIs, write code, and interact with complex environments to accomplish real tasks. I study how training data, objectives, and learning environments shape the emergence of robust and generalizable agentic behavior, and how such capabilities can be measured and controlled in reliable ways.

CV | E-Mail | Google Scholar | Semantic Scholar | Github | Twitter

If you're excited about advancing open agentic LLMs that can reason, plan, and use tools, we're currently hiring interns—apply here or reach out to me directly!
profile photo
Selected Projects

[NEW] OLMo 3
Olmo Team*, Allyson Ettinger, Amanda Bertsch, [...], Shashank Gupta, [...], Noah A. Smith, Hannaneh Hajishirzi
* Core Contributor

paper | code | data & models | blog | X (Twitter)

sym

LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy

ICLR 2025 (Oral)
paper | code | X (Twitter)

sym

SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot

EMNLP 2024
🏆 Outstanding Paper Award 🏆
paper | code | dataset | X (Twitter)

sym

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Harsh Trivedi, Tushar Khot, Mareike Hartmann, [...], Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian

ACL 2024
🏆 Best Resource Paper Award 🏆
paper | website | code | video | poster | X (Twitter)

sym

Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot

ICLR 2024
paper | website | code | poster | data | X (Twitter)

sym

Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan, Niket Tandon, [...], Shashank Gupta, [...], Peter Clark

NeurIPS 2023
paper | website | code | poster | Forbes

sym

Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners
Shashank Gupta, Subhabrata Mukherjee, Krishan Subudhi, Eduardo Gonzalez, Damien Jose, Ahmed H. Awadallah, Jianfeng Gao

Microsoft AI Journal, 2022
paper

sym

Knowledge Infused Decoding

Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah

ICLR 2022
paper | code

sym

Exploring Low-Cost Transformer Model Compression for Large-Scale Commercial Reply Suggestions
Vaishnavi Shrivastava*, Radhika Gaonkar*, Shashank Gupta*, Abhishek Jha

*equal contribution
Microsoft AI Journal, 2021
paper | blog post

sym

CogCompNLP: Your Swiss Army Knife for NLP
Daniel Khashabi, Mark Sammons, [...], Shashank Gupta, [...], Dan Roth

LREC 2018
paper | code

sym

Web-scale entity annotation using MapReduce
Shashank Gupta, Varun Chandramouli, Soumen Chakrabarti

HiPC 2013
paper | slides


Template credits: Unnat, and Jon