I'm a researcher on the Aristo team at the Allen Institute for AI (AI2) in Seattle.
Broadly, I am interested in building general-purpose AI agents that can:
(1) reason about complex problems through natural language rationalization,
(2) faithfully express their uncertainty in their knowledge and reasoning, and
(3) continuously improve through self-reflection and human feedback.
My current research focus is on AI for Scientific Discovery. I am particularly interested in AI for Math (e.g., Automated Theorem Proving, better foundation models for Math),
building agents for finding supporting/contrary evidence from literature, and retrieval-augmented and memory architectures for supporting these use cases.
CV |
E-Mail |
Google Scholar |
Semantic Scholar |
Github | Twitter
|
|
|
SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories
Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot
To appear at EMNLP 2024
paper |
code |
dataset
|
|
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
Harsh Trivedi, Tushar Khot, Mareike Hartmann, [...], Shashank Gupta, Ashish Sabharwal, Niranjan Balasubramanian
🏆 ACL'24 Best Resource Paper 🏆
paper |
website |
code |
video |
poster |
X (Twitter)
|
|
LLM-SR: Scientific Equation Discovery via Programming with Large Language Models
Parshin Shojaee, Kazem Meidani, Shashank Gupta, Amir Barati Farimani, Chandan K Reddy
Preprint (2024)
paper |
code
|
|
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande, Ashwin Kalyan, Peter Clark, Ashish Sabharwal, Tushar Khot
ICLR 2024
paper |
website |
code |
poster |
data |
X (Twitter)
|
|
Self-Refine: Iterative Refinement with Self-Feedback
Aman Madaan, Niket Tandon, [...], Shashank Gupta, [...], Peter Clark
NeurIPS 2023
paper |
website |
code |
poster |
Forbes
|
|
Sparsely Activated Mixture-of-Experts are Robust Multi-Task Learners
Shashank Gupta, Subhabrata Mukherjee, Krishan Subudhi, Eduardo Gonzalez, Damien Jose, Ahmed H. Awadallah, Jianfeng Gao
Microsoft AI Journal, 2022
paper
|
|
Knowledge Infused Decoding
Ruibo Liu, Guoqing Zheng, Shashank Gupta, Radhika Gaonkar, Chongyang Gao, Soroush Vosoughi, Milad Shokouhi, Ahmed Hassan Awadallah
ICLR 2022
paper |
code
|
|
Exploring Low-Cost Transformer Model Compression for Large-Scale Commercial Reply Suggestions
Vaishnavi Shrivastava*, Radhika Gaonkar*, Shashank Gupta*, Abhishek Jha
*equal contribution
Microsoft AI Journal, 2021
paper |
blog post
|
|
CogCompNLP: Your Swiss Army Knife for NLP
Daniel Khashabi, Mark Sammons, [...], Shashank Gupta, [...], Dan Roth
LREC 2018
paper |
code
|
|
Web-scale entity annotation using MapReduce
Shashank Gupta, Varun Chandramouli, Soumen Chakrabarti
HiPC 2013
paper |
slides
|
|