cv
resume
Basics
Name | Gaurav Sahu |
Label | AI Researcher |
Email | gaurav.sahu@mila.quebec |
Url | https://scholar.google.com/citations?user=nMAt7UMAAAAJ&hl=en |
Summary | Machine Learning Researcher with expertise in building AI systems that bridge cutting-edge research with real-world applications. Proven track record in developing agentic systems and LLMs for complex scientific workflows, with demonstrated experience in both exploratory research and production ML infrastructure. Passionate about transforming AI research into impactful products that deliver value at scale. |
Work
-
2025.01 - ongoing Postdoctoral Fellow
Mila - Quebec AI Institute
Focus: Enhancing Scientific Workflows with AI. Advisors: Prof. Chris Pal, Prof. Laurent Charlin
- Led LitLLM and deployed a novel RAG pipeline for automated literature discovery with a "Deep Research" variant that boosts retrieval coverage by over 5x and a planning-based generation framework that cuts content hallucination by 18-26%
- Developed Essence, an AI-based framework for literature-grounded paper analysis and claim verification during peer review
- Probing frontier AI models to evaluate their ability to rediscover complex scientific concepts in AI/ML from first principles
- Building an agentic pipeline to curate a knowledge base of Wiki-style articles for Physics-focused RAG applications
-
2021.09 - 2024.12 Visiting Researcher
ServiceNow Research
Focus: Data Augmentation & Agentic Data Analysis. Advisors: Dzmitry Bahdanau, Issam Laradji
- Proposed PromptMix, a 2-shot data augmentation strategy outperforming 5-shot and 100-shot text classification methods
- Developed MixSumm and PPSL for text summarization, matching fully supervised methods with 5% of the labeled data
- Spearheaded the creation of InsightBench, a comprehensive benchmark with 100 diverse tasks to evaluate data analytics agents, and developed AgentPoirot, a multi-step reasoning agent that autonomously discovers complex insights from data
-
2018.09 - 2024.12 Graduate Research
University of Waterloo
Focus: Computational Creativity & Multi-modal AI. Advisor: Prof. Olga Vechtomova
- Proposed a novel, interpretable framework to computationally measure artistic inspiration in poetry and predict aesthetic preferences of creatives, outperforming a 450-shot LLaMA classifier by 18 points on the curated EvocativeLines dataset
- Trained a bi-modal CVAE+LSTM architecture with an adversarial loss for LyricJam Sonic, a real-time, bi-modal system for music and lyric co-creation; also implemented a BERT filter to improve the coherence of the served lyrics at runtime
- Designed fusion mechanisms (Auto-Fusion and GAN-Fusion) to adaptively combine multi-modal data sources for improved emotion recognition and machine translation; also worked on the Multi-Modal Discussion Transformer (mDT), which integrates text, image, and graph transformer data to more effectively detect hate speech in Reddit discussions
- Proposed adversarial alignment for multi-turn dialogs [16] and studied racial and ethnic bias in bots versus humans
-
2016.07 - 2018.06 Undergraduate Research
IIT Kharagpur
Focus: Linguistics-grounded NLP. Advisor: Prof. Pawan Goyal
- Designed program synthesis models for morphological inflection in English, and the first energy-based model for Sanskrit
Education
-
2020 - 2024 Waterloo, ON, Canada
PhD
University of Waterloo
Computer Science
- Thesis: Harnessing Generalist LLMs for Diverse Objective and Subjective NLP Tasks
-
2018 - 2020 Waterloo, ON, Canada
M.Math
University of Waterloo
Computer Science
- Thesis: Adaptive Fusion Techniques for Effective Multimodal Deep Learning
-
2014 - 2018 Kharagpur, WB, India
B.Tech (Hons)
IIT Kharagpur
Manufacturing Science and Engineering (Mechanical Engineering)
- Thesis (in Computer Science): Program Synthesis for Natural Language
Publications
-
2025 Balancing indeterminacy and structure: Neural text generation for artistic inspiration
International Conference on Computational Intelligence in Music, Sound, Art and Design (Part of EvoStar)
Olga Vechtomova and Gaurav Sahu.
-
2025 InsightBench: Evaluating business analytics agents through multi-step insight generation
ICLR 2025
Gaurav Sahu, Abhay Puri, Juan Rodriguez, Amirhossein Abaskohi, Mohammad Chegini, Alexandre Drouin, Perouz Taslakian, Valentina Zantedeschi, Alexandre Lacoste, David Vazquez, et al.
-
2025 A guide to effectively leveraging LLMs for low-resource text summarization: Data augmentation and semi-supervised approaches
Findings of the Association for Computational Linguistics: NAACL 2025
Gaurav Sahu, Olga Vechtomova, and Issam H Laradji.
-
2025 LitLLMs, LLMs for literature review: Are we there yet?
Transactions on Machine Learning Research
Shubham Agarwal*, Gaurav Sahu*, Abhay Puri*, Issam H Laradji, Krishnamurthy Dj Dvijotham, Jason Stanley, Laurent Charlin, and Christopher Pal.
-
2024 Computational modeling of artistic inspiration: A framework for predicting aesthetic preferences in lyrical lines using linguistic and stylistic features
arXiv preprint arXiv:2410.02881
Gaurav Sahu and Olga Vechtomova.
-
2023 LLM aided semi-supervision for efficient extractive dialog summarization
Findings of the Association for Computational Linguistics: EMNLP 2023
Nishant Mishra, Gaurav Sahu, Iacer Calixto, Ameen Abu-Hanna, and Issam Laradji.
-
2023 PromptMix: A class boundary augmentation method for large language model distillation
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Gaurav Sahu, Olga Vechtomova, Dzmitry Bahdanau, and Issam Laradji.
-
2022.05 Data augmentation for intent classification with off-the-shelf large language models
Proceedings of the 4th Workshop on NLP for Conversational AI
Gaurav Sahu, Pau Rodriguez, Issam Laradji, Parmida Atighehchian, David Vazquez, and Dzmitry Bahdanau.
Volunteer
-
2019 - Present Mentor & 3x World Finalist
Technovation Girls
Mentored girls aged 8-18 in solving societal problems with technology; teams reached the world finals three times.
-
2017 - 2018 Mentor
Student Welfare Group, IITKGP
Helped 7 undergraduates successfully navigate their initial years at IIT Kharagpur.
Awards
- 2025
Best Paper Award
International Conference on Computational Creativity (ICCC)
Built an interpretable framework for modeling inspiration in creative individuals.
- 2023.2025
- 2024
- 2020.2024
- 2014.2018
Recipient of Prime Minister Scholarship
Government of India
For passing the JEE Advanced 2014 examination (< 0.1% acceptance rate)
-
State-level Silver medalist
6th International Mathematics Olympiad (conducted by the Science Olympiad Foundation)
Skills
Technical Skills | |
PyTorch | |
Python | |
HuggingFace | |
Scikit-learn | |
Git | |
Linux | |
NLTK | |
Flask | |
Pandas | |
NumPy | |
vLLM | |
Wandb |
Professional Services | |
Area Chair: NAACL, EMNLP, ACL | |
Reviewer: ACL, NAACL, EMNLP, COLM, ACM Multimedia, AAAI |
Languages
Hindi | |
Native Speaker |
English | |
Fluent |
Japanese | |
Intermediate |
French | |
Beginner |
Interests
Research Interests | |
Natural Language Processing | |
Large Language Models | |
Generative AI | |
Synthetic Data Generation |