I am a Member of Technical Staff at Inception, working from our Bangalore office. Previously I was a Research Scientist at Adobe Research with a focus on NLP and multi-modal machine learning.
I received my PhD from the Department of Computational and Data Sciences (CDS) at the Indian Institute of Science, Bangalore.
Prior to graduate school, I received my Bachelor of Engineering (BE Hons.) in Computer Science from Birla Institute of Technology and Science, Pilani (BITS Pilani) in 2015.
Research
I am broadly interested in generative AI (LLMs, computer vision) and their applications. My recent work has focused on enhancing the utility of LLMs for document-grounded tasks, such as summarization and question answering. This includes optimizing inference speed for document-centric tasks and developing methods to attribute LLM outputs back to the documents. I am also interested in design generation applications, specifically with (V)LLMs.
During my undergraduate studies I worked on humanoid robotics and computer vision.
Selected publications
Saxena A. Prompt Lookup Decoding.
Developed a method to speed-up LLM decoding, integrated in transformers and vLLM.
A. Phukan, S. Somasundaram, A. Saxena, K. Goswami & B.V. Srinivasan. “Peering into the mind of language models: An approach for attribution in contextual question answering”. Findings of the Association for Computational Linguistics: ACL 2024, 11481-11495.
Saxena A., Kochsiek A. & Gemulla R. “Sequence-to-Sequence Knowledge Graph Completion and Question Answering”. Accepted to the 2022 Annual Conference of the Association for Computational Linguistics (ACL 2022).
Saxena A., Tripathi A. & Talukdar P. “Improving Multi-hop Question Answering over Knowledge Graphs using Knowledge Base Embeddings”. Accepted to the 2020 Annual Conference of the Association for Computational Linguistics (ACL 2020).
Work Experience
Inception, Bangalore
Member of Technical Staff — September 2024 - Present
Adobe Research, Bangalore
Research Scientist — June 2022 - August 2024
IBM Research, Bangalore
Research Intern — Summer 2019
Google, Hyderabad
Software Engineer, Tools and Infrastructure — May 2016 - Mar 2017
PayPal, Chennai
Software Engineer 1 — July 2015 - November 2015
Side Projects
3D Printing Store: Everything 3D
Get custom-designed products, including personalized fridge magnets, desk signs, house nameplates and more! Order your own unique items via the store or follow the latest creations on our Instagram :)
Store: everything3dindia.com
Instagram: everything_3d_india
Podcast transcription and highlight generation
I used openai-whisper (along with pyannote and flan-t5) to automatically generate transcripts and highlights for all episodes of The Seen and The Unseen
podcast by Amit Varma. These can be browsed at https://apoorvumang.github.io/seenunseen
CV
Contact
apoorv [at] inceptionlabs.ai
apoorvumang [at] gmail.com