Publications

2026

  1. An Empirical Investigation of Robustness in Large Language Models under Tabular Distortions Avik Dutta, Harshit Nigam, Hosein Hasanbeig, Arjun Radhakrishna, Sumit Gulwani Preprint [Paper]

2025

  1. ConDABench: Interactive Evaluation of Language Models for Data Analysis Avik Dutta, Priyanshu Gupta, Hosein Hasanbeig, Rahul Pratap Singh, Harshit Nigam, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari ACM SIGMOD 2026 [Abs] [Paper]

2024

  1. RAR: Retrieval-augmented retrieval for code generation in low resource languages Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le EMNLP 2024 [Abs] [Paper] [Presentation]
  2. Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context Somnath Banerjee, Amruit Sahoo, Sayan Layek, Avik Dutta, Rima Hazra, Animesh Mukherjee EMNLP 2024 [Abs] [Paper]
  3. DistALANER: Distantly Supervised Active Learning Augmented Named Entity Recognition in the Open Source Software Ecosystem Somnath Banerjee, Avik Dutta, Aaditya Agrawal, Rima Hazra, Animesh Mukherjee ECML-PKDD 2024 [Abs] [Paper] [Code]
  4. Redefining Developer Assistance: Through Large Language Models in Software Ecosystem Somnath Banerjee, Avik Dutta, Sayan Layek, Amruit Sahoo, Sam Conrad Joyce, Rima Hazra Preprint [Abs] [Paper]