Hi, I'm
Abhik Bhattacharjee
I am a Research Assistant at the Department of Computer Science and Engineering at Bangladesh University of Engineering and Technology, working with the BUET CSE NLP Group on Machine Learning and Natural Language Processing. I am fortunate to be supervised by Prof. Rifat Shahriyar. Previously, I also got my bachelor's degree from the same department, completing my thesis under his supervision.
My current research interests broadly lie in Natural Language Processing and Systems for Machine Learning. The core motivation of my research is to develop robust and generalizable machine learning systems that work across a wide range of tasks and modalities equipped with large-scale knowledge. Most of my previous work has focused on efficient data and computing utilization in the context of low-resource languages and multilingual/cross-lingual language models. In particular, for the past three years, I completed multiple projects on Machine Translation, Natural Language Understanding, and Text Summarization. During this period, I had the privilege to closely collaborate with Tahmid Hasan (BUET), Dr. Wasi Ahmad (AWS AI), Dr. Yuan-Fang Li (Monash), and Dr. Yong-Bin Kang (Swinburne).
Besides academic activities, I love picking up a good read whenever I can. I recently got into playing chess in my leisure time, and it's on my bucket list to become a FIDE Master sometime in the future.
Recent News
July, 2024
Our papers "IllusionVQA" and "BanglaContextualBias" got accepted at COLM 2024 and ACL 2024 (Findings), respectively! "IllusionVQA" was also featured in Scientific American!
May, 2023
Our paper "CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs" got accepted at ACL 2023!
March, 2023
September, 2022
Our paper "GEMv2: Multilingual NLG Benchmarking in a Single Line of Code" got accepted at EMNLP 2022 demo track!
July, 2022
Our paper "BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset" got accepted at AACL 2022!
April, 2022
Selected Publications
(* indicates equal contribution)
CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs
Abhik Bhattacharjee*, Tahmid Hasan*, Wasi Uddin Ahmad, Yuan-Fang Li, Yong-Bin Kang, Rifat Shahriyar
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla
Abhik Bhattacharjee*, Tahmid Hasan*, Wasi Uddin Ahmad, Kazi Samin, Md Saiful Islam, M. Sohel Rahman, Anindya Iqbal, Rifat Shahriyar
XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages
Tahmid Hasan*, Abhik Bhattacharjee*, Md. Saiful Islam, Kazi Mubasshir, Yuan-Fang Li, Yong-Bin Kang, M. Sohel Rahman, Rifat Shahriyar