Experience

Technology Innovation Institute (TII)📍Abu Dhabi, UAE

👨‍💻 Machine Learning Engineer ⌛ 2023/Oct - Present (~1 year and 8 months +)

Exploring LLM-based audio synthesis using neural audio codecs and its codebook compression with knowledge distillation.
Developing CLIP-style audio-representation learning framework with Whisper encoder and synthetic dense audio captions.
Developed audio-understanding LLMs that surpass prior open-weight models by 10+ points on multi-domain benchmarks (speech, music, sound) like MMAU and AirBench, leveraging a simpler architecture, single-stage training, and under 30K hours of public data.
Developed VisCon, a novel LLM fine-tuning dataset with leaky visual conversations and contextual information for enhanced visual QA.
Integrated image understanding in text-only LLMs via distributed fine-tuning with CLIP’s vision encoder.
Enhanced distributed pretraining of Falcon3 LLM, integrating Triton and optimizing backward-pass for architectural changes.
Accelerated LLM Inference through self-distillation, intermediate layers prediction, and recurrent multi-token prediction.

G42’s Inception📍Abu Dhabi, UAE

👨‍💻 Applied Scientist ⌛ 2023/Jun - 2023/Aug (~3 months)

Collaborated on the development Jais, an English-Arabic LLM.
Managed the LLM Arena framework, benchmarking models (in-house, open-source, and GPT-4) with human annotators using Elo ratings.
Leveraged LLM Arena data to align LLMs with harmlessness and usefulness through RLHF and DPO techniques using the TRL package.

Microsoft Research 📍Bengaluru, India

👨‍💻 Machine Learning Research Intern ⌛ 2022/May - 2022/Aug (~3 months)

Extensively evaluated the design choices for Text-To-Speech systems and open-sourced state-of-the-art models for 13 Indian languages. [ICASSP Paper]

Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI) 📍Abu Dhabi, UAE

👨‍💻Machine Learning Research Assistant ⌛2021/Jan - 2021/Jul (~7 months)

Demonstrated the target classifier and attack-agnostic nature of the Double Variational Autoencoder Network (DoENet), leading to improved unsupervised adversarial image classification.

TCS Research 📍Chennai, India

👨‍💻 Machine Learning Developer ⌛2019/Jul - 2020/Nov (~1 year and 5 months)

👨‍💻 Machine Learning Developer Intern ⌛2019/Apr - 2019/Jul (~3 months)

Enhanced retail sales forecasting with deep neural networks (RNNs, LSTNet) and a novel N-gram method for dynamic pricing. [US Patent]
Implemented RNN-based spatio-temporal travel time prediction, benchmarking against temporal difference-based methods. [IJCNN Paper]
Built an employee profile retrieval system using information parsing and natural language query processing. [Special Initiative Award]
Developed a winning yield prediction model for hybrid corn crops, recommending effective species crossing (internal hackathon). [Innovation Pride Award]
Designed winning computer vision pipelines for curved text extraction and perspective correction (internal hackathon with 100+ folks). [Innovation Pride Award]

IIT Madras | AI4Bharat | One Fourth Labs 📍Chennai, India

👨‍💻 Machine Learning Project Intern ⌛2018/Dec - 2019/Mar (~4 months)

👨‍💻 Summer Research Fellow ⌛2018/Jun - 2018/Aug (~2 months)

Developed a scene text translation system for Indian languages using synthetic datasets, incorporating an Efficient and Accurate Scene Text Detector (EAST) for detection, and Convolutional Recurrent Neural Networks (CRNN) for Classification and Recognition. [Dataset] [Detection] [Classification] [Recognition]
Set up programming competitions for the One Fourth Labs Deep Learning course. [Course]
Investigated attention models in Deep Learning, analyzing foreground region detection through proxy data with distinct statistical properties. [Report]

Google Sites

Report abuse