Building a multimodal conversational agent that understands audio and visual cues and is fine-tuned for real-time stress counseling.
Developing flow-based, diffeomorphic generative models for "Fast Shower Simulation (FastSim)" in High Energy Physics.
Neural Architecture Search (NAS) for tiny LLM architectures, implemented with both Reinforcement Learning (RL) and Bayesian Optimization search strategies.
End-to-end 2D-to-3D scene reconstruction using classical, hybrid, and learning-based strategies, including SfM, MVS, GANs, Neural Radiance Fields, and Gaussian Splatting.