Hi! I am a Machine Learning Researcher @ Verisk Analytics, with an immense interest and enthusiasm in the areas of Computer Vision & AI.
Prior to joining Verisk, I pursued Masters’ by Research (MS) at Centre for Visual Information Technology (CVIT), IIIT Hyderabad supervised by Prof. C V Jawahar (IIIT-H) and Prof. Vinay Namboodiri (University of Bath, UK). My Masters’ research focused on exploiting the redundancies in vision and speech modalities for cross-modal generation.
Research interests: Computer Vision, Machine Learning, Deep Learning, Speech Processing, Multi-modal Learning: Vision + Speech/Language
|Aug 2022||Accepted the PhD offer from University of Oxford! Excited to join the Visual Geometry Group (VGG), with Prof. Andrew Zisserman from October 2022!|
2 papers accepted to ACM-MM 2022!
1] Talking-Face Video Upsampling 2] Lip-to-Speech Synthesis
Successfully defended MS thesis
Thesis: Exploiting Cross-Modal Redundancy for Audio-Visual Generation
|Apr 2022||Promoted to Lead Data Scientist at Verisk Analytics|
|Feb 2022||Participated in Research Week with Google. Got a chance to interact with amazing researchers all over the world!|
Recent papers [Full list]
ACM-MMExtreme-scale Talking-Face Video Upsampling with Audio-Visual PriorsIn Proceedings of the 30th ACM International Conference on Multimedia (MM’22) 2022
ACM-MMLip-to-Speech Synthesis for Arbitrary Speakers in the WildIn Proceedings of the 30th ACM International Conference on Multimedia (MM’22) 2022