Sindhu B. Hegde

PhD Student, University of Oxford

Hi! I am a second year PhD student in the Visual Geometry Group (VGG) at the University of Oxford, supervised by Prof. Andrew Zisserman. My research is in Computer Vision, particularly in multimodal learning, video understanding and self-supervised learning.

Prior to joining Oxford, I worked as a Lead Data Scientist @ Verisk Analytics. Before that, I pursued Masters’ by Research (MS) at Centre for Visual Information Technology (CVIT), IIIT Hyderabad supervised by Prof. C V Jawahar (IIIT-H) and Prof. Vinay Namboodiri (University of Bath, UK). My Masters’ research focused on exploiting the redundancies in vision and speech modalities for cross-modal generation.

Research interests: Computer Vision, Machine Learning, Deep Learning, Video Understanding, Multi-modal Learning: Vision + Speech/Language

News [Archive]

Sep 2023 Our paper on GestSync: Determining who is speaking without a talking head accepted to BMVC 2023 (ORAL)
Jul 2023 Participated in the International Computer Vision Summer School (ICVSS)) at Sicily, Italy. Had an eincredible experience of learning from some of the most distinguished computer vision experts!
Oct 2022 Joined the Visual Geometry Group (VGG) at the University of Oxford as a PhD student with Prof. Andrew Zisserman
Jul 2022 2 papers accepted to ACM-MM 2022
1] Talking-Face Video Upsampling 2] Lip-to-Speech Synthesis
May 2022 Successfully defended MS thesis :smile:
Thesis: Exploiting Cross-Modal Redundancy for Audio-Visual Generation

Recent papers [Full list]

  1. BMVC
    GestSync: Determining who is speaking without a talking head
    Hegde, Sindhu, and Zisserman, Andrew
    In British Machine Vision Conference (BMVC) 2023
  2. ACM-MM
    Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors
    Hegde, Sindhu, Mukhopadhyay, Rudrabha, Namboodiri, Vinay P, and Jawahar, CV
    In Proceedings of the 30th ACM International Conference on Multimedia (MM’22) 2022
  3. ACM-MM
    Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild
    Hegde, Sindhu, Prajwal, KR, Mukhopadhyay, Rudrabha, Namboodiri, Vinay P, and Jawahar, CV
    In Proceedings of the 30th ACM International Conference on Multimedia (MM’22) 2022