33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025, İstanbul, Türkiye, 25-28 June 2025, (Full Text Paper)
Lip synchronization is a fundamental component of speech-driven applications, ranging from virtual reality and human-machine interaction to forensic analysis and cybersecurity. Traditional signal-processing-based methods for lip synchronization struggle with accuracy, real-time performance, and language independence. This study benchmarks a statistical signal-processing-based lip synchronization approach against modern machine-learning-based tools. By leveraging high signal-to-noise ratio (SNR) audio recordings and their transcripts, we enhance the conventional algorithm with AI-driven models. Integrating signal processing with machine learning and deep learning enables more precise, natural, and language-agnostic lip synchronization. The results underscore the impact of AI on speech processing technologies and set the stage for future advances in multimodal communication.
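As a point of reference for the high-SNR criterion mentioned above, the minimal sketch below shows one common way to estimate SNR in decibels when a clean reference signal is available; the function name, the synthetic test tone, and the noise level are illustrative assumptions, not part of the paper's pipeline.

```python
import numpy as np

def snr_db(clean: np.ndarray, noisy: np.ndarray) -> float:
    """Estimate SNR in dB, treating (noisy - clean) as the noise component."""
    noise = noisy - clean
    signal_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    return 10.0 * np.log10(signal_power / noise_power)

# Illustrative example: a 1 kHz tone sampled at 16 kHz with additive white noise.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2 * np.pi * 1000 * t)
noisy = clean + 0.05 * rng.standard_normal(t.shape)
print(f"Estimated SNR: {snr_db(clean, noisy):.1f} dB")  # roughly 23 dB here
```

In practice the clean reference is usually unavailable, so SNR would instead be estimated from speech-presence statistics; the paired-signal form above is simply the textbook definition made concrete.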