Speech Processing

Language Technologies

From voice to action: intelligent audio comprehension and generation

We develop solutions that enable interaction with spoken language even in difficult conditions (noise, accents, imperfect recordings).

  • Automatic transcription and subtitling: Real-time speech-to-text conversion that adapts to different acoustic environments and includes speaker recognition.
  • Voice synthesis and conversion: Creation of high-quality, natural-sounding artificial voices with advanced synthesis and conversion tools.
  • Detection of fake audio (deepfakes): Identification of manipulated or artificially generated audio, a crucial technology for security and fraud prevention.
  • Speaker verification: Identity authentication using vocal biometric fingerprinting to enhance security in access and communications.
  • Voice biomarkers: Processing of pathological speech and analysis of vocal patterns to detect health indicators or to aid in rehabilitation treatments.
  • Avatar animation: Extraction of vocal characteristics to animate avatars expressively with real and synthetic audio.
  • Emotion analysis: Detection of emotional state in the voice, useful for call centres or monitoring the customer experience.

Looking for support for your next project? Contact us; we look forward to helping you.

Vicomtech

Parque Científico y Tecnológico de Gipuzkoa,
Paseo Mikeletegi 57,
20009 Donostia / San Sebastián (Spain)

+(34) 943 309 230

Zorrotzaurreko Erribera 2, Deusto,
48014 Bilbao (Spain)
