DeepL, renowned for its text translation tools, has launched a voice-to-voice translation suite. The move aims to cater to various scenarios, from meetings and conversations to training sessions, enhancing communication across languages.
The challenges lie in balancing latency with accuracy, but DeepL believes it can leverage years of experience in text translation to deliver quality results. Add-ons for platforms like Zoom and Microsoft Teams will allow users to hear real-time translations or follow translated text on screen. Mobile and web-based conversations are also supported, providing flexibility for both in-person and remote settings.
DeepL’s technology can adapt to custom vocabularies, making it ideal for industries with specific terminology. The company aims to develop an end-to-end voice translation model, potentially skipping the text conversion step altogether, which could revolutionize real-time translations.
However, competition from startups like Sanas and Camb.AI is fierce. Sanas modifies accents in real time for call center agents, while Camb.AI focuses on speech synthesis and translation for media companies. The race to dominate this space is heating up.







