# This startup gives your speech a new ‘human-realistic’ AI voice, for free

From virtual assistants to voiceovers for audiobooks, AI voice generation has emerged as a rapidly growing field — and it’s no wonder that companies are rushing to tap into the technology’s potential.

Among them is Valencia-based Voicemod. The startup has developed AI voice-changing and soundboard software that enables instant speech-to-speech conversion. The company claims that, unlike most of its competitors, it transforms voices in real time and with low latency, enabling users to converse as they would in real life.

According to Jaime Bosch, Voicemod’s CEO and co-founder, the company trains its AI model using publicly available data sets and professional voice actors, which results in a broad pool of vocal expressions, pitches, tones, and emotions. Through machine learning techniques, the model learns to understand, analyse, and predict a person’s speech patterns and intricacies.

“When a user speaks into our software or application, their voice input is processed in real time,” Bosch told TNW. “Our AI model then applies the learned patterns and transformations to the input, allowing for instant voice conversion.”

Voicemod mainly targets the entertainment industry, including gamers, streamers, content creators, and VTubers on platforms ranging from Discord and Twitch to Zoom and WhatsApp.

To further address growing user demand for self-expression, pseudonymity, and creativity online, the startup is now launching its “AI Humans” collection, in addition to the 100 voice options already in its portfolio. Although Voicemod already offers human voice filters, the new collection is slated to be the company’s most human-realistic to date.