This Delhi-NCR Startup Enables Content Creators To Produce Audio In Minutes


Anant Tripathi, Chakrapani Mishra and Diwakar Mishra believed that voice to text and vice versa would be imperative in the world of content creation. This need had become more important during the COVID-19 pandemic.

While Anant is an alumnus of IIT Kharagpur, Chakrapani is an alumnus of IIT Bombay, and Diwakar is a graduate of Purdue University, USA. Friends since college, the trio decided to launch Speechmax in mid-2020. Delhi-NCR startup generates free voiceovers in Hindi and English for content creators.

Unlike other text-to-speech platforms, where voiceovers sound robotic, all of Speechmax’s voices sound natural and human.

Leveraging AI and natural language processing, Speechmax helps generate highly realistic instant voiceovers, which can be used in a variety of content including audiobooks, documentaries, videos, and more.

Founders of Speechmax

What is the startup doing?

Speechmax takes the hassle out of doing a voiceover by cutting the traditional commute from days to minutes, Anant explains.

On the Speechmax platform, content creators – with just a few clicks – can convert a script into a studio-quality human voiceover. The startup helps creators reduce time and content costs while maintaining production quality.

“Voices on Speechmax are AI avatars of professional voiceover artists in different languages ​​and voiceover styles. Video and audio content creators use our platform to create voiceovers in whatever script and style they choose, ”says Anant.

A typical voiceover generation platform is becoming popular thanks to the variety of voices and styles of storytelling it offers. Currently, Speechmax has three voice artists with different styles of voiceover.

Users can select an artist, style and provide a script, which is converted to audio in seconds. They can further refine the generated audio using an audio editor and the final audio file can be exported as MP3.

In fact, they can create video content for digital consumption as most of them host their creations on social media. “Soon we will be adding new voices, both vernacular and foreign languages ​​to make Speechmax a global platform for voice over needs,” says Anant.

“We want to empower content creators. We follow a freemium monetization strategy. You can try Speechmax for free and even download the audio for any use, ”he adds.

However, it was not all easy for the startup.

Building proprietary technology

Machine-assisted generation of audio – traditionally referred to as text-to-speech – has been around for a long time. But the quality of the audio generated by the machine was still below average.

“Speechmax has changed that. The startup’s proprietary technology deploys cutting-edge AI that not only generates natural human-like speech, but can also closely mimic prosody. That’s why the voiceover done on Speechmax feels so natural, ”explains Anant.

The startup hit a jackpot when its proprietary AI models produced sound that was virtually indistinguishable from a real human recording.

“We developed the technology in the midst of the pandemic, which fundamentally changed people’s lives to engage in digital content creation and catapulted demand for our product,” said Anant.

He adds: “More and more content creators are using digital solutions for production. Our technology makes it ideal for content creators and businesses looking for a solution to create voiceovers at scale.

Currently, Speechmax has 10 members in its team.

Market and future

According to a Markets and Markets report, the text-to-speech market was valued at $ 2 billion in 2020, estimated at $ 5 billion by 2026. The segment will experience a CAGR of 14.6% between 2020 and 2026.

The Delhi-NCR startup competes with platforms including Otter.AI, SoundHound, and Gnani.AI, among others.

Bootstrap since its inception, Speechmax will soon be deployed in other vernacular languages.

“We want to expand our reach to more geographies as we develop the product. Our plans also include the addition of foreign languages. We also want to create a marketplace for voice over artists on Speechmax. Our vision for Speechmax is to make it a one-stop solution when it comes to voiceover requirements, ”explains Anant.


YourStory’s flagship startup and leadership conference will be virtually back for its 13th edition from October 25-30, 2021. Sign up to receive updates on TechSparks or to express your interest in partnerships and speaker opportunities . here.

To learn more about TechSparks 2021, click on here.

About Ethel Nester

Check Also

Musician Oberon’s Cassette Tracks Get New Digital Life | Oberon’s review

An Oberon-based musician, composer and theater maker has had some of his treasured compositions remastered …

Leave a Reply

Your email address will not be published.