The audiobook sector is undergoing a major transformation due to the influence of artificial intelligence (AI).
AI technology has reduced the business of many voice actors and narrators by allowing them to create human-like recordings quickly and cost-effectively. Professionals like his Tanya Eby, who has been in the industry for his 20 years, have seen their workload cut in half and their bookings lower than in previous years. Other factors may have contributed to this decline, but AI is believed to be a key factor.
AI-assisted recordings do not have a specific label, but thousands of existing audiobooks use database-generated ‘voices’. Companies like DeepZen are at the forefront of this technology, creating databases of recorded voices of different actors speaking in different emotional tones.
They adhere to ethical practices, obtain proper licenses, and pay royalties for the audio used. However, there are also concerns about unethical practices in the industry, such as some companies exploiting voice databases without proper authorization.
Traditional publishers, including US audiobook giants like Audible, are deploying generative AI that can create text, images, video and audio without human intervention. They envision a future where human performance and text-to-speech content can coexist. Tech giants such as Apple and Google are also entering the AI-narrated audiobook market with the aim of making audiobook creation more accessible to independent authors and small publishers.
AI offers potential advantages in terms of cost savings and access to a wider range of books, but critics argue that storytelling should continue to be a human endeavor. AI-generated recordings are often criticized for lacking the emotional connection compared to human performances.
There is concern that people will get used to the machine-generated version and gradually replace it with human narration. Some experts have stressed the importance of transparency, urging companies to disclose when AI-generated parts are being used.
Despite the concerns, some believe that the growth of the audiobook market will benefit both AI and human voice actors, leading to increased recording opportunities and recordings. They argue that AI technology could convert more books into audio format, which could bring economic benefits to voice actors.
Transparency and disclosure regarding AI-generated content is considered critical to maintaining ethical practices.
