Tajikistan announces the first national AI language model – Sorollm



13:27, today

The model can recognize not only standard Tajik but also its various local dialects. AI-generated image.

Tajikistan has taken a major step towards digital innovation by launching its first national artificial intelligence language model, Sorollm. Designed to understand and handle the full diversity of the Tajik language, the model can recognize not only standard Tajik but also its various regional dialects.

Developed by the research team at Zehnlab.ai, Sorollm stands out as the first neural network created specifically for the Tajik language. Unlike global AI models such as GPT and Llama, which offer limited or no support for Tajik, Sorollm is built from scratch to accommodate the language's unique syntax, rare vocabulary, and diverse pronunciation styles.

The project was officially presented to President Emomali Rahmon on June 25 at the opening ceremony of Tajikistan's first AI Computing Resource Centre. The event marked a milestone in the country's digital transformation and highlighted the importance of local technology solutions.

“Our goal was not only to make the model aware of Tajik, but also to capture the entire range of dialects, from the northern accent to the Pamir languages,” the developer said.

Going forward, the team plans to integrate multimodal capabilities so that Sorollm can handle not only text but also audio and video input. As part of ongoing development, the creators are inviting citizens to contribute by sharing information about local dialects via a simple Google Form, which can be accessed through the provided link.

With Sorollm, Tajikistan sets a new precedent for linguistic inclusion in AI, putting its language and cultural identity at the forefront of technological advancement.




