Ganga-1B — A pre-trained Hindi AI model developed at IITGN

The Language Research Group at the Indian Institute of Technology Gandhinagar (IITGN) has developed an artificial intelligence (AI) model for Hindi called “Ganga-1B,” which is a “breakthrough in language modeling.” Named after the longest river that flows through India, Ganga-1B is the first pre-trained Hindi model developed by an academic research institute.

“This effort aims to improve performance in understanding and generating text in Indian languages, with the first milestone being the release of the Ganga-1B model trained on an extensive monolingual Hindi dataset,” said Prof. Mayank Singh, Head of the Lingo Research Group, IITGN and Assistant Professor of Computer Science and Engineering.

The Ganga-1B model is based on datasets found in the public domain for Hindi language, including news articles, web documents, books, government publications, educational materials, and quality-filtered social media conversations.

“Project Unity aims to develop pocket-sized, open source Large Language Models (LLMs) for Indian languages, built and trained from scratch on Indian data. This effort will empower the Indian open source community to build LLMs and chatbots that can be trained and deployed in resource-limited scenarios,” Professor Mayank Singh told The Indian Express.

Ganga-1B, which has already been downloaded by over 600 people within 48 hours of its announcement, took about a year and a half to build using open source data from various websites.

Celebration Offer

The research team is working on models for other languages, including Gujarati, Urdu, Tamil, Telugu and Marathi, and is researching the use of AI in e-governance in regional languages, as well as a Masters in Educational Law course to support students and teachers in schools.
The dataset is further curated by native Indian speakers to ensure high quality.

First uploaded: 07 Sep 2024 05:29 IST

Source link

binance "oppna konto commented on Forget Ray-Ban Meta smart glasses. We tested cheaper ones that support ChatGPT.: Thanks for sharing. I read many of your blog posts
Binance账户 commented on The Smartest Man Who Ever Lived: Your point of view caught my eye and was very inte
打开Binance账户 commented on Top 10 Machine Learning Jobs with the Best Salaries in 2023: Your point of view caught my eye and was very inte
binance Registrera dig commented on Generative-AI-Jobs: Die 11 gefragtesten KI-Berufe: Thanks for sharing. I read many of your blog posts
create a binance account commented on WHOOP 4.0 review: Fitness tracker brand launches new AI features: Can you be more specific about the content of your

Ganga-1B — A pre-trained Hindi AI model developed at IITGN | Ahmedabad News

Leave a Reply

RECENT POSTS

How AI is turning weeks of cybercrime into days

DoorDash CEO says AI coding alone is not enough to improve engineering productivity

Learn AI basics for free and get certified – Letem svetem Applem

Related Posts

Leave a Reply