IIT-Madras faculty develop AI models to process text in 11 Indian regional languages - Hindustan Times
close_game
close_game

IIT-Madras faculty develop AI models to process text in 11 Indian regional languages

Madras | ByPress Trust of India| Posted by Nandini
Sep 23, 2020 09:03 AM IST

Faculty at the Indian Institute of Technology Madras (IIT-M) have developed Artificial Intelligence (AI) models and datasets to process texts in 11 Indian languages, the premier institute said on Tuesday.

Faculty at the Indian Institute of Technology Madras (IIT-M) have developed Artificial Intelligence (AI) models and datasets to process texts in 11 Indian languages, the premier institute said on Tuesday.

(Getty Images/iStockphoto)
(Getty Images/iStockphoto)

The initiative was taken up jointly with “AI4Bharat,” a platform for building AI solutions for problems of relevance to the country, a release from IIT-M said here.

HT launches Crick-it, a one stop destination to catch Cricket, anytime, anywhere. Explore now!

The open source tool, completely free of cost, can be downloaded from https://indicnlp.ai4bharat.org/ “The multilingual AI models and datasets developed through this initiative will provide the essential building blocks to students, faculty, start-ups and industry to work on Indian language tools and push the frontiers of technology,” it said.

Researchers from IIT Madras and AI4Bharat released AI models and datasets for Tamil, Hindi, Malayalam, Telugu, Kannada, Punjabi, Bengali, Odia, Assamese, Gujarati and Marathi.

According to Mitesh M Khapra, Assistant Professor, Department of Computer Science and Engineering, IIT-M, as the country moves towards a digital economy, it is important that Indian languages find a space online.

“This requires a lot of innovation in creating input tools, datasets, and AI models for Indian languages,” he said.

“For example, imagine a learner who posts a question on an e-learning platform in Tamil or Hindi or any other numerous Indian regional languages. There is a need for tools that can automatically process such questions written in Indian languages and classify them into specific topics,” he said.

Such tools were already available for English and other foreign languages but not for Indian ones, Khapra added.

AI4Bharat is an initiative co-founded by Khapra and Pratyush Kumar, Assistant Professor, Department of Computer Science and Engineering, IIT Madras and works to solve India-specific problems in a community-driven, open-sourced manner, the release added.

Kumar said the initiative “is one of the few attempts in academia” to develop and publicly release large scale multilingual AI models containing millions of parameters trained on billions of tokens from 11 Indian languages, completely free and open-source.

Are you a cricket buff? Participate in the HT Cricket Quiz daily and stand a chance to win an iPhone 15 & Boat Smartwatch. Click here to participate now.

Discover the complete story of India's general elections on our exclusive Elections Product! Access all the content absolutely free on the HT App. Download now!

Get latest news on Education, UP Board Result LIVE, UP Board 10th Result 2024, UP Board 12th Result 2024 along with Board Exam, Competitive Exam and Exam Result at Hindustan Times. Also get latest Job updates on Employment News
SHARE THIS ARTICLE ON
Share this article
SHARE
Story Saved
Live Score
OPEN APP
Saved Articles
Following
My Reads
Sign out
New Delhi 0C
Sunday, April 21, 2024
Start 14 Days Free Trial Subscribe Now
Follow Us On