Ten Trillion Tokens: Making AI Work for Every Indian Language

Building the largest multilingual LLM dataset for Indian languages at People+AI

This article was written while working at People+AI

Click here to Read the full article on People+AI blog ->