PakAlumni Worldwide: The Global Social Network

The Global Social Network

Pakistan to Develop Urdu LLM for Generative AI

National University of Science and Technology (NUST), National Information Technology Board (NITB) and Telecom network operator Jazz have signed a Memorandum of Understanding (MOU) to develop Pakistan’s first indigenous Large Language Model (LLM) with focus on Urdu, including datasets for Pashto and Punjabi languages. It is aimed at empowering individuals, businesses, and organizations with advanced AI tools in their native languages. The envisioned LLM is expected to drive innovation in Generative AI applications, boosting productivity and accessibility in critical sectors like healthcare, education, and agriculture.

GPT-4 Accuracy Scores. Source: The Economist

Generative AI tools such as ChatGPT are powered by large language models, or LLMs. These models need to be trained on vast amounts of data in specific languages to be useful. Unfortunately, the Urdu content of the Internet is less than 0.1%. This will present a challenge for the developers of Urdu LLMs.

Online Content of Various Languages. Source: W3Techs

Lack of Urdu content available for training ChatGPT affects the accuracy of the results for Urdu language users. For example, the GPT-4 accuracy score in question-answer tests in Urdu is just over 70%, compared with 85% accuracy score in the English language, according to data from OpenAI. Other South Asian languages, including Hindi, Bengali, Punjabi, Marathi and Telugu, suffer from the same problem.

It's not just a South Asian problem. These challenges exist in the developing world. Non-European languages are generally poorly represented online. It's a major obstacle for non-European nations in developing their own generative artificial-intelligence (AI) models, which rely on vast amounts of training data. Generative artificial intelligence (AI) can produce biased results due to a number of factors, including the data it's trained on, the algorithms used, and how it's deployed.

The use of AI in developing nations such as Pakistan will remain limited to a small number of people proficient in the use of the English language. Broadening the adoption of AI applications will require LLMs trained on local language content. The absence of this development could cost Pakistan the opportunity to take full advantage of the AI Revolution.

You need to be a member of PakAlumni Worldwide: The Global Social Network to add comments!

Join PakAlumni Worldwide: The Global Social Network

RSS

Welcome to
PakAlumni Worldwide: The Global Social Network

Sign Up
or Sign In

Pre-Paid Legal

Twitter Feed

follow me on Twitter

Live Traffic Feed

Please Bookmark This Page!

Blog Posts

India Tariffs: Is Modi-Trump Bromance Over?

President Donald Trump has imposed 50% tariffs on India's exports to the United States. This is far higher than most countries facing US tariffs. Explaining the punitive India tariffs, US Treasury Secretary Scott Bessent said: "India came to the table early. They’ve been slow rolling things. So I think that the president, the whole trade team has been frustrated with them. And also, you know, India, India has been a large buyer of sanctioned Russian oil that they then resell as refined…

Continue

Posted by Riaz Haq on August 9, 2025 at 12:30pm — 2 Comments

Pakistan Ranked Among Top Donors to UN's World Food Program

The United Nations World Food Program has ranked Pakistan fourth among donor countries and sixth overall in 2024. Among the largest 15 donors worldwide, the United States topped the list with $4.45 billion, followed by Germany ($995 million), the United Kingdom ($610 million), European Union ($593 million), private donors ($335 million), Pakistan ($228 million), South Korea ($203 million), France ($196 million), Sweden ($183 million), Canada ($166 million), Norway ($158 million),…