Explained: What are foundational models, examples of common ones?

Foundational models are a type of artificial intelligence model that can perform a wide range of tasks

Shivani Shinde Mumbai
Last Updated: Jan 23 2025 | 11:51 PM IST
The debate on whether India should have its own foundational models or build large language models (LLMs) is intensifying, with IT Minister Ashwini Vaishnaw stating that the government is exploring indigenous AI models. Adding to this, Perplexity CEO Aravind Srinivas emphasised that India should create its own foundational model. In a recent post on X, Srinivas said he disagrees with Nandan Nilekani's view that India should not develop its own LLMs.
 
What are foundational models? 
Foundational models are a type of artificial intelligence model capable of performing a wide range of tasks. They are trained on vast and diverse datasets, which allows them to be used across many applications. AI models have existed for some time, but earlier versions were specialised tools trained for specific applications.
 
According to Amazon Web Services (AWS), the term "foundational model" was coined by researchers to describe machine learning (ML) models trained on broad, generalised, and unlabelled data. These models can perform a variety of general tasks, such as understanding language, generating text and images, and engaging in natural language conversations.
 
Which are some of the common foundational models? 
Indian entrepreneurs and businesses are using AI engines or foundational models developed by OpenAI, Microsoft, Google, and Meta, among others. In India, Ola is building Krutrim, an LLM, from scratch.
 
What are some of the challenges in building foundational models? 
There are two critical elements for building foundational models: first, the compute power or GPUs needed to create powerful servers, and second, the investments required. As Satya Nadella, chairman and CEO of Microsoft, said during his recent visit to India: “India must get into frontier work in artificial intelligence and build foundational models, but investment is a real entry barrier, and just one mathematical breakthrough can change the entire dynamics.”
 
The Indian government’s stand 
Ashwini Vaishnaw, Minister of Electronics and Information Technology, told Business Standard in an earlier interview that the country is focused on building its own GPUs. The target is to have a GPU built in India within the next three to five years. In a recent interview with CNBC TV18 at Davos, he stated that India is working on preparing datasets for training AI models, leveraging large pools of non-personal data, such as transport, agriculture, and weather datasets.
 
Why the debate now? 
One reason for the current debate is the recent advancement by DeepSeek, a Chinese startup that unveiled DeepSeek V3, an LLM with 671 billion parameters. Srinivas of Perplexity has argued that India should focus on creating foundational models for Indic languages while remaining competitive on global benchmarks.
 
First Published: Jan 23 2025 | 11:51 PM IST
