Building Trusted AI at Scale: Cities, Startups & Digital Sovereignty – Keynote by Vivek Raghavan, Sarvam AI
20 Feb 2026 13:00h - 14:00h
Summary
Speaker 1 introduced Vivek Raghavan, co-founder of Sarvam, a company developing AI that understands India’s languages and context [1-3]. Raghavan opened by asserting that India is capable of training state-of-the-art models and delivering AI to a billion users [4-7]. He argued that short-term technical leads, such as model size or chip speed, are fleeting, whereas long-term national sovereignty in AI is essential and must be built domestically [10-13][19-20]. Raghavan warned that without indigenous AI, India risks becoming a digital colony dependent on foreign technology, emphasizing that AI is a core capability a country cannot forgo [21-27]. He highlighted India’s unique advantage of linguistic diversity (22 official languages and regional variation roughly every 50 km), which must be captured for AI to reflect the voice of the people [28-34]. The large, cost-conscious Indian market provides both the demand for AI and the need for low-cost, scalable solutions, as illustrated by the success of UPI and the potential for AI to improve public services affordably [35-42]. Sarvam’s strategy is a full-stack sovereign AI platform comprising home-grown models, applications, and infrastructure designed for Indian scale [42-46]. Its models are built from scratch without external data dependencies, yet aim to be world-class, state-of-the-art systems [46-52]. The SARAS speech model, trained on diverse Indian data, is claimed to outperform global competitors in Indian-language speech-to-text and text-to-speech quality [53-66][69-71]. Sarvam also offers a 3-billion-parameter vision model that surpasses larger international models on document digitisation and visual-grounding tasks, especially in Indian languages [84-90][91-92]. The company has produced several LLMs, including a small 32K-context model trained on 16 trillion tokens and a 105-billion-parameter model, both benchmarked as superior to comparable global open-source models [93-106].
These achievements were realised by a team of only fifteen young engineers, demonstrating the depth of talent available in India [112-119]. Sarvam is already deploying the models in real-time voice applications serving millions of minutes daily, supporting NGOs, content digitisation, and edge devices such as glasses, while building the compute infrastructure needed for nationwide rollout [120-135]. The discussion concluded that building sovereign AI is both a strategic necessity and a feasible path for India to lead in AI that serves its diverse population and economy [13-27][34-38][42-46].
Keypoints
Major discussion points
– AI sovereignty is essential for India’s future.
Raghavan stresses that reliance on foreign models would make India a “digital colony” and that long-term national security depends on building AI in-house, just as India created Aadhaar and the India Stack as open-source public infrastructure [10-13][14-19][24-27].
– India’s unique strengths make sovereign AI feasible and necessary.
He highlights the country’s linguistic diversity (22 official languages, regional variation every ~50 km) and massive, cost-conscious market that can drive demand for AI at scale [28-34][35-40].
– Sarvam is building a full-stack sovereign AI platform: models, applications, and infrastructure.
The platform is organized around three layers: home-grown models, AI-powered applications, and scalable infrastructure, designed to deliver “world-class, state-of-the-art” solutions entirely from India [42-46][47-52].
– Concrete AI breakthroughs demonstrate world-class capability.
• Speech-to-text (SARAS) and text-to-speech models trained on diverse Indian data outperform global competitors [53-65][69-71].
• Vision models (3-billion-parameter) excel at document digitisation and visual grounding, beating larger international models [84-89][90-91].
• Large language models, from a 32K-context model trained on 16 trillion tokens to a 105-billion-parameter LLM, achieve superior benchmarks against GPT-OSS, Gemini Flash, etc., while being trained entirely in India [92-106][108-110].
– Deployment focus: real-world applications and scalable infrastructure.
Sarvam already powers over a million minutes of multilingual voice conversations daily, supports NGOs, content dubbing, and is optimizing models for edge devices and custom hardware to deliver AI at “India scale and India cost” [120-136].
Overall purpose / goal
The discussion aims to convince the audience that India not only can but must develop its own sovereign AI ecosystem. By outlining strategic imperatives, showcasing Sarvam’s technical achievements, and illustrating tangible applications, Raghavan calls for continued investment and collaboration to ensure India’s independence and leadership in AI.
Tone of the discussion
The tone is consistently confident and rallying, beginning with a broad, patriotic appeal to sovereignty, moving into a data-driven exposition of India’s advantages, then shifting to a technical, demonstrative mode when describing models and benchmarks. It concludes on an optimistic, forward-looking note, emphasizing youth talent and the potential for even larger breakthroughs. Throughout, the speaker maintains an enthusiastic, persuasive stance without significant negativity or doubt.
Speakers
– Speaker 1
– Role/Title: Event moderator/host (introducing the keynote) [S1][S3]
– Area of Expertise:
– Vivek Raghavan
– Role/Title: Co-founder of Sarvam (AI company) [S4]
– Area of Expertise: Artificial Intelligence, sovereign AI, speech and language models, large language models, Indian language technology
Speaker 1 introduced Vivek Raghavan, co-founder of Sarvam, which is building artificial-intelligence systems that can speak India’s many languages and understand its local context [1-3].
Raghavan opened with a concise rallying cry: “India can train state-of-the-art AI models and deliver them to a billion users,” positioning AI sovereignty as a national mandate [4-7].
He argued that technical bragging rights, such as model size or chip speed, are transitory, whereas home-grown AI is essential to prevent India from becoming a “digital colony.” He cited his work on Aadhaar and the open-source India Stack as proof that publicly built, indigenous technology can scale to serve a nation [10-13][14-19].
India’s unique advantages make sovereign AI both feasible and necessary. First, the country’s linguistic diversity (22 official languages and dialects that shift roughly every 50 km) requires AI that can capture this variation [28-34]. Second, the massive, cost-conscious market creates demand for affordable, scalable solutions; the success of UPI illustrates how technology can achieve mass adoption while remaining inexpensive [35-40].
Sarvam’s response is a full-stack sovereign AI platform organised around three layers: indigenous models, AI-powered applications, and infrastructure built for Indian scale and cost [42-46]. All models are built from scratch with no reliance on external data, yet aim for world-class performance [47-52].
Speech – The SARAS speech-to-text model, trained on extensive Indian data, is best-in-class for Indian languages in blind tests. Its text-to-speech and dubbing capabilities also rank highest in the country, with dubbing that preserves speaker modality, offers precise duration control, and supports mixed-language output [53-66][69-71][133-135].
Vision – A 3-billion-parameter state-space model excels at document digitisation, language-layout understanding, visual grounding, and reading-order prediction, outperforming larger international models on both Indian-language and English tasks [84-90][91-92][136-138].
Large language models – Sarvam has (i) a compact model with a 32K-token context, trained on 16 trillion tokens for real-time multilingual conversation, and (ii) a 105-billion-parameter LLM, the largest trained entirely in India. The LLM is on par with most open-source and closed-source models of its class and superior to GPT-OSS 120B and Gemini Flash in benchmark comparisons [92-106][108-110][139-141].
The development of these models was enabled by a grant from the India AI Mission [142-144].
All of the above were achieved by a team of just fifteen young engineers, underscoring the depth of talent available in India [112-119].
Sarvam’s platform powers more than one million minutes of real-time voice conversation each day across eleven Indian languages; NGOs have generated a crore (10 million) minutes of calls in a single month, and the platform also supports content digitisation, translation, dubbing, and enterprise and government use-cases [120-130].
To reach every citizen, Sarvam is optimising models for edge deployment on smartphones and augmented-reality glasses [145-147], and is building large-scale compute infrastructure that can deliver AI at “India scale and India cost” [148-150].
In closing, Raghavan reiterated that AI sovereignty is not a luxury but a national mandate. It safeguards India from dependence on foreign technology, leverages the country’s linguistic richness and market size, and capitalises on its youthful talent pool. He called for continued investment, policy backing, and collaborative effort to expand the sovereign AI ecosystem, positioning India to lead globally while serving its diverse population [151-155].
I move on to our next keynote speaker, who is Mr. Vivek Raghavan, the co-founder of Sarvam, a company building AI that speaks India’s languages and understands India’s context. In a world dominated by models trained on English-language data, their work is a powerful demonstration that sovereign AI capability is not just a luxury, it is a necessity. So, ladies and gentlemen, please welcome Mr. Vivek Raghavan, co-founder of Sarvam.
I come here to say that India can. And I think that’s the message I want to say. India can. And India can train state-of-the-art models, bring AI to a billion Indians, and do it all. And that’s really the message of why we started Sarvam. I want to talk about the long arc. You know, today the world is moving so fast. Everybody talks about who has the largest model or who has the fastest chip. These are all transitory technical advantages. In the long run, it’s sovereignty that matters. And unless we build these things ourselves, we will be left behind in the race. In the past 15 years of my life, I worked on building Aadhaar.
Which is India’s digital identity program. Prior to that, many of these technologies were proprietary technologies. And we built this kind of self-created technology that is open source and a public infrastructure that is available to all of us. And that led to the creation of the India Stack. So when you look at it over the course of long periods of time, sovereignty will always trump technical leads that are short term. We have a mandate to build. It is not an option whether we want to build these technologies. AI is a technology that has impact on every single aspect of human life. And it’s a core technology that a country like India must have.
And it’s a core technology that a country like India must understand from the foundational level. Otherwise, we will become a digital colony which is dependent on other countries for this core technology. It’s not an option. It is something that we must do. And we have unique advantages. And our unique advantage is actually our diversity. We have so many languages. We have 22 official languages. And in fact, the way people speak in our country changes every 50 kilometers. And that diversity must be captured if we have to understand the voice of the people. And therefore, if we build AI from India, it must acknowledge that diversity. The other thing, of course, is we are a huge economy.
There is demand. And if AI is there to help the citizen do everything, we can be one of the largest consumers of AI in the world. That demand is there, and then we have to build. We know that we are a cost-conscious country, right? Everything needs to be at the lowest cost. So we need to build efficient AI that can actually be delivered at scale, so that the last person in the country can have a better experience, right? Today, if you look at UPI, one of the great success stories of the past decade, it is for the first time we feel that, in India, things can be better than everywhere else in the world.
But AI done the right way can make sure that every service to citizens is the best and the cheapest, done in the best possible way for the country. That’s the promise of AI, and that’s why we said we need to do this in India. And I think it’s not about, I mean, when you look at AI companies globally, they are massive companies. But in the end, we have to show a new model where we can actually build AI which helps everyone, and then we can win for the people, and our model can be adopted in the world. That is my belief of where we need to go on this thing. So Sarvam has been building India’s full-stack sovereign AI platform, and we work with developers. Fundamentally, India is a country of developers, we have more developers, and we work with enterprises and we work with governments. And we have a full-stack platform, which I’ll talk about. In fact, the full-stack platform consists of three different things. One is models: we need models that are built in India. And that is the key thing, and that’s what we’ll focus on, sovereignty and models.
And then we are focusing on applications. Applications is AI for everyday tasks, for making things better for people. And finally, we’ll talk about infrastructure, and infrastructure at India scale. The first thing I want to talk about are sovereign models. Rule number one is they are built from scratch. They are not dependent on any other model that is there in the world. There is no data dependency on anyone else. But at the same time, the focus is these are world-class, state-of-the-art models that we have. So I’m going to talk a little bit about some of our models. The first model is actually the SARAS model, which is the speech model which helps recognize speech in Indian languages.
This is native. In fact, if you see, this is actually best in class in Indian languages compared to any other global model. These are extremely small models, but they have been trained with significant amounts of diverse Indian data, which actually leads to better performance on Indian voices.
So let me just play an iconic moment in India, diarized using our model. Oh, sorry. Okay, I don’t think I can make this happen. Okay. The audio, it’s not playing. Let’s maybe we’ll come back to this. “And here we have a majestic liftoff of the LVM3 M4 rocket carrying India’s prestigious Chandrayaan-3 spacecraft.” We want to create models which are naturally expressive Indian voices, and they have low-latency streaming and actually production-grade quality.
So, in fact, our models, in blind tests, are actually the most preferred voices in Indian languages. And this is compared to all the global competitors, such as ElevenLabs and Cartesia, etc. And we have actually the most preferred text-to-speech model in the country. We also have a dubbing capability, which actually preserves the speaker modality, has precise control over duration, and supports mixed-language output. In fact, compared to any other model in the world, we are the most preferred as far as dubbing is concerned. I will show a small snippet of what happens here.
We have the remaining 13 bits, in which the 12th bit is called the small-a bit. Then the remaining 6 bits are compute bits. These 3 are the destination bits, and these are the jump bits. So we’ve also built these vision models. And these vision models are actually very good for document digitization.
They’re very good at language-layout understanding, visual grounding, and, in fact, finding intelligence in visual components. And finally, reading-order prediction. In fact, the vision model that we built is only a 3-billion-parameter state-space model which beats all other models in the world, not just in Indian languages, but in English as well. Many of these other models are orders of magnitude bigger than our models, and still we are able to get world-class performance. Of course, in Indian languages, we are far and away ahead of the global models in comparison. Now we come to talk about some of the LLMs. India has started the training of LLMs from scratch.
And this was done through a grant from the India AI Mission, a GPU grant without which it would not have been possible for us to train these kinds of models. In fact, it’s an extremely small model which can run on a single GPU. It has a 32K context length and is trained on 16 trillion tokens. And it has extremely efficient thinking, which actually gives better answers with lower… And the focus of this model is real-time conversational applications: it will be able to generate conversations in all the Indian languages in a real-time system. And some of the benchmarks show, compared to global models of similar size,
such as Qwen 30B or GPT-OSS, it is far superior in terms of various parameters, such as fluency, language and script, and usefulness and conciseness. So the important thing is that, at its size, it is again a global best. And then we come finally to our largest model, which is the 105-billion-parameter model. This is the largest LLM trained from scratch in the country. And it is basically on par with most open-source and closed-source models of its class. And it can handle various kinds of complex reasoning tasks, as well as web searching and things like that. So it’s actually a fairly complex, fairly advanced model, which again works in all…
all Indian languages. From a benchmark perspective, these models, again, compared to things like GPT-OSS 120B as well as Gemini Flash, are actually superior in terms of the kinds of outputs they can generate. So this is really an example. While this is quite a small model, just to give you an idea, last year we had DeepSeek R1, which was launched at 670 billion parameters. The numbers that we are getting are actually superior to what DeepSeek R1 had last year. Of course, the state of the art has also improved. But the goal is to show that India can build these things. And I want this. The most important thing that I want people to understand is…
just because, and I think that, you know, I would love that not just us, but many other people come and show that we can actually build world-class models from India, because that is the fact. And these models were built with a team of just 15 young people. And really it’s the youth of India that have actually made this model what it is. I’m just the spokesperson. These young kids have actually made something like this happen. And if these kids can do it, we have so much talent in the country. And I’m very positive that, given the right kind of support in the way that we have been given, much bigger things can happen.
Moving beyond our models, we actually want these models to become useful, and so therefore we build applications. And I’ll talk very briefly about the kinds of applications that we build. So, in fact, we have an ability to actually converse, and this is our real-time voice conversation. We do more than a million minutes of voice conversation in 11 Indian languages every day using Sarvam. So these models are actually being used to build things. These models, which have been trained fully in our control, are now being used for conversations across enterprise and government use cases. In fact, in the last month, we took about 20 NGOs and actually did a crore minutes of calling within a month, to really understand what the real-time voice conversation is and what people actually are saying on the ground.
Okay. So we actually have ways by which we can make this available for work tasks and enterprise tasks. And we also have the ability to do this for content: digitizing books, translating books, dubbing videos. These are all studio products that we have. And finally, I want to end with infrastructure. We are actually doing many interesting things. We are making our models smaller to work on the edge, to work on phones. Many of you may have heard that we have also launched some of these glasses, so that we are able to have these models run on different form factors and capture the intelligence, capture the voice of India, at every point.
And finally, for these things to work, you need to have compute at a large scale and the ability to actually very efficiently deliver all these models to India at India scale at India cost. Finally, we are, of course,
“Home-grown AI is essential to prevent India from becoming a ‘digital colony.’”
The knowledge base includes a statement that India must develop its own core technology or risk becoming a digital colony, echoing Raghavan’s point [S14].
“India has 22 official languages.”
A source explicitly notes that India has 22 official languages [S56].
“UPI demonstrates that technology can scale rapidly and affordably for a massive Indian market.”
The knowledge base cites UPI processing over 20 billion transactions monthly and showing scalable, inclusive technology [S57].
“India’s massive, cost‑conscious market makes sovereign AI both feasible and necessary.”
Another source highlights India’s status as the world’s strongest growth market where AI’s deflationary nature aligns with development needs, providing economic context for the claim [S53].
“Building sovereign AI is a national mandate to preserve cultural and technological independence.”
A discussion of India’s need for its own foundation models frames the issue as cultural preservation and national capability, adding nuance to the sovereignty argument [S8].
“Sarvam’s platform relies entirely on indigenous models without external data.”
The knowledge base mentions the broader Indian push for heterogeneous compute and sovereign AI capabilities, underscoring the strategic emphasis on indigenous model development [S19].
“India’s linguistic diversity creates a need for AI that can handle many languages and dialects.”
A source notes India’s multilingual environment (22 official languages) and the importance of multilingual collaboration for AI, providing additional background [S22].
The discussion shows clear alignment between the introductory speaker and the keynote on the need for indigenous, sovereign AI for India, framing it as a strategic imperative to avoid digital dependence. Beyond this core point, there is little overlap on other themes such as linguistic diversity, market size, or specific technical achievements.
Moderate consensus limited to the sovereignty argument; this shared stance reinforces policy momentum for building a national AI ecosystem but indicates divergent focus on implementation details and broader AI applications.
The transcript shows strong alignment between the introductory remarks and Vivek Raghavan’s keynote. Both emphasize the strategic need for sovereign, indigenous AI to avoid dependence on external technology and to serve India’s linguistic diversity and large market. No substantive contradictions or opposing viewpoints are evident.
Minimal – the speakers are largely in consensus, indicating a unified stance on the importance of building indigenous AI capacity. This coherence suggests that policy discussions around AI sovereignty in this context are likely to progress without major internal contention.
The discussion was driven by a series of strategically placed insights from Vivek Raghavan that moved the audience from a broad, ideological stance on AI sovereignty to concrete demonstrations of technical capability, social impact, and future deployment. Each pivotal comment reframed the conversation—first by positioning sovereignty as a national imperative, then by leveraging India’s linguistic diversity, emphasizing cost‑effective design, insisting on building models from scratch, showcasing a world‑class large model built by a tiny team, evidencing real‑world usage, and finally envisioning edge‑centric, ubiquitous AI. These moments collectively shifted the tone from abstract advocacy to demonstrable achievement and forward‑looking ambition, deepening the audience’s understanding of how India can achieve independent, inclusive AI at scale.
Disclaimer: This is not an official session record. DiploAI generates these resources from audiovisual recordings, and they are presented as-is, including potential errors. Due to logistical challenges, such as discrepancies in audio/video or transcripts, names may be misspelled. We strive for accuracy to the best of our ability.