From Human Potential to Global Impact: Qualcomm’s AI for All Workshop
20 Feb 2026 14:00h - 15:00h
Summary
The session opened with the moderator introducing Durga Malladi, Executive Vice President and General Manager of Technology Planning, Edge Solutions and Data Center at Qualcomm Technologies, as the speaker on AI’s economic potential [1-2]. Malladi outlined that he would review the current AI landscape from edge to cloud, noting that model sizes have shrunk dramatically while quality has risen, a trend he described as an emerging “AI law” that underpins the feasibility of edge AI [5-6][10-13][14-16]. He highlighted that today’s premium smartphones, AR glasses and PCs can run models with up to tens of billions of parameters, and that on-device inference delivers consistent AI experiences regardless of network connectivity while addressing privacy concerns for personal data [16-18][22-24].
Tracing the evolution of user interfaces, from command lines to mouse, touch and now voice, Malladi argued that multimodal AI agents can unify text, voice, video and sensor inputs into a single conversational interface, exemplified by a voice-first smartphone that authenticates the user and routes requests to the appropriate apps [26-33][40-45]. He described a new AI-first phone released in China that hides traditional apps behind an agent, illustrating how edge AI is moving from concept to consumer products [46-49]. Malladi emphasized a hybrid AI architecture in which the cloud handles foundational model training, on-prem servers run large inference workloads, and devices execute smaller models, enabling flexible distribution of processing across the network; the Humain PC prototype was presented as a device that dynamically decides whether a query should be processed locally or in the cloud, demonstrating seamless edge-cloud collaboration for a universe of wearables such as glasses, earbuds and watches [60-65][68-69][70-74][75].
Qualcomm supports developers through the Qualcomm AI Hub, which offers model selection, cloud-native device farms for testing without physical hardware, and deployment pathways to app stores. In data centers, the company pursues energy-efficient high-performance computing, distinguishes inference-optimized processors from training-focused ones, and introduced the AI 250 solution with an innovative memory architecture to improve token-generation speed [78-87][97-105][111-112]. Looking ahead, Malladi linked the upcoming 6G rollout, expected to be demonstrated around the 2028 Summer Olympics, to AI, arguing that tighter integration of cellular connectivity and AI will unlock new capabilities for edge devices and cloud services [114-134].
In the panel, Praveer Kochhar identified “shadow AI”, the widespread use of unauthorized AI tools on enterprise data, as an underrated pain point that threatens security and efficiency [175-182]. Madhav Bhargav recounted SpotDraft’s early mistake of trying to train a separate model for each client, which led to building a data-labeling pipeline and a Word plugin that now enable a single model to provide grounded answers from customer-specific data [196-209]. Shreenivas Chetlapalli stressed that setting realistic expectations about AI’s augmentative role and keeping processing local, especially in India where on-prem solutions like Orion are preferred, are key to adoption [219-224][254-255]. Participants also highlighted hardware constraints such as the need for continuous connectivity to manage remote robots, and the risk of excessive data leaving devices, advocating for minimal data transfer and synthetic data generation for training [304-308][313-320].
The discussion concluded with a consensus that by 2030 edge AI will be ubiquitous and taken for granted, emergent behaviors in large language models will begin to appear, and generative AI will fundamentally reshape user interfaces and applications across industries [376-382][392-396][403-405].
Keypoints
Major discussion points
– Edge AI is becoming practical and essential – Model sizes are shrinking while quality improves, allowing 10-billion-parameter models on smartphones, 1-2 billion on AR glasses, and 30 billion on PCs, which makes AI inference possible directly on devices and removes dependence on network quality and protects personal data [10-16][18-22][31-34][36-38][41-44].
– Qualcomm’s hybrid AI architecture spans edge, cloud, and data-center – AI workloads are distributed according to use-case: the cloud handles foundational model training, on-prem servers run large-scale inference with AI accelerator cards, and devices run smaller models locally. Qualcomm supports this with the AI Hub, AI 250/AI 300 memory-optimized solutions, and a focus on energy-efficient high-performance computing [60-68][70-73][96-106][107-112][114-119][120-124].
– 6G is positioned as the next catalyst for AI – The company links the evolution of cellular generations to AI potential, outlining a timeline that sees 6G trials around the 2028 Summer Olympics and broader deployments by 2029, arguing that tighter integration of AI and connectivity will unlock new use-cases [114-124][125-134].
– Panel highlighted real-world enterprise pain points and constraints – Participants discussed “shadow AI” (unauthorized AI use) as an underrated risk [175-182], the difficulty of building per-customer models in legal AI and the shift to data-centric pipelines [195-208], the trade-off between local vs. cloud processing in India and the need for clear expectations [218-226][254-259], and hardware concerns such as continuous connectivity for edge robots [304-308] and data-leakage risks [313-321].
– A vision of AI-driven, agentic interfaces and emergent behavior – Speakers imagined AI becoming the default UI, with agents that synthesize voice, text, video, and sensor data, generative UIs that auto-create apps or slides, and the emergence of AGI-like capabilities by 2030 [41-46][48-53][70-73][391-397][403-405].
Overall purpose / goal
The session was designed to showcase Qualcomm’s strategy for unlocking AI’s economic potential across the entire stack-from edge devices to cloud and data-center-while introducing developer-friendly tools (the Qualcomm AI Hub). The subsequent panel aimed to surface practical challenges, opportunities, and future directions from the perspective of startups and innovators building on-device AI solutions.
Overall tone and its evolution
– Opening – Formal, technical, and forward-looking as Durga outlines trends and Qualcomm’s roadmap.
– Mid-session – Becomes conversational and demonstrative, using concrete product examples (e.g., AI-first phone, Humain PC) and highlighting user-experience benefits.
– Panel – Shifts to a collaborative, candid tone; participants share real-world frustrations (shadow AI, regulatory uncertainty) and optimistic “wow” moments.
– Closing – Returns to a visionary, slightly speculative tone, emphasizing ubiquity, emergent behavior, and the transformative impact of AI on future interfaces. Throughout, the tone moves from informative to enthusiastic, then to reflective, and finally to aspirational.
Speakers
– Durga Malladi – Executive Vice President and General Manager, Technology Planning, Edge Solutions and Data Center at Qualcomm Technologies; expertise: Edge AI, data-center strategy, technology planning [S9].
– Siddhika Nevrekar – Senior Director and Head of Qualcomm AI Hub; expertise: AI developer ecosystem, on-device AI enablement [S6].
– Shreenivas Chetlapalli – Leads the Innovation Track for Tech Mahindra (AI, emerging technologies, blockchain, metaverse); expertise: AI innovation, emerging tech platforms [S13].
– Praveer Kochhar – Co-founder of Kogo AI; expertise: Private, sovereign agentic operating systems spanning edge to cloud, enterprise AI [S2].
– Madhav Bhargav – Co-founder and CTO at SpotDraft; expertise: AI for legal, contract review, drafting and negotiation [S4].
– Ritukar Vijay – Works in robotics and autonomous systems, focusing on edge AI, fleet orchestration and physical AI applications [S1].
– Moderator – Conference moderator (facilitates sessions and introductions); expertise: session moderation (no specific title provided).
Additional speakers:
– None identified beyond the listed participants.
The session opened with the moderator welcoming Durga Malladi (Qualcomm), Executive Vice President and General Manager of Technology Planning, Edge Solutions and Data Center at Qualcomm Technologies, and inviting him to discuss how Qualcomm is unlocking AI’s economic potential [1-2].
Malladi outlined a 25-minute programme that would trace the AI landscape from the edge to the cloud [3-7]. He highlighted a striking trend he called an “AI law”: model sizes have shrunk dramatically while quality has risen, exemplified by the move from the original 175-billion-parameter GPT model announced in 2022 to today’s 7-8-billion-parameter models that outperform it [10-13]. This reduction in parameters, he argued, is the technical foundation that makes on-device (edge) AI feasible [14-16].
The practical implications of this trend were illustrated with concrete device examples. Premium smartphones can now run 10-billion-parameter models, AR glasses can host 1-2-billion-parameter models, and PCs can handle up to 30-billion-parameter models without “breaking a sweat” [16]. Because inference occurs directly on the device, the quality of the AI experience is invariant to network connectivity [18-21], and personal or enterprise data can remain on-device, addressing privacy concerns [22-24]. Malladi traced the evolution of user interfaces, from command line to mouse, touch, and now voice, showing how a multimodal AI agent can ingest text, voice, video, and sensor inputs to become the primary UI [26-33]. He demonstrated a voice-first smartphone scenario where the device authenticates the user, launches an AI agent, and routes requests to the appropriate apps, effectively hiding the traditional app clutter [40-46]. A recent AI-first phone launched in China by Byte that presents only an agent interface and hides conventional apps was cited as evidence that edge AI is moving from concept to consumer product [46-49].
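Mechanically, the agent flow described here is an intent router sitting in front of hidden apps: authenticate the user, classify the utterance, dispatch. A minimal sketch of that pattern follows; every name and the keyword classifier are hypothetical stand-ins for illustration, not anything Qualcomm ships.

```python
# Minimal sketch of the "agent as the new UI" pattern described above:
# one conversational entry point authenticates the user, classifies the
# request, and dispatches to whichever hidden app can serve it.
# All names here are hypothetical illustrations, not a real API.
from dataclasses import dataclass

@dataclass
class Request:
    user_id: str
    text: str        # transcribed voice input
    location: str    # sensor-derived context

APP_REGISTRY = {
    "banking":  lambda r: f"Balance lookup for {r.user_id}",
    "shopping": lambda r: f"Searching stores near {r.location}",
    "calendar": lambda r: f"Next event for {r.user_id}",
}

def classify_intent(text: str) -> str:
    # Stand-in for a small on-device multimodal model.
    if "balance" in text or "bank" in text:
        return "banking"
    if "buy" in text or "shop" in text:
        return "shopping"
    return "calendar"

def agent(request: Request, voice_authenticated: bool) -> str:
    if not voice_authenticated:
        return "Sorry, I couldn't verify your voice."
    intent = classify_intent(request.text)
    return APP_REGISTRY[intent](request)  # apps stay hidden behind the agent

print(agent(Request("asha", "check my bank balance", "Bengaluru"), True))
```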
Malladi then described Qualcomm’s “hybrid AI” philosophy, which distributes workloads across devices, edge servers, and the cloud. The cloud is used for training foundational models, on-premise AI accelerator cards run large-scale inference (100-300-billion-parameter models) for SMEs, and edge devices execute smaller models locally [60-68]. He highlighted the Humain PC prototype, launched in Saudi Arabia, as an example of dynamic workload placement: the system decides in real time whether a query should be processed on-device or in the cloud [70-74]. Following this, he expanded the vision to a universe of wearables, glasses, earbuds, watches, and rings, each capable of local or remote AI processing [75].
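One way to picture that placement decision is a small routing function over query cost and privacy constraints. The tiers and thresholds below are invented for illustration, since the talk does not disclose how the Humain PC actually routes queries.

```python
# Illustrative sketch of hybrid-AI workload placement: keep private or easy
# queries on-device, send mid-size work to an on-prem accelerator, and
# reserve the cloud for the hardest requests. Thresholds are made up.
from enum import Enum

class Target(Enum):
    DEVICE = "on-device (~10B-parameter model)"
    ON_PREM = "on-prem accelerator card (~100-300B)"
    CLOUD = "cloud (frontier model)"

def place(query_tokens: int, needs_private_data: bool, complexity: float) -> Target:
    if needs_private_data:
        return Target.DEVICE     # personal data never leaves the device
    if complexity < 0.3 and query_tokens < 2_000:
        return Target.DEVICE     # handled "without breaking a sweat"
    if complexity < 0.7:
        return Target.ON_PREM    # large inference, still on premises
    return Target.CLOUD          # hardest queries go upstream

print(place(query_tokens=500, needs_private_data=False, complexity=0.9).value)
```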
To support developers, Qualcomm offers the AI Hub. As Malladi stated, “Qualcomm is not a model creator; we ingest models from any provider” [??-??]. The AI Hub lets any developer select an existing model, upload a new one, or have Qualcomm create a model from supplied data; it also provides free cloud-native device-farm access, testing without physical hardware, and app-store deployment [78-87].
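For developers who want to try that flow, here is a hedged sketch using the public qai_hub Python client (pip install qai-hub; a free AI Hub account and API token are required). The exact signatures and device names are from memory and may differ from the current documentation.

```python
# Hedged sketch of the AI Hub workflow described above: pick a model,
# compile it for a real device in the hosted device farm, then profile it
# remotely -- all without the hardware in hand.
import qai_hub as hub
import torch
import torchvision

# 1. Pick a model (a stock torchvision network, traced for export).
model = torchvision.models.mobilenet_v2(weights="DEFAULT").eval()
example = torch.rand(1, 3, 224, 224)
traced = torch.jit.trace(model, example)

# 2. Compile for a device in the cloud-hosted farm.
device = hub.Device("Samsung Galaxy S23")
compile_job = hub.submit_compile_job(
    model=traced,
    device=device,
    input_specs=dict(image=(1, 3, 224, 224)),
)

# 3. Profile the compiled model on the actual remote handset.
profile_job = hub.submit_profile_job(
    model=compile_job.get_target_model(),
    device=device,
)
print(profile_job)  # results are inspectable at aihub.qualcomm.com
```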
On the data-center side, Qualcomm emphasized energy-efficient high-performance computing. Inference-optimised processors differ from training-focused ones, and power efficiency is as crucial as raw compute [96-105]. A concrete contrast was drawn: a typical smartphone operates within a 4 W power envelope, whereas a state-of-the-art data-center rack consumes about 150 kW and relies on liquid cooling to manage heat [??-??]. The AI 250 solution, with an innovative memory architecture that alleviates the decode-stage bandwidth bottleneck, demonstrates Qualcomm’s focus on memory-centric optimisation; a second-generation AI 300 is already in planning [107-112].
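The decode-stage bottleneck can be made concrete with a back-of-envelope roofline estimate: generating each token streams roughly the full weight set through memory once, so bandwidth, not compute, caps single-stream decode speed. The numbers below are illustrative, not Qualcomm specifications.

```python
# Why decode is memory-bandwidth bound: an upper bound on single-stream
# decode speed is (memory bandwidth) / (bytes of weights read per token).

def max_decode_tokens_per_s(params_billion: float,
                            bytes_per_param: float,
                            bandwidth_gb_s: float) -> float:
    model_gb = params_billion * bytes_per_param  # weights streamed per token
    return bandwidth_gb_s / model_gb

# A 7B model with 8-bit weights on ~50 GB/s of bandwidth tops out near
# 7 tokens/s regardless of attached compute -- which is why the memory
# architecture, not raw FLOPS, governs the decode stage.
print(round(max_decode_tokens_per_s(7, 1.0, 50), 1))  # -> 7.1
```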
Looking ahead, Malladi linked the forthcoming 6G cellular generation to AI acceleration. He argued that 6G will provide the bandwidth and latency required for advanced edge AI, with technology trials slated for the 2028 Summer Olympics and broader roll-outs by 2029 [114-124][125-134]. This integration of AI and next-generation connectivity is presented as a catalyst for new use-cases across the device-to-cloud continuum.
The moderator then introduced the panel, highlighting the Qualcomm AI Hub as a tool for inclusive, scale-oriented AI development [144-150].
Rapid-fire answers – The panelists responded to a series of quick questions:
* 6G or AI? – Ritukar Vijay chose 6G (citing connectivity importance) [??-??].
* Data-center or local? – Shreenivas Chetlapalli chose local/on-prem [??-??].
* Artificial or human? – Madhav Bhargav chose human (lawyers must make final decisions) [??-??].
* Innovate or regulate? – Praveer Kochhar chose innovate (regulation will always lag) [??-??].
* Agent tech or robotics? – Ritukar Vijay answered agents [??-??].
* LLM or SLM? – Shreenivas Chetlapalli answered SLM [260-262].
* Integrations or automation? – Madhav Bhargav answered integrations (automation needs integrations) [??-??].
* Build a chip or buy a chip? – Shreenivas Chetlapalli answered sell a chip, but always build one [??-??].
The first panelist, Praveer Kochhar (Kogo AI), identified “shadow AI”, the unauthorised use of consumer AI tools on enterprise data, as an underrated pain point affecting 78% of organisations, raising security and compliance concerns [175-182]. Madhav Bhargav (SpotDraft) recounted an early failure: attempting to train a separate model for each legal client proved unsustainable, leading the team to build a Word-plugin data-capture pipeline that now powers a single, grounded model capable of answering client-specific queries [195-209]. Shreenivas Chetlapalli (Tech Mahindra) stressed that successful AI adoption in India hinges on realistic expectations, with AI augmenting rather than replacing human work, and on the growing trust shown by public-sector banks and government AI centres [218-226][254-255]. He also warned that excessive data exfiltration from devices increases breach risk, advocating for minimal data movement and the use of synthetic data to train models [313-314][317-320]. In contrast, Ritukar Vijay (Ottonomy) argued that the optimal data-flow depends on context: enterprise settings should limit upstream data, whereas B2C scenarios benefit from abundant data collection [321-325], highlighting a nuanced disagreement on data-leakage versus data-rich training.
A further point of consensus emerged around the necessity of a hybrid processing model. Malladi’s description of distributed AI workloads [60-68] was echoed by Vijay’s explanation that cloud orchestration handles fleet management while edge devices perform real-time navigation [237-238], and by Siddhika Nevrekar’s reminder that the AI Hub enables developers to test on cloud-native device farms, thereby supporting a hybrid workflow [145-148]. Connectivity was also a shared theme: Malladi noted that on-device inference makes the user experience independent of network quality [18-21], while Vijay highlighted that lack of continuous connectivity hampers remote robot management and keeps him awake at night [304-308]. Both speakers underscored that reliable connectivity is a prerequisite for effective edge AI services.
Several thought-provoking comments punctuated the discussion. Malladi’s observation that “model sizes are coming down dramatically while the model quality continues to increase” reframed assumptions about the necessity of massive models for useful AI [10-13]. He further envisioned AI agents as the new universal UI, consolidating multimodal inputs and personal knowledge graphs to replace app clutter [31-34][40-46]. Kochhar’s spotlight on shadow AI exposed a hidden governance risk [175-182], while Bhargav’s “wow” moment, when a skeptical internal lawyer demanded the source of a clause highlighted by the model, demonstrated AI’s capacity to surface hidden policy inconsistencies [333-336]. Kochhar’s warning that regulation will always play catch-up to rapid AI innovation added a cautionary note [276-281].
Key take-aways
(i) Shrinking model sizes enable practical on-device (edge) AI across smartphones, AR glasses, and PCs;
(ii) on-device inference guarantees consistent experiences and protects sensitive data;
(iii) a hybrid AI architecture optimises performance, cost, and energy use;
(iv) AI Hub streamlines model onboarding, cloud-native device-farm testing, and app-store deployment;
(v) shadow AI represents a significant, yet under-addressed, enterprise risk;
(vi) building per-customer models is inefficient, and data-capture pipelines can enable a single, grounded model;
(vii) Indian AI adoption benefits from clear expectations and growing public-sector interest;
(viii) robotics workloads require a clear split between edge navigation and cloud orchestration, with continuous connectivity being a critical hardware constraint;
(ix) 6G is expected to unlock new AI capabilities, with trials linked to the 2028 Olympics;
(x) regulation will lag behind innovation, necessitating responsible, innovation-first approaches [10-13][18-21][60-68][78-87][175-182][195-209][218-226][237-238][304-308][114-124][276-281].
Looking to 2030, panelists concurred that edge AI will become ubiquitous and taken for granted, much like everyday connectivity [376-384]. Emergent behaviours in large language models are expected to appear, signalling the early stages of AGI-like capabilities [392-396]. Moreover, generative AI is poised to reshape user interfaces, automatically creating screens, slides, and even whole applications, thereby lowering the learning curve for users in markets such as India [403-405].
Final pitch / where to find you
* Ritukar Vijay (Ottonomy) – “You can find us at Ottonomy; we help enterprises adopt physical AI and robot orchestration.” [??-??]
* Madhav Bhargav (SpotDraft) – “Talk to us about AI-enabled contract drafting and negotiation for legal teams.” [??-??]
* Praveer Kochhar (Kogo AI) – “Reach out to Kogo AI for a sovereign, edge-to-cloud agentic operating system.” [??-??]
* Shreenivas Chetlapalli (Tech Mahindra) – “Connect with Tech Mahindra for AI-driven fraud-call detection and the Orion on-prem platform.” [??-??].
The session concluded with Malladi emphasizing Qualcomm’s unique position of working across the entire AI stack-from doorbells to data-centres-allowing the company to influence every layer of the ecosystem [138-143]. Siddhika Nevrekar reiterated the role of the AI Hub in democratising AI development, and invited attendees to engage with the showcased startups for further collaboration [145-148]. This closing reinforced the overarching vision: a distributed, privacy-preserving, and developer-friendly AI ecosystem, accelerated by edge hardware, hybrid cloud-edge architectures, and the forthcoming 6G network, will drive the next wave of economic value from artificial intelligence.
To share how these pieces come together and how Qualcomm is unlocking AI’s full economic potential, it’s my privilege to invite on stage Durga Malladi, Executive Vice President and General Manager, Technology Planning, Edge Solutions and Data Center at Qualcomm Technologies. Please join me in welcoming Durga.
Okay, so we’re reaching towards the later half of the afternoon, and hopefully everyone had their lunch and their coffee. So I hope to talk over the next 25 minutes. I won’t take that much time, but about 25 minutes talking about what is going on in the AI landscape, from the edge all the way into the cloud. Starting from yesterday, there was a lot of discussion on the relevance of edge AI: what exactly is happening in that space, what should be the opportunities at the edge, and where we should be going in the cloud as well. So I’ll try to distill that in a few slides, and I’ll probably go through a little faster so that we have enough time later on for the team to actually go through the panel discussion.
All right, I’m just going to click through this. This is good. This is probably a good indication of why the edge matters. If you go back in time three years, when GPT was originally announced back in November of 2022, that was a very large 175 billion parameter model. And if you take a look at what the model sizes today look like, they’re more like 7 to 8 billion parameters, but they actually outperform that original model by quite a bit. Model sizes are coming down quite dramatically, while the model quality continues to increase. This is the equivalent of an AI law that seems to be emerging as far as models themselves are concerned. It’s an important trend line, because this actually is the foundation for why edge AI is actually a big part of the story.
And if you take a look at the actual model size, you’ll see that the model size is actually relevant. In other words, you don’t have to necessarily use the trillion-parameter models to be able to get through a large number of use cases that average consumers actually care about. And when you think about it that way, this is a depiction of, just in the last one year alone, how much progress has been made in terms of the model quality index itself. There are several parameters over here, but the punch line is: model quality is getting extremely powerful. And now the question is, what should we do about it? What can we actually build on top of it? So we’ve already established the fact that the model sizes are coming down. These are sometimes known as SLMs, though I would argue that it’s not just small language models; these are small multimodal models that are coming in. But there are increased capabilities coming with them: much larger context length, a lot of on-device learning and personalization that can be built upon that, and reasoning models which actually mimic what we typically expect to see from some of the larger models. When you put both of these together and build the right kind of an innovative architecture, that’s what actually leads to edge AI in devices that you and I care about. So, is it here? Is this just a PowerPoint presentation, or are there actual consumer devices where you can do edge AI? The answer is absolutely yes. In fact, today you can get any of the premium smartphones where you can easily run a 10 billion parameter model without breaking a sweat, or glasses which have up to a billion to 2 billion parameter models which you can easily run, PCs with up to 30 billion parameter models, and so on.
These are devices that you and I use very frequently, at least the PCs and the smartphones, with more people adopting AR glasses as well. But one thing that’s nice about running on-device AI, or AI inference that’s running directly on devices, is that the quality of the AI experience is invariant to the quality of connectivity that those devices have to the back end of the network. That is a key attribute. I don’t want to keep going back and forth between a regular experience and an AI experience just because I don’t have internet connectivity. That would not be very compelling for any of the consumer or enterprise use cases, and that’s key. The second part is there’s a large amount of data that happens to be very personal.
It can be consumer-centric or it can be enterprise-centric, but either way, I might or might not be interested in storing the data in the cloud. And if you kind of think about it that way, then that’s another vector that takes us towards what you can do at the edge. And as you put it all together, what exactly are we trying to do with the AI to begin with? Now, I was not around in the 60s or the 70s, well, I was there in the 70s, but I was not involved in, you know, what people used to do with very large mainframe computers, where there was just a command line interface: there is a gigantic machine in front of you and you just keep typing something onto it.
That was the user interface between a human and a machine. The 80s changed that with the advent of you use a mouse, you use a PC, there is a graphical interface, you actually get to see something, not just see a command line interface, that changes things. Fast forward to where we are today, about 20 years back, we started using touch as the main UI. We all have our smartphones which happen to be touch -based and increasingly laptops and tablets and these are places where the UI shifted from just using typing and using a keyboard to touch interface as well. Well, we are now at a different era now. It’s at a place where we now are increasingly using voice as an interface towards devices.
And if you put it together, you have a combination of different modalities, whether it is text or voice or video, any other camera interaction, some sensors which tell you exactly where you are located, provide some context to what you’re doing. All of that gets ingested by a single interface, an AI agent. Imagine the following. Let’s take a smartphone because one can easily relate to it. You have your smartphone. Right now, people are either looking at it or scrolling through their apps. We all have a clutter of apps on our phone today. If I wanted to use one app, I’ll have to click that one. If I wanted to then correlate that information with another app, I have to go back, then open up a new app and go in again.
Instead, and this is a future where all you have is a voice UI: the device is sitting somewhere, it’s in your pocket, you talk to it. Your voice gets authenticated and then it says, OK, I’m ready for you. How can I help you? That’s your agent right there. I would always love to say “talk to my agent”, but this is the beginnings of that. That agent distills all the information that you’re saying, encapsulates it, and maps it to apps that are running somewhere behind. The models only provide a means towards an end goal; they perform a job, but that’s not the end job by itself. So the agent actually picks one or two from a bouquet of models, and then also accesses some of the personal attributes that could be sitting right there; we call it the personal knowledge graph. When you put it all together, you end up seeing a glimpse into how AI can then become the new UI to all the devices around us, and this is a very powerful concept. Is this also just on PowerPoint? Till about last year, that was the case. Not anymore. Byte has introduced a new phone in China very recently, and it’s not available everywhere in the world; some of us do have the luxury of actually visiting China quite frequently. This phone is fundamentally different. It’s not just a new phone: it’s designed AI-first. All you have is an agent, by the way, and all the other apps are actually missing.
They’re somewhere in the back, but you don’t get to see them. And if you think about it, it’s a very disruptive mechanism. It’s still early. Of course, it’s going to be a little clumsy and it doesn’t work all the time in a picture -perfect manner, but it’s something that is beginning to change the conversation of how you take AI agents from something that happens to be in presentations to something that is far more practical in devices. So I’m going to just skip through this part of it. A lot of it is in Mandarin, so it’s kind of hard to see, but at the same time, you get the picture of how it can do things for you when you give it a very generic, nebulous task and it figures out exactly what is it that you need and then does things for you.
It’s like shop something for me, check my bank balance. If I have enough over there, I want to buy that thing and then when it is done, do let me know. It does it. You actually don’t know it’s happening, but it actually does it. All right. So far, we talked about the edge. What about the cloud? Well, a lot of the data actually comes in from the edge. it’s the consumers who are actually generating the data. That’s where the AI action really is. But the cloud has an important role to play as well, as the data actually gets used both between the edge and in the cloud. And so our philosophy over here is to make sure that we have AI processing that is distributed across the network depending upon what the use case is.
For instance, the cloud is extremely powerful for training foundational models, creating new kinds of models. That’s very helpful. At the same time, there’s a large number of enterprises where you have on-prem servers where, using air-cooled cards, it’s very easy to run 100 to 200, 300 billion parameter models. Very useful for small and medium enterprises which don’t necessarily have to rely on the data center. Just buy a server, plug in an AI accelerator card, maybe a handful of them, and you end up with extremely sophisticated processing. And keep in mind, in the beginning we talked about the fact that model sizes continue to actually come down while the quality continues to improve. So whatever you have, if tomorrow there’s a new model that comes in or you just want to replace your existing AI accelerator card, take out one, plug in another one, as opposed to rolling in a new rack. Fundamentally different in terms of the network economics.
And finally, we just talked about devices as well. So bottom line is, when you think of AI processing as hybrid AI, it’s a mix and match of processing between devices, the edge cloud, and the data center. And speaking of what is it that you can do with it, imagine the following. This is one of the PCs that was launched in Saudi Arabia. It’s called the Humain PC. We had a lot to do with it. It’s a place where, in fact, the only interface is what you see in front of you. This is not a standard PC which you open up and you have the regular kind of a screensaver and all the other apps that are there, and you open up your, you know, your mail client, your calendar, and so on.
You ask a question, and in real time, it doesn’t matter what it is, it decides: should I run it on the device or should I run it on the cloud? Maybe some questions that you ask are so complicated, I want to run them on the cloud, and the other questions, yeah, without breaking a sweat I can just run them on the device. And this is a place where you actually switch back and forth between what’s running on the device and what runs on the cloud. It’s the beginnings of where we can go with it. Another step: when we actually talk about devices, we all have a universe of devices around us. Glasses, which could be connected directly to the network tomorrow; today they are tethered through a phone. Your earbuds. Your wearables; it could be a watch that you’re wearing, and increasingly a ring as well. I think they’re running out of places where you put devices, but every time I think that, there is a new device that comes up; already we’ve reached four. This is like a universe of devices around you, and perhaps the hub happens to be a phone. How do you actually go back and forth between these? I wouldn’t even probably want my smartphone with me. I want to keep it somewhere, just have my earbuds and constantly talk to my phone, and do some of the processing perhaps in my earbuds itself, the rest of it on the phone, and some of it on the edge server and the rest of it in the data center.
That is the vision of how the evolution of AI ought to be. Speaking of the number of things that we just discussed, it’s important. This is now more from a Qualcomm perspective. We have made sure that we have a good, easy way for developers to onboard our platforms, bring in their applications, their platforms and actually run from there. And in the subsequent session, as we go through that, there might be a little bit more talk about it. But suffice it to say, if you go to the Qualcomm AI Hub, it’s a place where any developer can pick a model, bring a model. Or if you don’t have a model, we’ll create one for you if you bring your data.
Once you do that, we’ll give you free cloud-native access to a device farm, which exists somewhere, but you don’t need the hardware. You just have an IP address that you log into, and you take it from there. And the rest of it is: you write your application. You have the ability to test it without ever having the device actually in your hand. If you’re comfortable with that, you get to deploy that app out there in any kind of an app store. A very powerful concept that we’ve actually worked on for a long time. And this is a place where, you know, we are not a model creator. We ingest models, which means we work with every single model provider out there on the planet, and happy to actually discuss a lot more offline as it comes to it.
All right. How am I doing on time? Maybe I have 10 minutes. So let me talk a little bit on data center. I don’t see the timer here; that’s why I was asking. So what happens in data centers over here? Well, one thing that’s clear is that data center capabilities are becoming more and more sophisticated. And as we learned a lot of lessons from the edge, one thing that became very clear for us is that it’s important to pay attention to energy efficiency in addition to performance. So we call it energy-efficient, high-performance computing. And we started bringing that sort of a paradigm into the data center. A few other observations came in.
One is that the processors that are designed for training are not necessarily the best processors for inference. They’re actually different kinds of problem statements. It’s a little more subtle, but once you understand that, you get past the whole notion of “let’s just buy the biggest GPU that’s out there”, and you realize it’s a little bit of an overkill when it comes to the inference task that you might have. It’s a different architecture that’s needed. The second part is that we want to make sure that, in addition to the rollouts that are currently occurring, we bring in solutions which would lower the total cost of ownership. So when we put it together, we introduced our family of solutions in the data center as well, learning from what we learned in devices, and then bringing those lessons into the data center.
A smartphone today operates at four watts at best. The battery inside a smartphone is 4,500 milliamp-hours at best. In a data center, if you buy a state-of-the-art rack, it’s about 150 kilowatts. Fundamentally different. It’s directly liquid-cooled; you need water. There’s no water-cooled or liquid-cooled kind of smartphone over there. Two different universes, but there is a way to learn lessons from one universe and actually apply them on the other side. I would argue that in AI terminology, that’s transfer learning that you seriously apply going from devices all the way into data centers. So we entered that space, and we have two different categories of solutions. The second one, AI 250, is a place where we focused on an innovative memory architecture. As it turns out, and it’s a little more of a subtle argument here, when we talk of inference, the pre-fill stage is extremely compute-bound: the more computation horsepower you throw at it, the better it is; tokens per second is higher. However, the decode stage is fully memory-bandwidth bound: you can throw as much compute as you want, it makes zero difference whatsoever. So the memory architecture is actually equally important, and so we innovated on that, putting it together for our AI 250 solution. This is the one that’s actually rolling out in the Middle East, and this was part of that earlier demo that we just talked about, with a PC and something else that’s running in the cloud. We have an annual cadence that’s coming up.
This is table stakes at this point in time, with the innovative memory architecture continuing into the second generation by the time we get to AI 300, which is not yet announced, but something that is in planning. Now, finally, and I want to actually move a little faster here. There is a buzz in the industry about the next generation of cellular platforms, and usually one would scratch their head and say, wait a minute, we just launched 5G. I don’t know exactly why we’re talking of 6G over here. And besides, isn’t this all about AI? What does AI have to do with 6G? Are we just throwing AI pixie dust on top of every technology right now and simply saying there’s a hype cycle associated with it?
That’s not the case. It is true that cellular communications and AI have evolved as two parallel sets of innovation. But the time has come to actually put both of those together, because cellular technology at the end of the day does involve the very same devices that we just talked about. It involves a network through which all the devices are connected. The data goes through and eventually goes into a data center as well. So we have a view in terms of how 6G can unlock the full potential of AI. And, you know, if you think about how the “G” transitions occur, it’s only 10 years or so apart. The earliest 5G launches were in 2019, so we are in year seven of the journey.
It’s not that far off. And it turns out we have a convenient Summer Olympics that’s coming up right next door. We’re based in, I mean, our headquarters is in San Diego. That’s where I live. And there’s the 2028 Summer Olympics. So there’s going to be a lot of show and tell in terms of what 6G capabilities can be. And there’ll be technology trials at that point in time culminating into the first set of deployments that we are driving towards in 2029. And we have another two minutes. I’m just about done. I want to actually stop with one final thing, and that is this part over here. What you heard is just a glimpse into the kind of world that we as Qualcomm live in.
We are probably the only ones in the industry that work on everything from doorbells to data centers. There’s a lot of others who focus on data centers, maybe on servers, but they don’t exist below phones. We actually work ground up from everywhere over there. So happy to talk with
Thank you, Durga, for this insightful presentation. As we talk about inclusive AI at scale, enabling developers is critical. Innovation only moves as fast as the tools behind it. Through the Qualcomm AI Hub, we are simplifying how developers access optimized models, and test and deploy high-performance on-device AI from edge to cloud. To share how we are accelerating this developer ecosystem, please join me in welcoming Siddhika Nevrekar, Senior Director and Head of Qualcomm AI Hub, to moderate our panel discussion with leading startup founders, exploring the evolving AI ecosystem and what excites them about building with on-device AI. Please join me in welcoming Siddhika.
I would like to welcome the panel over. You guys know who you are, so I don’t need to introduce you. Can we just take a moment for a quick picture, if that’s possible? Thank you.
So I’m Shreenivas Chetlapalli. I lead the innovation track for Tech Mahindra for AI and emerging technologies, which includes blockchain and metaverse. And I’m also responsible for creating an innovation ecosystem across a network of labs that we’ve created globally. Thanks.
Hi, I’m Madhav. I’m the co -founder and CTO at SpotDraft. We do AI for legal. We’ve created a bunch of agents that help lawyers not just review contracts, but also draft them and negotiate them faster and better.
Hi, everyone. I’m Praveer Kochhar. I’m one of the co-founders of Kogo AI. We run a full stack private agentic operating system from the edge to the cloud. So we are bringing agents closer to enterprise data rather than taking data to agents. So we are a 100% sovereign, built-from-scratch platform. And we do some very… exciting work with Qualcomm. I hope I get to share that with you today.
All right. Let’s start with some questions. None of you know these, so these are fun because they’ll be a surprise to you. They’re not hard. They’re very easy. We’ll start with you, Praveer. We’ll go in the reverse order because that kind of throws a curveball. What’s the most underrated pain point for enterprise users that AI will solve? You can perhaps talk specific to your product.
Did you say underrated?
Yes.
So there’s a concept called shadow AI. I don’t know how many of you know about shadow AI. Shadow AI is a lot of people who work in companies sharing critical enterprise data on the cloud while using unauthorized AI tools like OpenAI or Claude. So 78% of enterprise users use shadow AI. And that’s a big concern. It’s underrated, but it’s still driving efficiency, so not a lot of eyeballs are going there. But I think that’s going to become one of the critical issues as we move forward. Things get more complex. Agentic systems get more complex. More data is shared on the cloud. So I think, yeah, for me, it would be shadow AI that people are using.
That’s a good answer. It was a curveball, but you caught it. Okay, let’s go to Madhav. You work in a very niche field, you know, legal, which is very, very niche, and you also still dabble with technology, right? Yes, you like it. So: your biggest and favorite AI failure building SpotDraft that set you up for success. Can you remember any of that?
That’s a great question. So it sort of goes back to our founding years, where we were a little bit early to the game. This is around six to eight years back, when transformers were what people were talking about, and not LLMs. And back then, we came in with the idea that, you know, cars are driving themselves, so why can’t AI actually review contracts for you? So we spent a bunch of time with enterprise customers trying to deploy AI and realized that we would have to train a model for each customer. And we built out our entire data labeling and annotation pipeline, as well as the team, at that point. So while that was in a way a failure, because we then decided not to do that, because we didn’t want to do services.
Otherwise, we would be building models one per customer. And the genesis of SpotDraft as it exists today came from there, because we wanted to capture the data as lawyers were using the technology that they anyways use, which is where our Word plugin comes in. So we can actually capture what they’re doing. And then our annotation team was also set up back then. And that’s sort of how today we are able to give grounded answers using data that is the customer’s data, because of all the things we built back then.
That’s a good one. So now you’re on a path of just never making single models for each customer again.
I mean, I hope we don’t have to go back there, and I think a lot of the models that have come out are enabling that. But that part, yes, not regretting it.
All right. Srini, last -minute addition, so thank you. I know that it’s difficult to get here. This is probably something that you’ll be able to share with us. Yes. What’s the special ingredient for successful AI adoption in India specifically?
Okay, that’s a tough question to ask. I think the most important thing is understanding the limitations of AI. So typically it’s very easy to understand what are the advantages of doing AI. But if we can set the expectations right, that AI will augment their work to a certain extent, that will be one. Second, the complete misnomer that it is here to take away jobs has to be removed. I think these are two things.
How do you feel about AI being trusted in India? Is it trusted enough? Is it adopted?
So if you look at the adoption of AI, we are almost at the global level in terms of the enterprises that we are talking to. But the best part that I have seen is that a large number of public sector banks have taken to AI in a big way. Some of the banks have been our customers for both AI and emerging technologies. And we’ve also seen PSUs talking about AI. And a lot of state governments, I had a chance to meet a lot of ministerial delegations today, have set up AI centers. So we are in the game.
Yeah, good. Ritukar, this is an easy one. You think about this probably a lot. Cloud or on -device AI, which is the most important? Which, where and when?
So, in continuation to the previous question: just throwing a bunch of compute at a problem statement is not how AI is adopted in enterprise settings, because it’s very important to break down the big problem into smaller chunks, and decide for what you want to use AI and for what you don’t want to use AI. And that’s exactly what we do in robotics. We break down what is happening on the edge and what is happening on the cloud. So right now, at this point in time, we do orchestration on the cloud, which is for the fleets of robots, but we are doing all the autonomous navigation on the edge part of it. And for us it’s very important to understand more intelligent navigation, so at this point it’s been almost one and a half years since we started running VLMs on the edge to understand the context. So I think that’s how you break down the overall problem, not just running everything on the edge or running everything on the cloud, because that won’t solve the problem.
Yeah, that’s pretty much how we break it down into small chunks.
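The split described here, cloud for fleet-level orchestration and edge for the real-time navigation loop, can be pictured with a small sketch. Everything below is a hypothetical illustration of the pattern, not any vendor’s actual stack.

```python
# Hypothetical cloud/edge split for robotics: the cloud assigns missions at
# a slow cadence with a global view, while each robot runs its navigation
# loop locally, so it keeps moving even if connectivity drops.
import random

class CloudOrchestrator:
    """Fleet-level planning: slow cadence, global view of all robots."""
    def assign_missions(self, robots, goals):
        return list(zip(robots, goals))

class Robot:
    """Robot-level autonomy: fast local loop, no cloud round-trip."""
    def __init__(self, name: str):
        self.name = name

    def navigate(self, goal: str) -> str:
        # Stand-in for on-device perception and planning (e.g. a local VLM).
        steps = random.randint(3, 6)
        return f"{self.name}: reached {goal} in {steps} local planning steps"

fleet = [Robot("bot-1"), Robot("bot-2")]
for robot, goal in CloudOrchestrator().assign_missions(fleet, ["dock A", "gate 7"]):
    print(robot.navigate(goal))  # navigation runs fully on the edge
```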
So you guys are very thoughtful and very quickly giving these answers for longer questions. So we’ll go to rapid fire, which is just picking one word. There’s no judgment here. You can pick A or B, maybe a couple of words about why. Not too long. So we’ll start with you, Ritukar. 6G or AI?
Sorry?
6G. Or AI.
So, okay, this is a long one; I can just share a good anecdote there. So we were running robots at Rio Tinto in Australia, in mining areas, right? There is no internet. Still, we want to use AI on the edge. And so what we did was we put satellite links on each of the robots, right? So connectivity is very important; if it is 6G, it’s better. I’ll go for 6G, because that opens up a lot more possibilities there.
That’s a good one. I thought you would pick AI, because that’s the buzzword that’s anyways happening. Good answer. Srini, data center or local?
Local is the first option, but for India, data center makes business sense. Local, because one of the key products that we have built, called Orion, which is an AI platform, has been built for on-prem. And we also see that a large number of requirements that have come to us are: how do I process things on my own premises, rather than doing an API call or taking it to the cloud? I know you asked for India, but I have seen this happening in the Middle East also, where one of the world’s largest companies asked: can my execs solve their things on their own desktops, or locally?
So local.
Local for you, okay. For you, I’m looking through because I want to ask a specific one. Madhav, artificial or human?
I mean, when you deal with lawyers at the end of the day, I have to go with human, and I know you’d expect me to easily pick artificial. You can’t go hold an AI model’s neck, but you will go hold a lawyer’s neck. So for us, it’s important to give the lawyer the capability to do their job better, faster, with more thorough research. But at the end of the day, it has to be them taking that decision, because a lot of times it’s not black and white. Those are the easy scenarios. It’s the gray areas where the lawyers are able to come in and really guide their customers and clients as to what to do and what not to do.
That’s a great question. I think we still want AI to be human, right? So I think it’s… That’s a good answer, but there is no judgment; you could have said otherwise. Praveer, regulate or innovate?
No, 100% innovate. I don’t see any reason otherwise. Anyways, regulation in the age of AI is always going to play catch-up, because technology, the speed at which it’s growing, it’s very difficult to regulate it before it goes, because we don’t even know the social implications of what we are building. And as we build them, and as it goes into public and people start using it, these tools are very intelligent; they’re getting more intelligent by the week. So I think it will always be innovation at the side of caution, but I don’t think this is an industry that you can regulate first and then expect it to grow.
Your very first answer, about, I wouldn’t say illegal, but unauthorized usage, was pretty much in line with this, and it still was saving time. So I think that’s a good answer. For the next ones, you don’t have to say why, so you can pick an answer. Again, no judgment; you can pick whichever one you want. Agent tech or robotics?
Robots are the agents.
You have to pick one.
So agents, yeah.
Okay. LLM or SLM?
SLM, all the time.
Integrations or automation?
You can’t do automation without integrations, so I would have to go with integrations.
Build a chip or buy a chip? This is just a selfish question, but, you know.
I would sell a chip, but then build a chip always.
Wow, it’s an interesting answer. I don’t know how much time is left. Okay. All right, we’ll do a few extra questions. You guys can take longer now to answer the questions, I guess. Just moderate the time accordingly. So what’s the one hardware constraint that keeps you up at night?
So one of the biggest hardware constraints is if the entirety of the system is without any connectivity and you are restricted, so that you cannot access robots remotely in any which way, be it for scheduled maintenance or predictive maintenance or anything of that sort, and even emergency situations. Like what we see, you know, the Waymos which are running in San Francisco right now, they are monitored from the Philippines, right? So I think that part is something which is very important: that everything should be connected at all times. So I think that keeps us awake, that, you know, the robots should not go into silos or isolated places where we cannot reach them.
And only then we have to physically, you know, make sure that somebody is around to manage a fleet or whatever.
You talked about local. So I’m going to ask this question which seems apt for you. What’s more dangerous? Too much data leaving the device or too little?
Too much data leaving the device, I think.
How do you train? I was saying, how do you train if it doesn’t leave?
See, I think the focus for us also has been how do we train with less data and make it much better. The moment we’re talking about more data, and more data leaving, we’re actually talking about more issues happening, more breaches happening. So if we can train with a lesser amount of data, or if you can create synthetic data sets and work with them, that’s the best way for an LLM to be trained, rather than waiting for a large data set to come and then, like you said, waiting for it to leave.
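A toy illustration of the synthetic-data idea raised here: instead of exporting real user records, generate plausible stand-ins from a schema and train on those. This is purely illustrative; production synthetic-data pipelines and their privacy guarantees are far more involved.

```python
# Generate labeled synthetic records so no real customer data has to leave
# the device or premises. All values below are fabricated by construction.
import random

MERCHANTS = ["grocery", "fuel", "pharmacy", "electronics"]

def synthetic_transaction() -> dict:
    return {
        "merchant": random.choice(MERCHANTS),
        "amount": round(random.lognormvariate(6, 1), 2),  # plausible spend
        "hour": random.randint(0, 23),
        "is_fraud": random.random() < 0.02,               # injected label
    }

train_set = [synthetic_transaction() for _ in range(1_000)]
print(train_set[0])  # nothing here came from a real device
```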
If I may, it depends. If it is enterprise, then less data going up is always better. If it is B2C, then everybody wants to learn from that data, because that is free data. So in a way, it’s very much situational.
Yeah. Okay. This is probably going to be interesting. You get to tell another story. What was the last thing that made you go, wow, about AI? And this doesn’t need to do, don’t pitch your company. It’s fantastic.
I’ll try not to. This sort of goes back to the last question, in a way. A lot of companies have so much data sitting in people’s heads, in people’s inboxes, random SharePoints, drives. And historically, what we’ve seen as we onboard customers: they’re like, oh, I have a playbook, you know, which is a policy of what contracts we will sign and won’t sign, but we also know that it is out of date. And we’ve been working on techniques to be able to really infer that from older data. One of the things that we’ve seen, which really blew my mind: we actually ran one of the early prototypes of that on our internal data, we run SpotDraft on SpotDraft, and some of the things it threw up, when I was talking to our internal legal team, I expected them to say, no, this is absolutely wrong. And that guy is like, actually, I want to know where this came from, because I have been trying to track down why certain contracts have certain clauses and not others. So it’s that ability to do knowledge work which otherwise would not be done at all, and to have this always up-to-date, always learning sort of knowledge base that truly captures what your company and organization policies are. That’s something that no one wants to spend, you know, 100k to get lawyers to create. But if you have an agentic way of doing it, then suddenly that becomes the one thing that everyone cares about, because that is now your onboarding; that is now what you, you know, compare your new contracts with.
And I think on the coding side, we’ve already started seeing a lot of this, where things like, you know, Claude Code and Codex are able to go in and learn from your code base and give you these insights, which earlier would take a new engineer maybe a month or a quarter to get onboarded. Now they’ve started shipping code within days because of this, and that is going to start happening across all kinds of knowledge work. And for us, the, you know, the wow moment was when the lawyer who doesn’t trust AI suddenly said, no, I need to see this, this is
So I’m going to spin to Madhav the not-CTO: maybe a consumer AI feature that just wowed you in recent times? Any you can think of?
I think, I’m sure everyone has been talking about OpenClaw: the ability for me to have a personal assistant, almost, when I, of course, can’t afford one. But for that to really sit and start doing a lot of these things for me, and I’m sure it’s going to come to everyone’s devices very soon, hopefully with Qualcomm chips. And that, I think, is where I was really wowed by it, because I deployed it on my WhatsApp and it started sending messages to people. It was a little bit scary, but it also saved me a bunch of time. So that was where I was like, OK, this is something that was not at all possible before.
All great responses. On WhatsApp, I had to switch it off very quickly because there’s just too much data in there. But that is the next challenge, right? How do we control these autonomous agents, especially when they’re sitting on your personal data? Given you’re a rebel, we’re going to ask you: what are you most scared of?
No, there’s a lot of fear. There’s a lot of fear, because I think we don’t know the societal impact of this technology yet, and I think that’s probably the largest fear. Because up till now, we were engaging with algorithms that were trained to derive attention from us; now we are dealing with intelligent algorithms that can self-adapt and become far more personalized. Now, with the ability to generate content at will, I think it will be very difficult to keep attention away from a device when you have a hyper-intelligent system on the other side that’s changing itself based on you. It will become extremely addictive. So I think that’s the biggest fear.
Yes, but then we are pleasure-seeking beings; we will go after that until it gives us some guardrails, and then we’ll have apps that will lock themselves up for two days and we won’t use them. It’s possible that we’ll be all on vacation and the robots will be interacting with
Yeah, and then imagine what we’ll be doing; we’ll be interacting with these attention-seeking agents, right? I just want to take the last question also, because I just saw a reel recently: they got a Unitree robot in Bangalore and they sent it out to beg. So it was the first robotic beggar that somebody started out, and was there more empathy? Probably there was more empathy, I don’t know. But I still think that there are a lot of tangential use cases of AI that can come out of all this. And yeah, I mean, that’s something that got me, and also kind of told me that you can think very, very differently about this technology, and not just think what we do and replicate what we do.
There are a lot of tangential things that might come out of this.
I asked why there was more empathy because, recently, I was driving on a two-lane road. One lane was completely blocked and everybody was trying to squeeze into the other lane, and when you passed by, you saw a Waymo that was not operational. And nobody was upset, nobody was screaming. I thought: just because it’s a robot, you’re more empathetic? But they were. So it changes your psychology somehow.
Yes, yes. And we are still not interacting with robots on a day-to-day basis. I think that will be another kind of mystery added to our societal weave.
True. Thanks for taking the second question too, which was interesting. All right, we’ll get into closing so we can wrap up. You’ll all get to pitch your companies, so that’s very exciting. We’ll start with a complete-the-sentence in one word, so you have to say just one word: Edge AI in 2030 will be blank. You can repeat the sentence.
Edge AI in 2030 will be, I mean, not seen as sophisticated at all; it will be taken for granted, just like you take connectivity for granted. That’s how edge AI will be: almost everywhere by default, like the Humane Pins and everything we talked about in the keynote as well. So I think it will be like that: taken for granted.
Will you still complete it with one word? Sorry, “taken for granted” is one word. Okay: granted. So it will be business as usual, taken for granted. That’s it, I mean, nobody will mind that. Edge AI in 2030 will be the default.
Ubiquitous. I think there will not be anything that does not have AI. A lot of Hollywood sci-fi has demonstrated this: we will probably be talking to tables or screens or walls, to the degree that anything that can have a chip inside it will also have AI inside it. AGI in 2030 will be?
I think AGI in 2030 will be emergent; we will start seeing signs of it. What OpenClaw just did was a very small trick in the play, but it added a little bit of emergent behavior to the LLM by giving it the autonomy to create its own files. That’s all OpenClaw did, and that’s the magic behind it. I think that’s going to come to the edge, and with that emergent behavior you are actually giving a model the ability to create its own learning. That’s why I say emergent. That’s a good answer.
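To make “autonomy to create its own files” concrete, here is a minimal agent-loop sketch. OpenClaw’s internals are not described in the session, so the WRITE_FILE convention and the ask_llm stand-in below are a generic illustration of the pattern, not its actual implementation:

```python
# Generic sketch of an agent that can persist its own files between turns.
from pathlib import Path

WORKSPACE = Path("agent_workspace")

def ask_llm(prompt: str) -> str:
    """Stand-in for any chat-completion call; wire up a real model here."""
    raise NotImplementedError

def run_agent(task: str, max_turns: int = 5) -> str:
    """Loop in which the model may either answer or write a file to its workspace."""
    WORKSPACE.mkdir(exist_ok=True)
    history = (
        f"Task: {task}\n"
        "Answer directly, or persist notes by replying exactly:\n"
        "WRITE_FILE <name> <content>\n"
    )
    for _ in range(max_turns):
        reply = ask_llm(history)
        parts = reply.split(" ", 2)
        if parts[0] == "WRITE_FILE" and len(parts) == 3:
            _, name, content = parts
            (WORKSPACE / name).write_text(content)  # the model grows its own memory
            history += f"\n(wrote {name})"
            continue
        return reply  # a plain reply is treated as the final answer
    return "max turns reached"
```

The point of the sketch is how small the mechanism is: a single extra action in the loop lets the model accumulate state across turns, which is the “emergent behavior” being described.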
One last thing you want the audience to remember. This is also the cue for a pitch, if you like. As I said earlier, robots are agents, and…
I think I kind of agree with that. Part of us will be agentic as well, because we’ll have some AI in us too; there’s a lot of work going on with Neuralink, and airports are tracking brain waves to see how you react to a particular situation. So both robots and people will be agentic in some fashion, and you need some orchestration where everything can talk to everything else. That’s what we are looking forward to doing. And I think one thing that we should all remember is that there is a lot of work that Tech Mahindra and Qualcomm are doing together in detecting fraud calls, and this is using Agile LM. I think that research will see a lot of action as we go ahead, because the number of fraud calls we are getting is increasing every day. It’s an area where we will see a lot happening, and I think both our companies are geared for it.
I think it was mentioned in the keynote; one of the takeaways for me would be how the way we interact with technology today is going to change entirely. UIs, phones, screens, all of these are going away, and everything is becoming very, very generative, whether it is slides being generated for you on the fly based on the conversation you’re having, or even entire apps and UIs being generated for each specific scenario and use case. Everything is going to move away from being SaaS that people have to learn, toward something that actually cares about you as an individual persona. And that opens up a lot more, specifically in the Indian context, where people might not have to go through so much training and learning; they can just go and start using it, because the platform can actually understand your needs, as opposed to you having to understand the platform.
Can you just repeat the question once for me, please? One thing you want the audience to remember, whatever you want them to remember.
So: remember how we used to work, and plan for how we are going to work, because very soon a lot of time will become available to us. A lot of the systems we manage will be intelligent and autonomous, and we’ll only have to take decisions. What we do with that time is going to be a critical question everyone asks themselves. I also think all of us are going to become builders, because we’ll have very intelligent tools to build things, run them, and manage multiple systems at the same time. So I see that future, and I think we should all look around and see how we manage things today and how we are going to do that in the future.
Great. This is a chance to actually pitch your company, but it’s okay, it’s been pitched. I will give you a more specific one to pitch: there are a lot of people in the audience, maybe some customers. If they were to find you, where should they find you, or where can they talk to you, and what should they come and talk to you about? What specifically, and in what industry?
So, I mean, we are Autonomy, so you can always find us at Autonomy; that’s where you can find us, anywhere. Yeah, we are the brand, I think, and we are proud of it. The most important thing is that, just like with AI generally, there’s a lot of emphasis on physical AI now, and it’s not something that is going to come; it’s already there. It’s just the adoption curve that is happening now. So think of more ways of adopting the technology, and if enterprise customers are looking to adopt more and more robots, not only for the dull and dirty scenarios but also in different walks of life, that is where you should talk to us, and we can help.
Even if they are not our robots, we can help them orchestrate a variety of things while still keeping some level of control. Yeah. Thank you.
Thank you.
“Durga Malladi is Executive Vice President and General Manager of Technology Planning, Edge Solutions and Data Center at Qualcomm Technologies.”
The knowledge base lists Durga Malladi with exactly that title at Qualcomm Technologies [S1] and references her participation in panels as a Qualcomm representative [S17] [S96].
“Personal or enterprise data can remain on‑device, addressing privacy concerns.”
The source notes that keeping data on the device protects privacy, confirming the claim that on-device inference helps keep data local [S57].
“Model sizes have shrunk dramatically while quality has risen, with a trend toward smaller, more powerful models.”
The knowledge base discusses industry focus on making models smaller through distillation and mixture-of-experts techniques, highlighting a “ladder of models” that become more efficient while retaining performance [S101] and notes the broader move toward smaller, more capable AI models [S100].
“Premium smartphones can run 10‑billion‑parameter models, AR glasses 1‑2‑billion, and PCs up to 30‑billion‑parameter models without issue.”
Examples of billions-parameter models running on phones, PCs, and cars are documented, showing that such on-device deployments are feasible, though exact parameter counts are not specified [S29].
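A back-of-envelope memory calculation shows why these on-device figures are plausible. The quantization bit widths below are my own assumptions for illustration; neither the session nor the sources specify them:

```python
# Rough feasibility check: weight storage = parameters x bits per weight.
# Bit widths (int4/int8) are assumptions for illustration only.
def weight_memory_gb(params_billion: float, bits_per_weight: int) -> float:
    """Raw weight storage in gibibytes, before activations and KV cache."""
    return params_billion * 1e9 * bits_per_weight / 8 / 2**30

for params, device in [(1, "AR glasses"), (10, "premium phone"), (30, "PC")]:
    for bits in (4, 8):
        print(f"{params:>2}B on {device:<13} at int{bits}: "
              f"{weight_memory_gb(params, bits):5.1f} GB")
# A 10B model at int4 needs roughly 4.7 GB of weights, which fits in the
# 12-16 GB RAM of a premium smartphone; at int8 it would need roughly 9.3 GB.
```

On these assumptions, the claimed tiers line up with device memory budgets: sub-1B models suit glasses, ~10B models suit phones, and ~30B models suit PCs with more RAM to spare.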
The discussion reveals strong consensus around the need for distributed, hybrid AI architectures, the importance of connectivity (including 6G) for edge AI, and the imperative to keep data on‑device for privacy. Participants also agree that AI will become ubiquitous and seamlessly integrated into everyday experiences.
High consensus on technical and strategic directions (hybrid processing, connectivity, privacy) indicating a shared vision among industry leaders, which bodes well for coordinated development of AI ecosystems and policies.
The panel exhibited limited direct conflict, with most participants aligning on the vision of pervasive, hybrid AI that enhances user experience while preserving privacy. The principal disagreements revolved around which hardware or data‑flow factor should be prioritized—connectivity versus energy/memory efficiency, and the amount of data that should leave devices. These differences reflect varied domain perspectives (robotics vs semiconductor) rather than fundamental opposition to the overall AI strategy.
Low to moderate. The disagreements are technical and contextual, not ideological, suggesting that consensus on broader AI goals (ubiquitous edge AI, privacy, hybrid processing) is strong, but implementation pathways will require cross‑sector coordination to reconcile connectivity, energy, and data‑governance priorities.
These comments acted as the engine of the discussion. Durga’s technical framing of shrinking models and the AI‑agent UI set a forward‑looking context, while the panelists injected real‑world friction points—shadow AI, data privacy, regulatory lag, and hardware connectivity—that grounded the vision. Each insight sparked a new thread: governance, product strategy, regional adoption, and operational constraints. Together they moved the conversation from abstract possibilities to concrete challenges and opportunities, giving the audience both a compelling future narrative and actionable considerations for building on‑device AI.
Disclaimer: This is not an official session record. DiploAI generates these resources from audiovisual recordings, and they are presented as-is, including potential errors. Due to logistical challenges, such as discrepancies in audio/video or transcripts, names may be misspelled. We strive for accuracy to the best of our ability.