ElevenLabs Voice AI Session & NCRB/NPMFireside Chat
Summary
The session opened with Swati Sharma highlighting that India’s 1.4 billion citizens speak diverse languages, yet most online content is available only in English, creating a major accessibility gap [1-6]. Shailendra Pal Singh introduced the Bhashni (also referred to as Pashni/Bhajani) translation plugin, already deployed on over 500 websites, designed to break this language barrier by automatically translating website content into any of India’s 22 scheduled languages [7-19]. He illustrated the need with a farmer who had to travel 40 km to fill an English form for the PM Kisan Samman Nidhi, underscoring that 800 million Indians are not fluent in English and that 95 % of digital content is English-only; the plugin addresses this by providing a lightweight, one-liner code that can be copied onto any site and render multilingual pages within minutes without backend redesign [21-45]. Swati noted that more than 400 websites have already integrated the plug-in, generating over 24 million translation inferences and creating 1.5 million glossary entries to improve contextual accuracy [96-99]. During a live demo she showed that the plug-in instantly adds a language selector supporting all 22 languages, works across all pages, and complies with Digital Brand Management (DBM) standards to ensure accessibility for visually impaired users [52-95]. The solution requires no developer expertise-anyone can copy-paste the single line of code to make a site multilingual [46-49], and it is framework-agnostic, operating on any website stack [88-89]. Advanced capabilities include translating from any source language, skipping specific elements via a CSS class, customizing language order, limiting displayed languages, handling portals without page reload, batching dynamic content to reduce API calls, voice-activated translation, and URL redirection to language-specific domains [96-180]. Glossaries allow precise control over terminology and transliteration (e.g., preserving “Vakil Saab Bridge” or correcting “home” to “Mukhya Prash”), with over 1.5 million entries created to enhance contextual relevance [190-224]. Future plans expand support to 36 Indian languages plus 35 international ones, automate glossary uploads, and add text-to-speech and screen-reader features for broader accessibility [190-197]. Audience questions addressed commercial use-private entities can adopt the solution under separate agreements-and the possibility of region-based default language selection, which is technically feasible pending further review [304-322]. Swati explained that glossaries are customized per client and ingested into their solutions, and while the team does fine-tune models for specific domains, this requires careful classification and is an ongoing effort [330-336]. The discussion concluded by emphasizing that language is an identity and that the Bhashni translation plugin, together with tailored glossaries, aims to provide a multilingual AI layer for digital inclusion across India’s diverse population [301-303].
Keypoints
Major discussion points
– The language barrier in India and the need for a multilingual digital layer – the speakers stress that India’s 1.4 billion citizens speak many languages, yet most online content is only in English, creating a “language divide” that excludes 800 million non-English speakers [1-6][27-29][31-34].
– Introduction and live demo of the Bhashni (Bhashni/Bhajani) translation plugin – described as a lightweight, one-liner code that can be copy-pasted onto any website to instantly render it in all 22 Indian scheduled languages without backend redesign; it is DBM-compliant and framework-agnostic [9-10][36-45][52-54][67-71][79-88].
– Key technical features and how the plugin handles real-world challenges – automatic multilingual support across all pages, skip-translation classes, customizable language ordering, domain-level redirection, batch processing of dynamic content, no-reload option for portals, and automatic handling of mixed-language text [79-88][90-108][111-118][119-126][129-136][141-148][152-160][162-168][170-176][179-181][184-186].
– The role of glossaries in improving translation quality and contextual relevance – glossaries capture domain-specific terms, correct mistranslations, manage transliteration, and are customized per client; examples include handling “home” vs. “ghar”, proper nouns like “Vakil Saab Bridge”, and abbreviation disambiguation; the team also discusses automation of glossary ingestion and future fine-tuning of models [96-101][115-124][128-136][190-197][210-218][221-226][230-238][241-250][254-262][267-274][276-284][286-294][295-299][330-336].
– Audience questions on commercial use, regional default language settings, and glossary maintenance – participants ask whether the solution can be used by private entities, how default languages could be set per region, and how glossaries are managed, customized, and potentially used for model fine-tuning [304-306][307-310][311-317][318-325][326-329][330-336][333-336].
Overall purpose / goal of the discussion
The session aims to raise awareness of India’s digital language exclusion, present the Bhashni translation plugin as a scalable, easy-to-integrate infrastructure for multilingual web content, demonstrate its capabilities and technical nuances, explain the supporting glossary system, and engage stakeholders (government, private, and developers) about adoption, customization, and future enhancements.
Overall tone and its evolution
The conversation begins with a problem-focused, urgent tone, highlighting the scale of exclusion. It then shifts to an enthusiastic, demonstrative tone as the speakers showcase the plugin’s simplicity and power. A technical, solution-oriented tone follows when discussing features, challenges, and glossaries. Finally, during the Q&A, the tone becomes responsive and collaborative, addressing audience concerns and emphasizing partnership opportunities. Throughout, the tone remains professional, optimistic, and supportive.
Speakers
– Swati Sharma
– Role/Title: Presenter / Expert on Bhashini translation solutions and language accessibility initiatives
– Area of Expertise: Multilingual AI, digital inclusion, language technology
– Source: [S7]
– Shailendra Pal Singh
– Role/Title: Senior General Manager, Bhashani
– Area of Expertise: Technical implementation and integration of translation solutions, language barrier mitigation
– Source: [S3]
– Audience
– Role/Title: General attendees (including professionals such as professors, researchers, and industry experts)
– Area of Expertise: Varied across public administration, cybersecurity, digital identity, etc.
Additional speakers:
– None identified beyond the listed speakers.
The session opened with Swati Sharma foregrounding India’s massive linguistic divide: a nation of 1.4 billion people – and therefore 1.4 billion distinct voices – nonetheless finds almost all online content presented in a single language, predominantly English [1-6][27-29]. She argued that this “language divide” excludes roughly 800 million citizens who are not fluent in English, limiting their ability to access digital services and undermining inclusive development [31-34][27-28].
Shailendra Pal Singh then framed the problem as a national imperative to “break the language barrier that exists in our country” and introduced the Bhashni (also known as Pashni/Bhajani) translation plugin as a concrete solution [7-13]. He noted that the plugin is already deployed on more than 500 websites, providing automatic multilingual rendering for users who cannot understand English or Hindi content on state-level portals [9-10][19]. The technology leverages over 350 language models from the team’s platform to deliver translations in any of India’s 22 scheduled languages [19-20].
To illustrate the human impact, Swati recounted a farmer who had to travel 40 km merely to find someone capable of completing an English-only PM Kisan Samman Nidhi form, underscoring how the language gap translates into real-world hardship [21-28]. She positioned the Bhashni plugin as the remedy, describing it as a “unified multilingual layer for India’s digital ecosystem” that treats language not as a mere feature but as foundational infrastructure for digital inclusion [31-36].
The core of the solution is a framework-agnostic, DBM-compliant JavaScript one-liner that can be copied and pasted onto any website, instantly rendering the entire site in all 22 Indian scheduled languages without any backend redesign or repeated code insertion [36-45][41-45][46-49][52-55][67-71][79-86][88-95][92-95]. Once embedded on the home page, it automatically applies to every subsequent page, so developers do not need to insert the code repeatedly.
Advanced capabilities follow a logical progression:
* Direct source-to-target translation (e.g., Marathi → Hindi) without an English intermediate [104-108];
* A CSS “skip-translation” class that lets developers exclude elements such as calendars, email addresses, or other non-translatable sections [111-118];
* Customizable language ordering and the option to limit the dropdown to a subset of languages [119-126][141-148];
* Portal-mode operation that prevents full-page reloads and preserves any data the user has entered [149-151];
* Batch processing of dynamic content to reduce API calls and stabilise response times, illustrated with the State Bank of India and MyBharat Hotel examples [162-168];
* Automatic detection and skipping of already-translated mixed-language segments [152-160];
* Voice-activated language selection demonstrated on the Rail Madad site, where speaking a language name triggers instant localisation [171-174]; and
* URL redirection that maps a language selection to a dedicated domain (e.g., a Hindi-specific domain for the MSD website) [176-180].
Impact metrics substantiate the rollout: more than 400 websites have integrated the plugin, generating over 24 million translation inferences and creating upwards of 1.5 million glossary entries to improve contextual accuracy [96-99]. Glossary management is client-specific: each customer’s domain-specific glossary is ingested only into that client’s solution, ensuring correct terminology and preventing cross-site contamination [330-336]. Representative glossary entries illustrate the breadth of the effort: correcting mistranslations such as “home” rendered as “Mukhya Prash” instead of the literal “ghar” [190-186]; preserving proper nouns like “Vakil Saab Bridge” through transliteration [221-226]; resolving abbreviation ambiguities such as “BN” meaning “battalion” for the BSF rather than “billion” [276-284]; skipping abbreviation translation for the animal-husbandry department [260-262]; handling the nuance between “authorized officer” and “newt adhikari” [268-270]; and ensuring the term “Maanenya” appears only when explicitly provided in the source text for the “PS to Minister” use-case [276-284]. Additional practical issues uncovered during deployments-such as hyphen mismatches, singular-plural inconsistencies, and punctuation-induced sentence breaks-were remedied through targeted glossary entries [254-262][267-274][241-250][295-299].
Looking ahead, the roadmap includes support for 36 Indian languages (up from the current 22) and the addition of 35 international languages, automation of glossary uploads via an onboarding portal, and an accessibility bar that integrates text-to-speech and screen-reader functionality [190-197][189-197].
Audience Q&A – Private or commercial entities can adopt the plugin under separate collaboration agreements, with details available at the Bhashni Pavilion [304-310]; region-based default language selection (e.g., Hindi for Delhi users, Marathi for Maharashtra users) is technically feasible but requires a feasibility assessment before implementation [311-317][318-324]; glossaries are customised per client and model fine-tuning is undertaken after careful domain classification, a lengthy but ongoing process [330-336][333-336].
Swati concluded by emphasizing that language is not merely a set of words but an expression of identity, urging stakeholders to prepare India’s linguistic landscape for the future of AI by creating glossaries, embedding multilingual layers, and championing digital dignity, accessibility and inclusivity for all citizens [301-303].
accessibility, language accessibility and language inclusivity. We are a country of 1 .4 billion people. More importantly, a country of 1 .4 billion voices. We all think differently, we all speak differently, and we all dream differently. But whenever we go online, everything is available only in one language. Majorly English.
To break the language barrier that exists in our country. And we have different solutions and different integrations that we have. One of them is Pashni translation plugin, which is already sitting on top of more than 500 websites, if I’m not wrong, the exact number. And we are enabling people, we are enabling citizens of India who are essentially not being able to understand in English and Hindi because most of the digital content that you see, primarily the website, maximum you’ll see is a website which is sitting in a state. The default language would be there or English primarily. But then what about rest of the languages? Imagine a scenario that I’m someone from north and I’m living there in Maharashtra.
Mostly you will see the content in Marathi or English. But then what about having the same content? I don’t know English. But I really want to understand what is there. And I want to convert it, the different policies at the state level, different guidelines, different content, maybe creative content, etc. You need to know in my language. So, Bhajani Translation Plugin is one of the engineered solution using all the models that you might already be aware of. 350 plus models from our platform. We have this solution as Peksa Swati.
So, as Shailendra Pal mentioned, last year a farmer wanted to apply for the PM Kisan Samman Nidhi. It’s basically a very simple form that the farmer has to fill. But the form was in English. The farmer literally had to travel 40 kilometers only to find somebody who can actually help him out filling the form. This is the language divide. This is the barrier that we are trying to avoid. Eliminate. 800 plus million people. are not fluent in English. And 95 % of the content which is available, it is in English. This is where Bhashni comes into picture. The National Language Translation Mission of India. We are trying to transcend the language barrier. We are creating a unified multilingual layer for India’s digital ecosystem.
We are not just providing language as a feature. We are providing language as an infrastructure. We are encouraging language as the foundation for digital inclusion. Next slide, please. So, like sir introduced, the Bhashni Translation Plugin. It’s a powerful product through which you can have any website being translated into multiple languages, being accessible to all the people in the last mile. And this happens in matter of minutes. Not days. Or months. Or just minutes. This is the power of the product that we are talking about. And you don’t have to rebuild the entire website. You don’t have to redesign it. There is no back -end overhaul. Just one liner, very lightweight, simple code that you can copy and paste onto the website and you will have your website speaking multiple Indian languages.
This is how accessibility is made effortless, inclusion is made scalable, and the last mile reach is made real. So I just want anybody to see. Anybody who can copy and paste. Like we don’t need a developer or a person who knows JavaScript or the entire back -end. just somebody knows copy and paste and we’ll see how with the help of that you can have the entire website multilingual. So anybody who would like to do that? Yes, sir, please.
Maybe, you want to open a website first and show what exactly VashuCast is.
So this is the Vashni’s website and here is the plugin that has been integrated on the website. This plugin will help us have the entire website available in all 22 Indian languages. All right, so while we just give a quick glimpse of what Bhajani translation plugin is, it is basically a very lightweight utility, though. you find it very simple but the content that we have on this website primarily is in English and there are other challenges that you that we would like to discuss later on as how this translation plugin brings in though it looks very easy just you clicked on a button and then you do a translation all together but then we’ll discuss more about what are the different challenges we come across not from the fact the engineering side of it but on the language side of it how we cater and have this challenge taken care so this is just a plugin we just wanted to tell you this is how it works but you know if you go back to English then and then you know we will just talk about what you wanted to we’ll continue with that so I just wanted to have a quick demo of how you can integrate this plug -in onto the website so I think some if yes you can come we’ll just see how with the help of just the knowledge of copy and paste we can have the entire code implemented and you’ll have the entire website translated into multiple different languages.
For the purpose of this demo we had created this dummy website and the code for this website is here. So this is the code that none of us would most of us would not understand. And I would like to request sir to just copy and paste the plug -in code that we have. So we want to tell that this website content is only in English and you want to add multi -lingual flavor to it using Bhashini. You can integrate the solution that we have on the top of
That’s what you’ve meant.
Yes. So if you can sir just copy and paste this code. The code which is written here. Yes.
Anywhere here.
If you can just add a hyphen between translation and plug -in.
Yes.
Can you go back to the website? Refresh it. So you can see that the plug -in is added. And we can now have this website available in all 22 Indian Schedule languages. So that’s the power of this code. We’ve taken care of everything that is happening at the back -end. and you just have to copy and paste the code that we’ve created for you. It’s as simple as that.
So Swati, so let’s say I’ve embedded this particular thing on this particular website. Now it is available. There is the icon. What about if I go to next pages, right? Will the system understand that there’s a link in the I chose and I go to any page? It will reflect Hindi or I have to select every time I go to any page as my language, which I chosen as Hindi.
So you don’t have to apply this code on every page. The pages of the website will automatically understand that the multilingual feature has to be embedded on all the pages. So if you move on to any other page of the website. So this was just a dummy website that we had created. Let me. Go to Bajni translation plugin. in Bhajani’s website. So if you go to any of the pages, the plugin will remain there. And you will have the multilingual feature added on all the pages of the website, not just the home page. So let’s go back to the slides now. So like we just demonstrated, the code that we have for the plugin that we are talking about is a one -liner, very lightweight, simple integrated, simply integrated code, which you can use to have your website available in all 22 Indian Schedule languages.
It is DBM compliant and framework agnostic. So if you have your website, in different, made in different languages, it’s irrespective of that, the code will be applied to your website and you can use the same code.
So Swati, can you just give some light on what is DBM compliant as how the website is DBM compliant? If I, let’s say I have a government website and I want to include the Bhasini translation plugin onto it, what is this DBM compliant that you talked about?
So these are the compliances mentioned in the digital brand identity management compliance book that is available. So for everybody to have an accessible website, the DBM compliance have to be followed. And we have the DBM compliant code with us wherein all the accessibility features like, you know, that happens in the backend, you know, for, any person who is a visually special person who wants to access the website. is able to do that with the help of the technical integrities that we’ve incorporated into the plug -in code that we have. So this is a glimpse of the impact that we’ve already created. We have approximately more than 400 plus websites that are already integrated with Pashni translation plug -in.
From those websites, we get approximately 24 million plus inferences. And we’ve created 1 .5 million plus glossaries. So glossary is something that I will take at a later section during the session only. But just for a short description, glossary enhances the translation in such a manner that the end citizen who is actually consuming the content from the website is able to understand the content. And also, these are the 22 Indian scheduled languages in which the plug -in is available. available. Next slide, please. So while we were creating the plugin, we had to create something that, you know, one size fits all product and which is something very difficult to create because everybody has different requirements and to cater to all those requirements, we had to make one product that can simply be accessed by everybody.
So these are some of the use cases that I will be discussing that our plugin has the capability to resolve to. The first one is that generally what happens in, you know, a product like this, you translate, you know, from English to the target language. But here in our plugin, what we’ve done is that even if your website is, let’s say, created in a language other than English, let’s say Marathi. That can also be translated to the targeted language directly. So you don’t have to first translate the website to English and then move on to the targeted language. You can have the source and target language as per your requirement. So that’s how we’ve not, you know, you don’t have to get into the bridge of creating English as an intermediary to move from one language to another.
Next slide, please. Okay, so when I talk about a website, there are different sections of the website. And not all these sections would you want to translate. For example, the calendar, if there’s a calendar, you would not want it to be translated into, you know, the target language. Including email IDs and, you know, there are certain sections that a lot of people didn’t want to be translated. So there is one class that you can embed that is the skip translation class. Embedding that will help you. Navigate to the, navigate the sections that you don’t want to be translated. So, that’s also one feature that we have with our plugin. Next slide, please. Okay, so, you know, you saw the plugin, right?
There were languages listed in a certain manner in the plugin. So, what happens is at, you know, many regional places, we want the plugin to have the regional languages on top. So, for example, after English, people don’t want to go alphabetically like Assamese, Bengali. They would want their regional language. In this case, they wanted Hindi to come in the, you know, to change the order of the languages that are appearing. And that is also possible. So, if you want your regional language to come on top, you can have that with our plugin. So, you know, majorly what we say is that we would want to… We want to display our website in a certain language.
So for example, if you created the website in let’s say English, but you would want all the users to have the language to be displayed as Hindi first. And probably then they can navigate to their own targeted language. So even if your website, the source language of your website is English, you can, there is a possibility of adding the parameter which can have the source language as Hindi or Marathi or Punjabi as the user requires for all your websites. Next slide please. Okay, so what if your… Your website has the, you know, has been created in two languages. So for example, you’ve created your website in English and Marathi also. So that was the use case that we had with finance department Maharashtra.
So they did not want translation to happen in the Marathi language and the English language, though their source language of, you know, so basically the source language of the website was English and Marathi. So if you want to skip translation for different languages also, you can do that. So in this case, what happens is that the user selects a language. If the language of the source is selected, let’s say, you know, English or Marathi, it will go redirected to the English or the Marathi page of the website. And if the user has selected any other language, it will move on to the normal process of translating it into the target language. Next slide, please. So, you know, sometimes we have portals also.
Yes. So, you know, because we would want to have websites available in all 22 Indian scheduled languages so that we try and reach out to the maximum people. But if that is your use case wherein you would want just three or four languages to be displayed for every user to be seen, you can have that also. So the drop -down will only display four languages in that case? Yes. But it’s always encouraged to have all the languages so that everybody, you know, who’s accessing your website can have the website available. Thank you, then. So, talking about this use case, what happens is that in most of the cases, we also have portals. And in portals, we have forms or, you know, we basically ask input from the user who is using the portal.
So if they apply Bhajani translation plugin and they, you know, move on from one language to another, it will reload the entire page. If it reloads the entire page, whatever the user has filled in, like their details, their name, their email IDs, all that information was lost. So what we did to capture this was that now plugin can also have the portals without the reload picture. So if you don’t want the plugin to reload every time a user selects a language from the drop -down, you can have that. Next slide, please. So this was a very interesting use case. You know, you can see this is how the website was displayed. So the source language of the website is English.
But like we can see, after every English, below every English word. there is a different language. So Haryana written in English, then Haryana written in Hindi. Puducherry written in English and some other language. So here this was use case of handling mixed languages. So what we did here was that whenever the plugin sees that the source language of the plugin is different from what characters it is getting, like here in Haryana, it is getting Hindi characters also, it will skip this translation automatically. So you would not have to skip it at your end. We’ve done it and we’ve created it, we’ve designed the plugin in such a manner that if the source language of the website is, you know, if the contents going for the translation are different from the source language of the website, it will automatically skip the translation.
Next slide, please. So… With certain use cases, what happened was… that there was a lot of dynamic content on the website. So, static content can easily be translated. Like, it is also difficult, but it’s not as difficult as handling the dynamic content. But for certain, like for State Bank of India and for MyBharat Hotel, the dynamic content was changing so rapidly that it was making too many API calls and the response time was getting delayed. So, what we did there was that we intelligently had the code running in such a manner that the dynamic content was, the translation of dynamic content was handled in batches. And that’s how the, you know, API calls, the increased API calls reduced and the response time was stabilized.
Next slide, please. Okay, so now… We all can, you know, navigate to the website. select the target language on the website and have the website available in the target language. But what if somebody cannot navigate, cannot select a language from the drop -down? We also, with Rail Madad, you know, if you go to the Rail Madad’s website, there is a mic button. So you just say out your language. So for example, if you say out Gujarati, the entire website will turn into Gujarati. So that’s the capability of it. Next slide, please. Okay, so this is a very recent use case that we’ve handled. So like you can see here, there is the MSD website. And there is also another domain name, which is Hindi, which is in Hindi.
So what the client wanted was that, you know, once the user selects Hindi as the drop -down, the translation happens, but it also redirects to the… Hindi domain of the website. So that mapping of which language to which domain, that is also something that we have done at our end and you can have URL redirection also. Next slide please. Okay, so what happens, so let me just ask, I hope everybody here understands Hindi, right? What is the translation of home in Hindi? Ghar, Ghray, that’s right, right? But the home tab on the website, if it is getting translated to Ghar, it’s not the correct translation. It should be translated to Mukhya Prash. So these kind of use cases wherein the translation which is being given by the model is correct but you would want a specific different translation for a specific word or phrases that can also be handled through glossary.
So, the way that we have done this is that we have website. So, we have a lot of information in the information in the Just now, after we complete this, next slide please. So, this is the future roadmap for plugin that we have. We have expanded it to 36 languages, 36 more Indian languages. So, you can go to Vashni Pavilion which is right here in this hall only. We have a demo of the plugin which is available in 36 languages. We are also incorporating the 35 international languages. We have done that for certain use cases which are displayed here today at Bhaat Mandapam. Secondly, we will be talking about glossary but the glossary in, you know, traditionally the glossaries were sent to us through emails and there was a process to, you know, process the glossaries and then ingest it.
But now, we are also planning to get it automated wherein, you can just simply upload the glossary from your onboarding portal. and third, we are also adding the accessibility bar to the plugin. So if you want to have text -to -speech also integrated or screen reader also integrated with the plugin that we just showed, that is also something that we are going to do in some time. So technology for dignity, Bajni Translation plugin would help. It is a powerful tool that will empower you to actually disseminate whatever information you want to, to actually reach the last mile. Moving on to the next segment, which is the glossary. So, you know, we all of us here, we would have some application, some website developed for…
the ease of the user. We would want a person, a student who is registering for a form who can actually do we would want the person to do it in their own preferred language. We would want a farmer to listen to the schemes that are available for him in his preferred language. We would want an Angadwari worker to have the schemes that are available for her told to her in her own language. So that is all what we are working for. We are working for inclusivity and we are working for accessibility. Next slide please. So while we do that we also add Bhashni’s layer to all our solutions or websites to have the actual information reach the last mile.
But generally what happens is that you know we get a remark that the translation is not correct. It is wrong. And after doing analysis with most of our customers we realize that the audience, that the users who are trying to actually use our product, they are not looking for accurate translations. They are looking for understanding the content, the intent of the content which is there on the solution on the website that they have created. And this is not the result, you know, for this we don’t have to focus on getting the accuracy of the translation. We actually have to focus on the context of the translation, use case of the translation, domain of the translation.
So when we realize that, we understood the concept of glossary and that’s how glossary was formed. Next slide please. Now you all would be, you know, waiting for, to understand what glossary is all about. So glossary saw… It involves two kinds of use cases. One is the post -translation that I just told you before. That, you know, home being translated to ghar in Hindi is absolutely right. But home being translated to home tab being translated to ghar is probably not correct. So post translation wherein you would want home to appear as Mukhya Prasht on the tab, home tab, that is something that we cater through with glossary. The second use case is like in the example, there is a bridge called Vakil Saab Bridge in Gujarat.
So Vakil Saab Bridge if translated to English would become something like Lawyer Bridge or something. We wouldn’t want that. Vakil Saab Bridge is our coined terminology and we would want it to retain its identity. We would want Vakil Saab Bridge to be written as Vakil Saab Bridge only in English. And this is the use case of transliteration. So these two kind of use cases are solved through glossary. What we do is we create. We create these glossaries with our customers and we ingest it to the customer’s specific API. Next slide, please. So, you know, like I told you the meaning of glossary, all of us here have different glossaries. Like, you know, the science domain glossaries are different.
Gen Z has a different glossary altogether. You know, any region would have a specific kind of a glossary. So all of these glossaries have to be created with us. And, you know, the customers who created those glossaries have got the translation, which are accepted by the end user, which are understood by the end user. Like you can see, Ministry of Panchayati Raj gave us 15 lakh words of the Panchayat. Survey of India has given us 16 lakh words. So if we create glossaries together, we can have the translation barrier completely eliminated. So I will now walk you through certain. Use cases wherein we faced problems with our customers, but they were not. translation issues, they were actually issues that could have been easily resolved through glossary.
So if you can read this sentence here. So this use case, you know, this problem was reported to us by Ministry of Home Affairs where Honourable Home Minister Sir’s profile was not reflecting correctly. So this was the English sentence. Okay. And this was the translation that we were getting. So if anybody can tell me what is the problem here? So because of this full stop, Srimati full stop, SMT full stop, what the model thought was that the sentence has ended here. And that is why the formation of the sentence is entirely incorrect. But the solution was very simple. What we had to do was just add SMT dot to the glossary or just remove the dot from the SMT.
And we could have the correct output. So it’s as simple as that. It’s not the translations problem. It’s the understanding of glossary problem. Next slide, please. So, okay. Can anybody tell me what’s the difference between this and this puzzle? To these two puzzle pieces, what is the difference? Yes. So one of them has a hyphen and the other one does not have a hyphen. So when we received the glossary from MSME, there was a hyphen in between PMS and dashboard. but actually on the website it was displayed without the hyphen. So glossary is that sensitive. If you give me PMS hyphen dashboard, it will only recognize that and translate that. But if there is no hyphen, it will not recognize that and it will not give you the translated output which you have given us in glossary.
So that’s the, and again, here there was a singular and plural problem. So street vendors was mentioned in the glossary sheet that we received. But actually street vendor was mentioned on the website. So if there is a singular and plural difference in the glossary sheet that you are giving to us and what is actually reflected on the website or your solution, it will create a difference. So this is one thing that you can also do through glossary. So, you know, we received. A requirement wherein they wanted, you know, the animal husbandry department wanted that the entire sentence should not be translated. the, you know, abbreviations should be skipped from translation. So if you just give me this sentence and this sentence as glossary pairs in English and Hindi, this can easily be achieved.
Next slide, please. Okay, so in one of the glossaries that we received was authorized officer, so they wanted us to write authorized officer as newt adhikari. But actually, newt adhikari means appointed officer. So this is also something that we have to be careful of. Because, you know, for the end user, so there are two kind of users that we have in this case, the English users and the Hindi users. So the English user would read it as authorized officer, but since we have added glossary and changed it to, newt adhikari in Hindi, the Hindi user would read it, would understand it as appointed officer. So we have to be very careful while drafting the glossaries.
Next slide please. Okay. So if I can just ask what is the full form of BN? Normally what do we consider as the full form of BN? Billion, right? We would not consider BN as battalion. But in BSF’s case, this was a huge problem. So BN for BSF means battalion and not billion. The entire context changes. So for BSF, we have created glossaries for all the abbreviations. So it is always suggested that you know whatever abbreviations that you are displaying on your website or your solution, just give it to us as glossary so that the correct one can be displayed. So okay. So, So can you tell me if PS to Minister being translated as Maanenya Vastra Mantri Ji ki Iji Sacheev, is there a difference or like what would be the problem here?
Fine, let me tell you. So this is also correct, this is also correct. But this is not the actual translation of PS to Minister. If we want to have Maanenya written in the Hindi translation of it, we should always have it in the English version of it also. Glossaries are supposed to be equal. They are supposed to be equally weighted. You cannot expect the model to add or delete words as their own. So basically what we did here was we went back to the customer and said, that if you want to add Maanenya, you can add Maanenya to the text. at the output, please add respected or honourable in the input. Only then it will be balanced out.
Next slide, please. So this is one request from our end only. We receive a lot of glossaries that are redundant in nature for us. By that I mean that, you know, for example, we received employment and skill development as the glossary terminology and the translation that we are getting in Hindi. That was the actual output of the model also. So in this case, if you are giving us the glossary, which is actually the output of the model, you are only creating redundancy. So if you can just avoid that and give us translations, give us the post -translations or transliterations that are not recognized by the model, that would be handy. Next slide please. Next slide.
So in the end I would just like to say that language is not just words, it is identity. Let us prepare India’s languages for the future of AI and let us create glossaries, let us have multilingual AI in, multilingual layer in all your solutions so that actually the end user is benefited, actually there is digital inclusion, accessibility, inclusivity. Thank you. Any question?
Can you hear me? So I was saying like the translation thing which you were showing, is it, I know like this has been sponsored by the government and stuff, so can it be also used for commercial purpose like for private or private? Public entity? Can they also use that in their websites also?
So, you know, we have different kind of collaborations with us. So for that collaboration, there is a different agreement altogether that is being created. If you want to know more about it, just go to the Bhajani Pavilion. We have stakeholders who are handling the startups, the private organizations also, and they can help you there.
And one more thing which I wanted to know. So like you were showing for the websites, it was by default we can choose the default language, right? So can it be also extended? Like let’s say in some use cases, we could have someone who is logging in from Delhi. They would want to see it in Hindi and someone who is coming from Maharashtra. So can it change the default languages? Can it change from the region perspective?
So that’s an interesting use case. From what I’ve understood, you want different regions to have websites opened in different default languages. As per my knowledge, I don’t see a technical challenge to it. But again, we will have to look at the use case at our end and see if this can be deferred. It’s a very use case. This is a very interesting use case. We’ll look at it. Thank you.
Hi. Hi. So we all are aware that we have multilingual languages. And apparently, they have been trained on a lot of words also according to their domain knowledge. So if we have glossaries, how do we ensure that each and every glossary according to the domain is maintained and then trained or fine -tuned?
So glossaries are customized. For example, somebody from Ministry of Home Affairs would not want the glossary of, let’s say, CSI. Right. Right. you know the domains are different the contexts are different so glossaries are ingested only are customized and are ingested to the client itself they are not we have general glossaries also that can be applied to all but since glossary does not have the you know one glossary fits all type of a solution so we customize it for a client and then ingest it on to that client solution itself not to other clients or other environments.
Thanks So the glossaries you have do you have them do you use them to fine tune your models or is it just available as documents to infer while using?
So we do that we try to fine tune the models as well but we there are lot of things that we have to look at it look around while doing that because you know we have to classify them into different domains and then apply fine tuning models for the domain space. So it’s a long process, but we do that. Okay. Thank you. If there are any other questions, I will be available at the Bhashni Pavilion here also. And I would request everybody to please come visit us, explore our solutions, explore our services. And thank you so much for being a lovely audience. Thank you.
“India has 1.4 billion people, but most online content is presented only in English.”
The knowledge base states that India has 1.4 billion people with diverse languages, yet most online content is only available in English, confirming the claim.
“Bhashni (also known as Pashni/Bhajani) is described as a unified multilingual layer for India’s digital ecosystem, not just a feature but core infrastructure.”
Source S8 describes Bhashini as ‘The National Language Translation Mission of India… creating a unified multilingual layer for India’s digital ecosystem… not just providing language as a feature, but as infrastructure,’ matching the report’s description.
“India has 22 scheduled languages that the solution targets.”
Speaker Pradeep Kumar Verma references India’s 22 scheduled languages in the knowledge base, confirming the number targeted by the plugin.
“Roughly 800 million citizens are excluded because they are not fluent in English.”
The knowledge base notes that a large portion of the population is digitally illiterate (S40), providing context about language‑related exclusion but does not specify the 800 million figure.
“The Bhashni plugin is already deployed on more than 500 websites, including state‑level portals.”
S42 mentions case studies such as the Bhasanet portal, indicating that Bhashini‑based solutions are being used on government sites, which adds context to the claim of wide deployment.
The speakers show strong consensus on the need to eliminate linguistic barriers, the wide adoption and technical simplicity of the Bhashni translation plugin, and its ability to function site‑wide without extensive development effort. Additional agreement emerges around advanced customization such as glossaries and potential regional default language settings.
High consensus on core objectives (language inclusion, scalability, ease of integration) which reinforces the viability of multilingual digital infrastructure as a public good. The limited but notable unexpected consensus on regional default language indicates emerging interest in further personalization.
The discussion shows strong consensus on the need to eliminate language barriers and on the technical promise of the Bhashni/Pashni translation plugin. Disagreements are limited to quantitative reporting of deployment scale and the readiness of advanced features such as region‑based default language selection, which require further feasibility work.
Low – most participants align on goals and core solution; the few disagreements are technical or factual rather than ideological, implying smooth collaborative progress toward multilingual digital inclusion.
The discussion was anchored by Swati’s framing of India’s massive linguistic divide, which established a compelling problem statement. Shailendra’s vivid scenario and the farmer anecdote turned abstract statistics into human stories, driving urgency. By positioning language as infrastructure and emphasizing context‑driven translation through glossaries, Swati shifted the conversation from a simple plugin demo to a strategic, ecosystem‑level solution. Audience questions about commercial applicability and region‑based defaults introduced new dimensions—market viability and personalization—prompting the speakers to acknowledge future work and broader adoption pathways. Collectively, these pivotal comments steered the dialogue from problem identification to technical depth, strategic vision, and practical expansion, shaping a nuanced, forward‑looking conversation.
Disclaimer: This is not an official session record. DiploAI generates these resources from audiovisual recordings, and they are presented as-is, including potential errors. Due to logistical challenges, such as discrepancies in audio/video or transcripts, names may be misspelled. We strive for accuracy to the best of our ability.
