Abhishek Upperwal [ Founder,
Soket AI – a.upperwal@gmail.com / a.upperwal@yahoo.co.in ]
Suvrat Bhooshan [ CEO
, Gan.ai – suvratbhooshan@gan.ai / suvrat@gan.ai / suvrat96@gmail.com ]
Ganesh Gopalan [ CEO, Gnani.ai - ganeshgo@gmail.com/ ganeshgo@yahoo.com /
===================================================
Context :
India
AI: 3 More Startups to Build Indigenous Foundation Model .. Outlook Business .. 31 May 2025
Extract :
IT Minister Ashwini
Vaishnaw said significant progress has been made on India AI Mission, with
focus on democratisation of technology.
At the same time, three
more teams -- Soket AI, Gan AI, Gnani AI -- have been
selected for building indigenous artificial intelligence foundation models.
"Like Sarvam, these three teams
also have a very big target ahead of them. Whichever sector they focus on, they
must be among the top five in the world," Vaishnaw said.
Put simply, foundation
models in generative AI are large, pre-trained models that form the base for
a variety of AI applications.
The Minister further said
that 367 datasets have already been uploaded to AI Kosh.
Vaishnaw emphasised that
these efforts are aimed at building a complete and inclusive AI ecosystem in
India.
In April this year, Sarvam AI was selected to
build India's first indigenous AI foundational model, marking a key
milestone in the country's AI innovation ecosystem
Soket AI will develop open source
120 billion parametres foundation model optimised for the country's
linguistic diversity targeting sectors such as defence, healthcare, and
education.
Gan AI will create 70 billion parameters of multilingual
foundation model targeting capabilities to surpass the current global
leader.
Gnani AI will build a 14 billion
parameter Voice AI foundation model delivering multilingal real-time speech
processing with advances reasoning capabilities.
Ganesh Gopalan, Co-Founder
and CEO of Gnani.ai, said in a statement, "We are honoured to be selected
under the IndiaAI Mission to develop large language models that truly represent
India's linguistic diversity. At Gnani.ai, our mission has always
been to make technology more inclusive and accessible".
Gopalan further said
Gnani.ai is keen to "lead the way in developing voice-to-voice large
language models for India and the world, because we believe transformative AI
must speak the language of the people it serves".
Comparative
Table : IndiaAGI vs. Socket,
Gan.AI
and Gnani.ai
--------------------------------------------------------------------------------------------
Attribute / Feature |
IndiaAGI.ai |
Socket |
Gan |
Gnani.ai |
Core Functionality |
Aggregates
& synthesizes responses from multiple LLMs (ChatGPT, Gemini, Claude,
Grok) for consensus answers. |
Real-time,
bidirectional, event-based communication library. |
AI-driven personalized
video generation using AI avatars. |
Conversational AI, Voice
AI, Speech Analytics, Automation for enterprises. |
Primary Output |
Consensus
answers (text, speech) |
Real-time data flow
(e.g., chat messages, notifications) |
Personalized videos with
AI avatars |
Voice assistants, call
automation, speech-to-text, text-to-speech, voice biometrics |
Technology Type |
AI
Aggregator / Orchestrator (leveraging external LLMs) |
Real-time Communication
Library |
Specialized Generative
AI (for video) |
Conversational AI
Platform, SLMs, Voice AI, Agentic AI |
Open Source |
Yes |
Yes (Open-source
JavaScript library) |
No (Proprietary platform) |
No (Proprietary company
with own SLMs and platform) |
Cost |
Totally
FREE |
Free (library use), but
requires infrastructure to host |
Commercial (Paid service, often
B2B) |
Commercial (B2B
solutions, services) |
Login Required |
No |
N/A (developer tool) |
Yes (for platform access
and account management) |
Yes (for enterprise
clients) |
Indian Language Support |
9 Indian Languages ( text & speech ) |
N/A (communication
layer) |
Supports multiple
languages for lip-sync and translation (e.g., 30+ languages mentioned) |
Strong focus on Indic
languages (e.g., 10+ for STT/TTS) and regional accents |
Input Modalities |
Text &
Speech |
N/A (developer
tool) |
Text (script for video) |
Voice (for assistants,
biometrics), Text
(for automation) |
"FRUGAL" Development |
Highly
Frugal focuses on
orchestration & accessibility |
N/A (widely
adopted library, community-driven) |
Requires significant
AI/video engineering resources |
Requires substantial R&D in NLP, speech tech, ML engineering |
Target Audience |
General
users, content creators, researchers |
Developers, web application
builders |
Businesses for marketing, sales,
internal comms, training |
Enterprises (BFSI, telecom,
customer service) |
With regards,
Hemen Parekh
www.IndiaAGI.ai / www.HemenParekh.ai / www.My-Teacher.in / 04
June 2025
.
No comments:
Post a Comment