India is developing an AI-enabled public data ecosystem to improve governance and data usability. Central to this transformation is the integration of the Model Context Protocol (MCP) and semantic search into the e-Sankhyiki portal, allowing users to query 136 million statistical records using natural language.
This evolution extends to sectoral applications such as BharatGen, India's first sovereign multilingual LLM, and the Bharat-VISTAAR system for agricultural advisory. To ensure ethical deployment, the government launched BODH in February 2026 to benchmark health AI models, alongside Aadhaar Vision 2032, which integrates blockchain and quantum computing to secure the identities of over a billion citizens.
Key AI Initiatives in the Statistical Ecosystem
e-Sankhyiki & MCP: Introduced a beta MCP server in February 2026, enabling AI agents to interact directly with 21 statistical products without downloading large files.
StatsDoc AI Assistant: An intelligent search tool that allows field investigators to query survey manuals and reports (PDFs/images) using natural language.
Legacy Data Extraction: An AI-based tool specifically designed to extract and process NSO India’s historical data from non-machine-readable formats like merged-cell Excels and images.
NIC Code Finder: A NLP-based tool that suggests the three most relevant National Industrial Classification codes from text queries, reducing manual coding effort.
NDAP 2.0: A next-generation analytics platform providing cross-sectoral visualization and AI-based responses to complex user queries across 52 ministries.
Data Innovation Lab (DIL): Functions as a "statistical sandbox" to test emerging technologies; has already developed 12 new AI use cases for official statistics.
What is the "Model Context Protocol (MCP)"? The Model Context Protocol (MCP) is an open standard that allows AI-based tools and applications to interact directly with structured datasets. In the e-Sankhyiki portal, it plays a role in enabling researchers and businesses to connect their own AI agents to official Indian statistics for automated reporting and real-time analysis. This is supported by the goal of reducing the time spent on manual data retrieval, as users no longer need to download and process massive CSV files. By providing a unified interface for multiple datasets, MCP reflects growth in India’s "Agentic AI" readiness, where automated systems can autonomously fetch and interpret context-rich data for governance.
Sectoral Use Cases: AI for Public Welfare
Healthcare (BODH & SAHI): The BODH platform assessments evaluate AI models for bias and robustness using anonymized health data, while the SAHI framework sets safety and ethical standards for AI in clinical settings.
Agriculture (Bharat-VISTAAR & Kisan e-Mitra): Bharat-VISTAAR integrates AgriStack and ICAR resources for multilingual farm support, while Kisan e-Mitra has already responded to 93 lakh queries in 11 regional languages.
Digital Identity (UIDAI): The "Invisible Shield" initiative uses AI-based biometric deduplication across face, iris, and fingerprint modalities to secure Aadhaar enrollments.
Meteorology: The Ministry of Earth Sciences uses deep learning and the Advanced Dvorak Technique for cyclone intensity estimation and lightning alerts.
National Pest Surveillance (NPSS): Uses image-based AI analysis to detect pest attacks across 66 crops, supporting 10,000 extension workers nationwide.
Policy Relevance: Lessons for Data-Centric Governance
Scaling Multilingual Accessibility: The launch of BharatGen (supporting 22 languages) reflects growth in the government's ability to provide digital services that are inclusive of India's diverse linguistic landscape.
Internalising Data Quality Testing: The BODH platform plays a role in resolving the "AI Quality Testing Trilemma" by balancing reliability, openness, and data coverage for healthcare developers.
Bypassing Technical Barriers for MSMEs: The e-Sankhyiki AI Chatbot is supported by the need to allow small businesses to navigate complex reports through simple conversational queries.
Supporting Future-Ready Identity: Aadhaar Vision 2032 contributes to national security by exploring quantum-safe encryption and blockchain to protect the world's largest biometric database.
Relevant Question for Policy Stakeholders: How will MoSPI mechanically ensure that the MCP server maintains data integrity when external AI agents perform autonomous cross-sectoral analysis?
Follow the Full Coverage Here: AI-Driven Transformation of India’s Statistical and Data Ecosystem


