
Browser AI in 2026: What's Next for Dhiya NPM and Client-Side Intelligence
by Deep Parmar
CTO, Sunbots & Xwits

When I launched Dhiya NPM in October 2025, I knew there was an underserved need for client-side RAG — building AI assistants that run entirely in the browser without servers, API keys, or recurring infrastructure cost. What I did not know was how many developers had been waiting for exactly this. The npm download curve was steeper than I expected, the GitHub issues came faster than I could address them, and the use cases developers brought to Dhiya were more diverse than the ones I built it for. Eight months in, here is the honest picture of where browser AI stands and where Dhiya NPM is going.
What Surprised Us Eight Months After Launch
The primary use case I designed for — privacy-first enterprise knowledge assistants that run client-side to avoid data leaving the browser — is real and has traction. But the use case that produced the most creative usage was offline-capable AI: applications that need AI functionality without guaranteed internet access. Field workers, users in areas with unreliable connectivity, and applications in regulated environments where network calls are restricted all found Dhiya NPM useful in ways I had not specifically designed for.
The second surprise was the geographic distribution of downloads. A significant fraction of Dhiya NPM usage comes from developers in regions with high data costs or limited cloud provider presence — parts of Africa, Southeast Asia, and South Asia where running everything in the browser is not just a privacy preference but an economic necessity. Browser AI turns out to be an infrastructure equaliser in ways that go beyond the privacy argument.
What the WebGPU Ecosystem Looks Like Now
WebGPU browser support has expanded substantially since Dhiya NPM launched. The coverage across Chrome, Edge, Firefox, and Safari has improved, and the number of models available via Transformers.js and ONNX Runtime Web has grown significantly. Models that required 8GB of VRAM to run acceptably in mid-2025 now run on integrated graphics with acceptable latency via improved quantisation and model optimisation.
The remaining friction: Safari's WebGPU support is still inconsistent, and mobile WebGPU performance on mid-range Android devices (the most common device in India) is not yet good enough for real-time inference in the way that desktop Chrome is. The mobile gap is closing but not closed. Dhiya NPM's fallback to CPU inference handles this, at the cost of latency.
The Dhiya NPM 2026 Roadmap
The three capabilities we are building next:
- Streaming inference — Token-by-token output streaming, currently missing from Dhiya NPM. Users who expect ChatGPT-style streaming responses find the "wait for complete response" pattern jarring. Streaming changes the UX quality significantly and is the highest-priority addition.
- Multimodal input — Accepting images alongside text queries, powered by small multimodal models that run in the browser. The primary use case is document understanding: upload a photo of a form, receipt, or chart and ask questions about it, entirely client-side.
- IndexedDB compression — The current vector store implementation stores embeddings as raw Float32 arrays. Compressed representation will reduce IndexedDB storage requirements by 60-70%, making large knowledge bases practical in the browser without hitting storage quotas.
Why Browser AI Matters Beyond Cost
The argument for browser AI that I make most often is the privacy argument: data that never leaves the browser cannot be breached, subpoenaed, or misused. But eight months of real usage has added a second argument I now make equally often: the zero-infrastructure model changes what is possible for solo developers and small teams. A developer in Tier-2 India with a laptop, a GitHub account, and an npm package can now deploy a functional AI assistant to any user in the world, with no server costs, no ongoing infrastructure maintenance, and no API key management. That is a genuine democratisation of AI capability, and it is one of the things I am most proud Dhiya NPM contributes to.
Frequently Asked Questions
Quick answers about this topic — also indexed by AI search engines via FAQPage schema.
Share this article:
