
AI That Runs on Your Phone, Not in Someone's Cloud
by Deep Parmar
CTO, Sunbots & Xwits

Part 6 of the series "AI, Without the Hype". Start at Part 1.
On-device AI — also called edge AI — means the AI model runs directly on your own phone, laptop, or web browser instead of on a company's servers far away. Because the work happens locally, three things follow: it can work without an internet connection, your data never leaves your device, and there is no per-question bill, because no remote server is doing the work. The model lives with you, not in someone else's cloud.
That is the short version. Here is why it suddenly matters, and where it actually helps.
I build both kinds of AI — cloud and on-device — and for a lot of real problems, especially in India, keeping it local is the better answer. Let me show you why.
How it recently became possible
For years, useful AI was simply too big to run anywhere but a data centre. The good models needed racks of expensive hardware. That has changed, and three quiet shifts did it.
- Models got smaller and smarter. Engineers learned to shrink models — through techniques like distillation and quantisation — so that a model small enough to fit on a phone can now do work that used to need a server. There is a whole class of capable "small language models" now.
- Browsers learned to use your graphics chip. A technology called WebGPU lets an ordinary web page tap into your device's GPU — the same chip that renders games. That means a website can now run an AI model on your hardware, with no installation.
- Phones and laptops got dedicated AI chips. Modern devices ship with hardware built specifically to run AI efficiently, so the model runs fast without draining the battery.
Put together, the model came down to meet the device, and the device came up to meet the model. The gap closed.
What on-device AI is good at, and what still needs the cloud
On-device AI is not magic, and it is not a full replacement for the big cloud models. It is the right tool for a specific shape of problem.
It is genuinely good at:
- Private tasks — anything involving personal or confidential data you would rather not send anywhere.
- Offline and unreliable-network situations — a moving train, a basement, a village with patchy signal.
- Fast, repeated, simple tasks — transcription, translation, search over your own files, smart suggestions.
- Anything you want to be free to run — no API bill, no usage meter ticking.
The cloud still wins when you need:
- The absolute frontier of capability — the most powerful reasoning still lives on big servers. As of mid-2026, there is no single "best" model; the strongest systems trade the top spot, and they run in data centres.
- Very large context — feeding in an entire book or huge codebase.
- Heavy, occasional jobs — work too big for a phone but not worth special hardware.
The honest framing is a partnership, not a winner. I go deeper on the trade-offs in edge AI versus cloud AI if you want the full comparison.
Real examples
This is not theory. I ship it.
Dhiya runs in the browser. Dhiya is a tool I built that lets a website answer questions from a set of documents — entirely inside your browser. It uses WebGPU to run the model on your machine and stores everything locally, so there is no API key, no server, and no data leaving your device. You open a web page and the AI is just there, working on your hardware. If you want the no-cost, no-server idea in full, I wrote it up in how Dhiya does no-cost AI on the web.
SmartON runs on edge devices. SmartON is assistive AI used by more than 17,000 blind and low-vision users. For a tool someone relies on to understand the world around them, "please wait, connecting to the server" is not acceptable. Running on the device means it responds quickly and keeps working when the network does not — which, for the person depending on it, is the entire point.
The pattern in both: put the intelligence where the person is, not in a distant building they have to reach over a flaky connection.
Why this matters especially for India
For India, on-device AI is not a niche preference. It lines up almost perfectly with how the country actually uses technology.
- Connectivity is uneven. Mobile data is widespread, but it is not uniform or always reliable — on trains, in rural areas, inside buildings. An AI that needs a perfect connection fails exactly where many people live and work. One that runs locally does not.
- Cost matters at scale. A free, on-device tool can serve millions without a per-query bill stacking up. That changes what is affordable to build for a price-sensitive market. None of this depends on the country's growing AI infrastructure — though that is expanding fast, with the IndiaAI Mission bringing tens of thousands of GPUs online for the heavier work that does need the cloud.
- Privacy is real here too. With India's data protection law now in force, keeping personal data on the user's own device is the cleanest way to avoid a whole category of compliance risk.
Edge AI quietly fits the Indian reality: variable networks, tight budgets, and a billion people who deserve tools that work where they are.
That last point — tools that work for everyone — leads to the question people ask me most. If AI is this capable and this available, what does it mean for my work?
Next up — Part 7: "Will AI Take My Job?" — An Honest Answer for India. Read the whole series at deepap.dev.
Frequently Asked Questions
Quick answers about this topic — also indexed by AI search engines via FAQPage schema.
Share this article: