
Two years ago, running an LLM on-device was a science project. In 2026 it is a deploy target. Both Google and Apple ship first-party on-device models with public APIs, typical device RAM finally accommodates 2-4B-parameter models, and power draw is defensible for features you run a few times a minute.

What we ship on-device today

  • Smart reply drafting in chat apps · 80-150ms, no spinner.
  • Receipt / invoice field extraction · fully offline, GDPR-trivial.
  • Photo caption + search index on-device.
  • Meeting transcription + bullet summary (whisper.cpp + a local summariser).

Gemini Nano · where it wins

  • Available on a much wider device matrix · Android 15+ with 8GB+ RAM.
  • AICore handles model updates without a Play Store release.
  • Summarisation + rewriting APIs are stable and predictable.
  • Stronger non-English output than Apple Intelligence, and it runs on mid-range hardware.

Apple Intelligence · where it wins

  • A17 Pro / M-series only · narrower matrix, but the models are meaningfully better.
  • The Writing Tools API is a drop-in replacement for a cloud call · zero glue code.
  • Private Cloud Compute fallback is automatic and audit-friendly.
  • Foundation-models API surface is more coherent · one SDK, not three.

Hungarian-language reality check

Both models underperform on Hungarian vs English. In our evals Gemini Nano gives ~85% acceptable-output rate on Hungarian summarisation, Apple Intelligence ~80%. For comparison, Claude 3.7 Haiku is ~97%. For Hungarian-heavy features we keep a cloud fallback for now.
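The gate behind those numbers is simple arithmetic: acceptable-output rate is the fraction of eval outputs a human rater approved, and a language only ships on-device if it clears a bar. A minimal sketch; the function names and the 95% threshold are illustrative assumptions, not values from our harness:

```kotlin
// Illustrative eval gate: a feature/language pair goes on-device only if the
// human-rated acceptable-output rate clears a threshold.
// NOTE: `acceptableRate`, `shipOnDevice`, and the 0.95 bar are assumptions
// for illustration, not part of any SDK or the eval described above.

fun acceptableRate(ratings: List<Boolean>): Double =
    if (ratings.isEmpty()) 0.0
    else ratings.count { it }.toDouble() / ratings.size

fun shipOnDevice(ratings: List<Boolean>, threshold: Double = 0.95): Boolean =
    acceptableRate(ratings) >= threshold
```

With an assumed 95% bar, an ~85% Hungarian rate keeps the cloud fallback and an ~97% rate would clear it.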

When we still call the cloud

  1. Any agentic flow with tool calls · on-device tool-use is fragile.
  2. Long-context tasks (> 8k tokens effective) · on-device context windows are still small.
  3. Safety-critical outputs · medical, legal, financial advice · we route to a policy-gated cloud call with audit logs.
  4. Multilingual features where non-English quality matters for conversion.
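The four rules above collapse into a single routing decision per request. A minimal sketch; `InferenceRequest`, `Route`, `routeRequest`, and the 8k-token / English-only thresholds are all illustrative names and assumptions, not any vendor's API:

```kotlin
// Hypothetical routing policy mirroring the four cloud-fallback rules.
// All names and thresholds here are illustrative assumptions.

data class InferenceRequest(
    val estimatedTokens: Int,
    val language: String,          // BCP-47 tag, e.g. "en", "hu"
    val needsToolCalls: Boolean,
    val safetyCritical: Boolean,   // medical / legal / financial
)

enum class Route { ON_DEVICE, CLOUD, CLOUD_POLICY_GATED }

fun routeRequest(req: InferenceRequest): Route = when {
    req.safetyCritical -> Route.CLOUD_POLICY_GATED  // rule 3: audit-logged path
    req.needsToolCalls -> Route.CLOUD               // rule 1: on-device tool-use is fragile
    req.estimatedTokens > 8_000 -> Route.CLOUD      // rule 2: small context windows
    req.language != "en" -> Route.CLOUD             // rule 4: non-English quality
    else -> Route.ON_DEVICE
}
```

Keeping the policy in one pure function makes it trivial to unit-test and to tighten per-platform as on-device quality improves.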

Always design the UI for a cloud fallback from day one. The right on-device feature feels instant when the model is present and works anyway when it is not.
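The day-one fallback shape can be as small as a try/catch around the on-device call. A sketch under stated assumptions: both function parameters stand in for real SDK calls (which would be async in production), and it assumes the on-device path throws when the model is absent or inference fails:

```kotlin
// Day-one fallback pattern: try the on-device model first, fall back to a
// cloud call on any failure (model not downloaded, unsupported device,
// inference error). The two function parameters are illustrative stand-ins
// for real SDK calls, not a real API.

fun summariseWithFallback(
    text: String,
    onDevice: (String) -> String,   // e.g. a Gemini Nano or Writing Tools call
    cloud: (String) -> String,      // the existing cloud endpoint
): String = try {
    onDevice(text)
} catch (e: Exception) {
    // The feature still works without the model; it just shows a spinner.
    cloud(text)
}
```

Because the UI only ever sees one function, the "instant when present, works anyway when not" behaviour comes for free.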

By

Dezso Mezo

Founder, DField Solutions

I've shipped production products from fintech to creator-tooling · for startups and enterprises, from Budapest to San Francisco.
