On-device · WebGPU · agentic kernel optimization

Gemma 4 in your browser.
Kernels written by Fable 5.

Gemma 4 E2B (QAT Mobile) — a powerful open-source model — runs fully on-device with WebGPU. Weights cache locally after the first load, and nothing you type ever leaves your machine.

2.3BEffective params
128KContext window
~250tok/s · M4 Max
100%On-device
Model card
WebGPU kernels 100% written & optimized by Fable 5 Tuned for Apple M4 Max · experimental
Chat below
Gemma 4 · E2B
Not loaded

What's on your mind today?

Model runs entirely on your device.