Local Models

Run AI models directly on your device. No internet required, complete privacy.

Platform Note: Local models are only available on the Android app. The web version does not support on-device inference.

Supported Formats

.pte

ExecuTorch

Meta's optimized format for mobile inference. Best performance on newer devices with NPU/GPU acceleration.

  • Optimized for mobile hardware
  • Smaller file sizes
  • Hardware acceleration support
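For developers who want to see what this looks like in code, here is a minimal sketch of loading and running a .pte file with the ExecuTorch Android bindings (org.pytorch.executorch). The helper function is illustrative, not the app's actual implementation, and the exact class/method surface can vary between ExecuTorch releases:

```kotlin
import org.pytorch.executorch.EValue
import org.pytorch.executorch.Module
import org.pytorch.executorch.Tensor

// Illustrative helper: load an exported program and run its "forward" method.
// ExecuTorch memory-maps the .pte file, so the weights are not copied into
// the Java heap. API names follow recent ExecuTorch releases.
fun runPteModel(modelPath: String, inputIds: LongArray): EValue {
    val module = Module.load(modelPath)

    // Wrap the token IDs as a 1 x N tensor and invoke the default entry point.
    val input = Tensor.fromBlob(inputIds, longArrayOf(1, inputIds.size.toLong()))
    val outputs = module.forward(EValue.from(input))
    return outputs[0]
}
```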
.gguf

Llama.cpp

Popular format with wide model availability. Excellent compatibility and community support.

  • Wide model selection
  • Quantization options (Q4, Q5, Q8)
  • Active community support
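If you are handling GGUF files yourself, they are easy to sanity-check before import: per the GGUF spec, every file begins with the ASCII magic "GGUF" followed by a little-endian uint32 format version. A small sketch (the helper name is illustrative, and newer format versions may appear over time):

```kotlin
import java.io.DataInputStream
import java.io.File
import java.nio.ByteBuffer
import java.nio.ByteOrder

// Check the 8-byte GGUF header: 4 magic bytes "GGUF", then a uint32 version.
fun isLikelyGguf(file: File): Boolean {
    if (file.length() < 8) return false
    val header = ByteArray(8)
    DataInputStream(file.inputStream()).use { it.readFully(header) }
    val buf = ByteBuffer.wrap(header).order(ByteOrder.LITTLE_ENDIAN)
    val magic = ByteArray(4).also { buf.get(it) }
    val version = buf.int  // versions 1-3 exist as of this writing
    return magic.contentEquals("GGUF".toByteArray(Charsets.US_ASCII)) && version in 1..3
}
```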

Getting Models

Download from HuggingFace

Browse and download models directly within the app; a sketch of the underlying download follows the steps below.

  1. Go to Settings → Models
  2. Tap "Download Model"
  3. Search for a model on HuggingFace
  4. Select the model variant (size/quantization)
  5. Wait for download to complete
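Public model files on HuggingFace are served from stable URLs of the form https://huggingface.co/<repo>/resolve/main/<file>. The sketch below shows a bare-bones version of such a download using only the JDK; the repo and file names in the usage comment are examples, and the app's actual downloader (auth, resume, progress reporting) may work differently:

```kotlin
import java.io.File
import java.net.HttpURLConnection
import java.net.URL

// Stream a file from a HuggingFace "resolve" URL straight to disk.
fun downloadFromHuggingFace(repoId: String, fileName: String, dest: File) {
    val url = URL("https://huggingface.co/$repoId/resolve/main/$fileName")
    val conn = url.openConnection() as HttpURLConnection
    conn.instanceFollowRedirects = true  // resolve URLs redirect to a CDN
    dest.parentFile?.mkdirs()
    conn.inputStream.use { input ->
        dest.outputStream().use { output -> input.copyTo(output) }
    }
    conn.disconnect()
}

// Example usage (repo and file names are illustrative):
// downloadFromHuggingFace(
//     "bartowski/Llama-3.2-1B-Instruct-GGUF",
//     "Llama-3.2-1B-Instruct-Q4_K_M.gguf",
//     File(context.filesDir, "models/llama-3.2-1b-q4.gguf"),
// )
```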

Import Local Files

Import models you've already downloaded; a sketch of the copy step follows the steps below.

  1. Go to Settings → Models
  2. Tap "Import Model"
  3. Select the .pte or .gguf file
  4. Optionally add tokenizer files
  5. Give the model a name and save
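Behind the picker, Android hands the app a content Uri through the Storage Access Framework, and the file has to be streamed into app-private storage so the inference engine can open it by path. A rough sketch of that copy step (function and directory names are illustrative):

```kotlin
import android.content.Context
import android.net.Uri
import java.io.File

// Copy a picked document into app-private storage and return its File handle.
fun importModelFile(context: Context, sourceUri: Uri, modelName: String): File {
    val target = File(context.filesDir, "models/$modelName")
    target.parentFile?.mkdirs()
    val input = context.contentResolver.openInputStream(sourceUri)
        ?: error("Could not open $sourceUri")
    input.use { src ->
        target.outputStream().use { dst -> src.copyTo(dst) }
    }
    return target
}
```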

Recommended Models

Model         Size  Best For
------------  ----  ------------------------------
Llama 3.2 1B  ~1GB  Quick responses, older devices
Llama 3.2 3B  ~2GB  Balance of speed and quality
Phi-3 Mini    ~2GB  Reasoning tasks
Qwen2.5 3B    ~2GB  Multilingual support
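A practical way to read the sizes above: a model needs at least its file size in free RAM, plus headroom for the KV cache and runtime buffers. A rough pre-flight check on Android might look like this (the 1.5x multiplier is an assumption, not a measured constant):

```kotlin
import android.app.ActivityManager
import android.content.Context

// Heuristic: require roughly 1.5x the model's file size in available RAM.
fun modelLikelyFits(context: Context, modelFileBytes: Long): Boolean {
    val am = context.getSystemService(Context.ACTIVITY_SERVICE) as ActivityManager
    val info = ActivityManager.MemoryInfo()
    am.getMemoryInfo(info)
    return info.availMem > (modelFileBytes * 3) / 2
}
```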

Using Local Models

  1. Once a model is downloaded/imported, it appears in the model picker
  2. Start a new conversation and select the local model
  3. The model will load automatically (first load may take a moment; see the caching sketch below)
  4. Chat as normal - all processing happens on your device
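The pause on first load (step 3) is the one-time cost of reading the weights from storage; afterwards the model stays in memory for the session. A sketch of that caching pattern, reusing the hypothetical ExecuTorch loader from the sketch above:

```kotlin
import kotlinx.coroutines.Dispatchers
import kotlinx.coroutines.withContext
import org.pytorch.executorch.Module

// Load lazily and off the main thread, then reuse the loaded Module so
// later messages skip the expensive read from storage.
object ModelCache {
    @Volatile private var cached: Module? = null

    suspend fun get(modelPath: String): Module =
        cached ?: withContext(Dispatchers.IO) {
            Module.load(modelPath).also { cached = it }
        }
}
```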