On-device AI Android Open source

Your assistant, on your phone

OfflineMate runs language models and memory locally—so your everyday chat, notes, and tools stay private. Network use is explicit: model downloads and optional web search when you allow it.

Not available on iOS yet—the shipping build targets Android only.

View on GitHub Privacy

Capabilities

What it does

Inference and personal context stay on your device. You choose a model tier at setup (one-time download per tier); if a model fails to load, the app can fall back to a lighter tier.

Tiered on-device chat

Lite, Standard, and Full use progressively larger quantized models so you can balance speed and quality. Replies stream in, and optional “thinking” text helps you follow reasoning before the final answer.

Example “Explain recursion like I’m new to code” (works offline after the model is downloaded).

Semantic memory (RAG)

On Standard and Full, questions can trigger local embedding + vector search over your indexed notes (sqlite-vec). Lite skips retrieval for speed.

Example “What did I save about the dentist?”

Device tools

A validated planner can run allow-listed tools: calendar, reminders, alarms, contacts, notes—only with the permissions you grant.

Examples “Remind me in 20 minutes to stretch”; “What’s on my calendar tomorrow?”

Optional web search

When enabled in settings, prompts that match web-style phrases—for example search for, look up, or what is the current …—can trigger a DuckDuckGo-backed fetch—not on every message.

Example “Search for population of Tokyo”

Voice in & out

Push-to-talk transcription (Whisper-based path when configured), then the same pipelines as text. Replies can use system text-to-speech.

Example “Add a note to pick up bread” after you release the mic button.

Fast intent routing

Keyword routing steers reminders, memory lookup, or general chat without an extra router model—saving latency and battery on small devices.

Example “Remind me…” is steered toward reminders early.

Multi-chat with private retention

Keep separate conversations, reopen any previous thread, and delete chats when you want. Chat retention is optional in settings and defaults to off for privacy-first behavior.

Examples “Open chats”; “Delete this conversation”; “Keep chat history on device.”

Preview

See OfflineMate in action

Real screenshots from the Android app. Multi-chat management, private on-device chat flow, and settings are shown exactly as they appear in the product.

OfflineMate chat screen with on-device conversation. — On-device chat

OfflineMate settings screen with Lite, Standard, and Full model tier selection. — Settings and model tier

OfflineMate standard model options dialog with downloadable variants such as Gemma, Qwen, and SmolLM2. — Standard model options

OfflineMate chats screen showing multiple saved conversations and delete actions. — Multiple chats and delete controls

OfflineMate tools screen listing available local and web tools. — Built-in tool registry

Trust

Built in the open

Documentation, privacy policy, and terms live in the repository so you can verify behavior, permissions, and data flows—not just marketing copy.

Read the full privacy policy →