Run, manage, and orchestrate AI models on your Mac. Multi-backend inference, OpenAI-compatible API, 20+ built-in tools, and seamless cloud integration — all in one beautiful native app.
From model management to production-grade serving, Ai Keeper gives you full control over your AI stack.
Real-time streaming conversations with persistent history, message bookmarking, regeneration, and export. Supports images, audio, file uploads, and multimodal models.
MultimodalAutonomous multi-step task execution with goal decomposition, automatic tool selection, and safety approval gates. Let AI plan and execute complex workflows.
AutonomousRun multiple models simultaneously on different ports. Independent start/stop controls, auto-restart on failure, GPU memory management, and per-instance configuration.
Multi-ModelBuilt-in proxy server that exposes all your local models through a standard OpenAI-compatible API. Dynamic routing, model aliasing, rate limiting, and API key auth.
APIAutomatic fact extraction from conversations organized by category. Your AI remembers your identity, preferences, projects, and instructions across sessions.
Context-AwareIngest PDFs, text, and markdown files. Automatic chunking, vector embedding, and similarity search with a built-in SQLite vector store for grounded responses.
RetrievalLive dashboards with GPU memory, CPU usage, token throughput, request latency, and health metrics. Rolling performance charts with anomaly detection.
LiveBrowse and download models directly from HuggingFace. Filter by format (MLX, GGUF), size, popularity, and recency. One-click download with progress tracking.
HuggingFaceSide-by-side model comparison with identical prompts. Throughput benchmarking, time-to-first-token measurement, and performance metrics with CSV export.
AnalysisBridge your AI to Telegram. Chat with your local models from anywhere, with voice message transcription, file attachments, and full conversation context.
RemoteExtend with Model Context Protocol servers (stdio & SSE) and a native plugin system. Create tool, preprocessor, and postprocessor plugins with Python.
ExtensibleSpeak to your AI with Whisper-powered voice transcription. Hear responses with text-to-speech via system voices, OpenAI API, or custom TTS servers.
AudioChoose the best backend for your hardware and model. Switch between engines without leaving the app.
Give your models superpowers. From web browsing to code execution, every tool runs locally on your machine.
Seamlessly switch between local models and 13+ cloud providers. One unified interface, one API.
Interactive API testing with templates, custom headers, and instant response inspection.
Capture and debug every API request with headers, bodies, timing, and error tracking.
Remote API for headless operation with HMAC auth, rate limiting, and a built-in web dashboard.
Flash attention, KV cache quantization, speculative decoding, LoRA adapters, and tensor parallelism.
Extended thinking support for reasoning models with visible chain-of-thought and configurable budgets.
Export and import your full configuration, conversations, and memories with one-click backup.
Download Ai Keeper and start running models locally on your Mac today. Free and private.
Download for macOSRequires macOS 14+ • Apple Silicon