Local LLMs
Self-hosted and on-device model workflows for privacy and predictable usage costs.
Browse Local LLMs tools filtered by practical fit and workflow needs.
54 matching tools.
Tools in this category
AUTOMATIC1111
Feature-rich Stable Diffusion WebUI with extensive model, extension, and parameter control.
- Free
- image-generation
- stable-diffusion
- local-inference
Best for: Advanced local Stable Diffusion workflows
CogView 4
THUDM text-to-image model family for high-quality generation in open research and local workflows.
- Free
- image-generation
- text-to-image
- open-weights
Best for: Developer workflows, Faceless content production
ComfyUI TTS
Node-based text-to-speech and voice workflow stack inside ComfyUI using custom audio nodes.
- Free
- text-to-speech
- voiceover
- narration
Best for: Local custom voiceover pipelines, Experimental multi-model TTS workflows
Command R+
Large instruction-tuned model oriented to advanced assistant and retrieval-heavy workflows.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Advanced local assistant deployments, Complex retrieval and planning workflows
Coqui TTS
Open-source toolkit for local text-to-speech and voice cloning workflows.
- Free
- text-to-speech
- voiceover
- local-inference
Best for: Advanced local text-to-speech pipelines
DeepSeek-R1
Reasoning-focused open-weight family with MIT core licensing and smaller distilled options.
- Free
- local-inference
- open-weights
- mit
Best for: Reasoning-heavy workflows on distilled checkpoints, Local experimentation with open model pipelines
DeepSeek-VL2
Mixture-of-experts local vision-language family for OCR, documents, charts, and grounded multimodal reasoning.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Private visual document analysis, Multimodal document understanding
FLUX
FLUX family for quality-first generation, fast local variants, and modern in-context image editing workflows.
- Free
- image-generation
- text-to-image
- image-editing
Best for: Thumbnail and visual concept generation, Fast style exploration for creator content
Fooocus
Beginner-friendly local Stable Diffusion UI focused on high-quality images with minimal setup.
- Free
- image-generation
- stable-diffusion
- local-inference
Best for: Fast local image generation with minimal setup
Forge
Performance-focused Stable Diffusion WebUI fork designed for practical local generation speed and compatibility.
- Free
- image-generation
- stable-diffusion
- local-inference
Best for: Faster local Stable Diffusion workflows in a linear WebUI
Gemma 2
Compact-to-mid-size model family that is efficient for local chat, summarization, and lightweight coding.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Efficient local chat workloads, Summarization and long-form drafting
Gemma 3
Portable open-weight family with long context and multimodal options under custom terms.
- Free
- local-inference
- open-weights
- on-device
Best for: Local assistants with manageable compliance processes, Multimodal summarization and extraction
GLM-4.5 Air
Open-weight GLM model variant for local reasoning, coding, and automation workflows.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Private local LLM workflows, Reasoning and coding support in automation tasks
GLM-4.7-Flash
Lightweight GLM 4.7 branch focused on fast coding, reasoning, and long-context generation.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Fast local coding assistants, Reasoning-heavy drafting with tighter latency budgets
gpt-oss-20b
Apache-2.0 open-weight text model with long context and practical local deployment targets.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Private drafting and extraction workflows, Batch automations with stable cost control
HiDream
Open HiDream family for quality-focused generation and instruction-based image editing.
- Free
- image-generation
- text-to-image
- image-editing
Best for: Thumbnail and visual concept generation, Fast style exploration for creator content
HunyuanDiT
Tencent open-weights diffusion-transformer image model family for high-quality text-to-image generation.
- Free
- image-generation
- text-to-image
- open-weights
Best for: Developer workflows, Faceless content production
InternVL 3.5
Apache-2.0 multimodal family with many size options and a strong focus on reasoning, OCR, and agent-style visual tasks.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Multimodal internal analysis workflows, Builders experimenting with vision-language tasks
InvokeAI
Polished local-first generative image platform with strong workflow UX for Stable Diffusion users.
- Free
- image-generation
- stable-diffusion
- local-inference
Best for: Professional local image workflows with cleaner UX
Kandinsky 3
Open-weights text-to-image model family oriented to prompt-following and stylistic generation.
- Free
- image-generation
- text-to-image
- open-weights
Best for: Thumbnail and visual concept generation, Faceless content production
Kimi K
Open-weight Kimi model line for long-context reasoning and local LLM experimentation.
- Free
- local-inference
- open-weights
- reasoning
Best for: Local long-context drafting and analysis, Builders comparing open-weight LLM stacks
Kokoro TTS
Compact open-weight TTS model for local voice synthesis and experimentation.
- Free
- text-to-speech
- voiceover
- local-inference
Best for: Lightweight local text-to-speech experiments
Kolors
Open-weights text-to-image model family from Kwai for high-quality image synthesis workflows.
- Free
- image-generation
- text-to-image
- open-weights
Best for: Thumbnail and visual concept generation, Faceless content production
LatentSync
Open-source lip-sync framework for generating talking portrait videos from audio and face inputs.
- Free
- avatar-video
- local-inference
- open-source
Best for: Free local talking-head generation
LivePortrait
Open-source local portrait animation tool that turns a single image into a talking video.
- Free
- avatar-video
- local-inference
- open-source
Best for: Local avatar animation workflows
Llama 3.1
Open model family often used as a balanced local default for general chat, writing, and coding.
- Free
- local-inference
- open-weights
- self-hosted
Best for: General local chat and assistant workflows, Summarization and drafting tasks
Llama 3.2 Vision
Vision-capable Llama model for local image-plus-text understanding tasks.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Local image + text analysis workflows, Multimodal document understanding
Llama 3.3
Larger Llama generation aimed at high-quality local reasoning and assistant workflows.
- Free
- local-inference
- open-weights
- self-hosted
Best for: High-quality local assistant workflows, Reasoning-heavy long-form tasks
Llama 4
Open-weight multimodal family with massive context, but significant policy and license constraints.
- Free
- local-inference
- open-weights
- multimodal
Best for: Large multi-document summarization pipelines, Multimodal internal analysis workflows
LocalAI
Open-source local AI runtime with OpenAI-compatible APIs for self-hosted LLM and multimodal workloads.
- Free
- local-inference
- self-hosted
- open-source
Best for: Local model serving and testing, Private local LLM workflows
LocalForge
Open-source local app for running AI models and workflows on your own machine.
- Free
- local-inference
- open-source
- self-hosted
Best for: Local model serving and testing, Private local assistant workflows
MiniCPM-V 2.6
Efficient local VLM with strong OCR, multi-image, and video understanding in an 8B-class footprint.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Private visual document analysis, Multimodal local assistant workflows
Ministral 3 8B
Apache-2.0 open-weight 8B model tuned for efficient local use with very long context.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Long-document summarization and extraction, Private local assistant workflows
Mistral NeMo
Mid-size model line that balances general reasoning, coding support, and local deployability.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Balanced local assistant workloads, Coding and reasoning mixed tasks
Mixtral 8x22B
Mixture-of-experts model family offering strong quality with favorable active-parameter efficiency.
- Free
- local-inference
- open-weights
- self-hosted
Best for: High-end local inference setups, Long-context reasoning workflows
Molmo
Open vision-language family from AI2 focused on strong multimodal quality with Apache-2.0 licensing.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Multimodal document understanding, Private visual document analysis
MuseTalk
Open-source real-time lip-sync framework for talking avatar and portrait video workflows.
- Free
- avatar-video
- local-inference
- open-source
Best for: Free local talking-head generation
NVIDIA Nemotron
Open model family for agentic AI with reasoning-focused releases across edge, single-GPU, and multi-GPU tiers.
- Free
- open-weights
- reasoning
- agentic-ai
Best for: Agentic AI prototyping, Reasoning-heavy developer workflows
Ollama
Local LLM runtime for running open models on your own machine with simple CLI and API workflows.
- Free
- local-inference
- self-hosted
- offline
Best for: Local model serving and testing, Privacy-first AI workflows
Phi-3 Mini
Lightweight Phi model family for fast local inference on modest hardware.
- Free
- local-inference
- open-weights
- on-device
Best for: Low-latency local chat and coding help, Entry-level local LLM deployments
Phi-3.5 Mini Instruct
MIT-licensed small model with long context, optimized for practical local and on-device use.
- Free
- local-inference
- open-weights
- on-device
Best for: Private drafting and summarization on modest hardware, Lightweight offline content automation
Phi-3.5 Vision Instruct
Compact MIT-licensed multimodal model for local image, OCR, chart, and multi-image reasoning tasks.
- Free
- local-inference
- open-weights
- on-device
Best for: Multimodal document understanding, Private visual document analysis
Phi-4
Higher-capability Phi model for instruction-following and reasoning-heavy local tasks.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Reasoning-heavy local workflows, Structured instruction and planning tasks
Phi-4 Reasoning
Reasoning-tuned Phi-4 variant for complex chain-of-thought style local workloads.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Complex reasoning and analytical tasks, Local private inference with explicit step logic
Piper TTS
Fast local neural text-to-speech engine for offline voice generation.
- Free
- text-to-speech
- voiceover
- local-inference
Best for: Local private text-to-speech pipelines
PixArt-Σ
Open-weights text-to-image model line focused on efficient high-resolution generation.
- Free
- image-generation
- text-to-image
- open-weights
Best for: Thumbnail and visual concept generation, Faceless content production
Qwen2.5
Versatile multilingual open model family with strong long-form writing and instruction-following behavior.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Multilingual content generation, Long-form drafting and rewriting
Qwen2.5 VL
Multimodal Qwen model family for local vision-language workflows.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Multimodal local assistant workflows, Private visual document analysis
Qwen3 8B
Apache-2.0 open-weight 8B model with 128K context, local-first deployment, and optional cloud API access.
- Free
- local-inference
- open-weights
- self-hosted
Best for: Private local writing and rewriting, Multilingual content transformation
SadTalker
Open-source audio-driven talking-face generator for creating avatar-style clips from still portraits.
- Free
- avatar-video
- local-inference
- open-source
Best for: Free local talking-head generation
Stable Diffusion
Open model family for text-to-image generation, spanning v1.x, v2.x, SDXL, and SD3/SD3.5.
- Free
- image-generation
- design
- thumbnails
Best for: Faceless content production, Solopreneur operations
SwarmUI
Local-first Stable Diffusion UI focused on multi-model orchestration and scalable generation queues.
- Free
- image-generation
- stable-diffusion
- local-inference
Best for: Local Stable Diffusion workflows with larger job queues
Voicebox
Local-first open-source voice cloning studio powered by Qwen3-TTS.
- Free
- text-to-speech
- voice-cloning
- local-inference
Best for: Local custom voiceover pipelines, Advanced local text-to-speech pipelines
Z-Image
Z-Image text-to-image family for high-fidelity generation and fast iterative visual production.
- Free
- image-generation
- text-to-image
- image-editing
Best for: Thumbnail and visual concept generation, Fast style exploration for creator content