Local LLMs

Self-hosted and on-device model workflows for privacy and predictable usage costs.

Browse Local LLMs tools filtered by practical fit and workflow needs.

54 matching tools.

Tools in this category

AUTOMATIC1111 logo

AUTOMATIC1111

Feature-rich Stable Diffusion WebUI with extensive model, extension, and parameter control.

  • Free
  • image-generation
  • stable-diffusion
  • local-inference

Best for: Advanced local Stable Diffusion workflows

CogView 4 logo

CogView 4

THUDM text-to-image model family for high-quality generation in open research and local workflows.

  • Free
  • image-generation
  • text-to-image
  • open-weights

Best for: Developer workflows, Faceless content production

ComfyUI TTS logo

ComfyUI TTS

Node-based text-to-speech and voice workflow stack inside ComfyUI using custom audio nodes.

  • Free
  • text-to-speech
  • voiceover
  • narration

Best for: Local custom voiceover pipelines, Experimental multi-model TTS workflows

Command R+ logo

Command R+

Large instruction-tuned model oriented to advanced assistant and retrieval-heavy workflows.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Advanced local assistant deployments, Complex retrieval and planning workflows

Coqui TTS logo

Coqui TTS

Open-source toolkit for local text-to-speech and voice cloning workflows.

  • Free
  • text-to-speech
  • voiceover
  • local-inference

Best for: Advanced local text-to-speech pipelines

DeepSeek-R1 logo

DeepSeek-R1

Reasoning-focused open-weight family with MIT core licensing and smaller distilled options.

  • Free
  • local-inference
  • open-weights
  • mit

Best for: Reasoning-heavy workflows on distilled checkpoints, Local experimentation with open model pipelines

DeepSeek-VL2 logo

DeepSeek-VL2

Mixture-of-experts local vision-language family for OCR, documents, charts, and grounded multimodal reasoning.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Private visual document analysis, Multimodal document understanding

FLUX logo

FLUX

FLUX family for quality-first generation, fast local variants, and modern in-context image editing workflows.

  • Free
  • image-generation
  • text-to-image
  • image-editing

Best for: Thumbnail and visual concept generation, Fast style exploration for creator content

Fooocus logo

Fooocus

Beginner-friendly local Stable Diffusion UI focused on high-quality images with minimal setup.

  • Free
  • image-generation
  • stable-diffusion
  • local-inference

Best for: Fast local image generation with minimal setup

Forge logo

Forge

Performance-focused Stable Diffusion WebUI fork designed for practical local generation speed and compatibility.

  • Free
  • image-generation
  • stable-diffusion
  • local-inference

Best for: Faster local Stable Diffusion workflows in a linear WebUI

Gemma 2 logo

Gemma 2

Compact-to-mid-size model family that is efficient for local chat, summarization, and lightweight coding.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Efficient local chat workloads, Summarization and long-form drafting

Gemma 3 logo

Gemma 3

Portable open-weight family with long context and multimodal options under custom terms.

  • Free
  • local-inference
  • open-weights
  • on-device

Best for: Local assistants with manageable compliance processes, Multimodal summarization and extraction

GLM-4.5 Air logo

GLM-4.5 Air

Open-weight GLM model variant for local reasoning, coding, and automation workflows.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Private local LLM workflows, Reasoning and coding support in automation tasks

GLM-4.7-Flash logo

GLM-4.7-Flash

Lightweight GLM 4.7 branch focused on fast coding, reasoning, and long-context generation.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Fast local coding assistants, Reasoning-heavy drafting with tighter latency budgets

gpt-oss-20b logo

gpt-oss-20b

Apache-2.0 open-weight text model with long context and practical local deployment targets.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Private drafting and extraction workflows, Batch automations with stable cost control

HiDream logo

HiDream

Open HiDream family for quality-focused generation and instruction-based image editing.

  • Free
  • image-generation
  • text-to-image
  • image-editing

Best for: Thumbnail and visual concept generation, Fast style exploration for creator content

HunyuanDiT logo

HunyuanDiT

Tencent open-weights diffusion-transformer image model family for high-quality text-to-image generation.

  • Free
  • image-generation
  • text-to-image
  • open-weights

Best for: Developer workflows, Faceless content production

InternVL 3.5 logo

InternVL 3.5

Apache-2.0 multimodal family with many size options and a strong focus on reasoning, OCR, and agent-style visual tasks.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Multimodal internal analysis workflows, Builders experimenting with vision-language tasks

InvokeAI logo

InvokeAI

Polished local-first generative image platform with strong workflow UX for Stable Diffusion users.

  • Free
  • image-generation
  • stable-diffusion
  • local-inference

Best for: Professional local image workflows with cleaner UX

Kandinsky 3 logo

Kandinsky 3

Open-weights text-to-image model family oriented to prompt-following and stylistic generation.

  • Free
  • image-generation
  • text-to-image
  • open-weights

Best for: Thumbnail and visual concept generation, Faceless content production

Kimi K logo

Kimi K

Open-weight Kimi model line for long-context reasoning and local LLM experimentation.

  • Free
  • local-inference
  • open-weights
  • reasoning

Best for: Local long-context drafting and analysis, Builders comparing open-weight LLM stacks

Kokoro TTS logo

Kokoro TTS

Compact open-weight TTS model for local voice synthesis and experimentation.

  • Free
  • text-to-speech
  • voiceover
  • local-inference

Best for: Lightweight local text-to-speech experiments

Kolors logo

Kolors

Open-weights text-to-image model family from Kwai for high-quality image synthesis workflows.

  • Free
  • image-generation
  • text-to-image
  • open-weights

Best for: Thumbnail and visual concept generation, Faceless content production

LatentSync logo

LatentSync

Open-source lip-sync framework for generating talking portrait videos from audio and face inputs.

  • Free
  • avatar-video
  • local-inference
  • open-source

Best for: Free local talking-head generation

LivePortrait logo

LivePortrait

Open-source local portrait animation tool that turns a single image into a talking video.

  • Free
  • avatar-video
  • local-inference
  • open-source

Best for: Local avatar animation workflows

Llama 3.1 logo

Llama 3.1

Open model family often used as a balanced local default for general chat, writing, and coding.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: General local chat and assistant workflows, Summarization and drafting tasks

Llama 3.2 Vision logo

Llama 3.2 Vision

Vision-capable Llama model for local image-plus-text understanding tasks.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Local image + text analysis workflows, Multimodal document understanding

Llama 3.3 logo

Llama 3.3

Larger Llama generation aimed at high-quality local reasoning and assistant workflows.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: High-quality local assistant workflows, Reasoning-heavy long-form tasks

Llama 4 logo

Llama 4

Open-weight multimodal family with massive context, but significant policy and license constraints.

  • Free
  • local-inference
  • open-weights
  • multimodal

Best for: Large multi-document summarization pipelines, Multimodal internal analysis workflows

LocalAI logo

LocalAI

Open-source local AI runtime with OpenAI-compatible APIs for self-hosted LLM and multimodal workloads.

  • Free
  • local-inference
  • self-hosted
  • open-source

Best for: Local model serving and testing, Private local LLM workflows

LocalForge logo

LocalForge

Open-source local app for running AI models and workflows on your own machine.

  • Free
  • local-inference
  • open-source
  • self-hosted

Best for: Local model serving and testing, Private local assistant workflows

MiniCPM-V 2.6 logo

MiniCPM-V 2.6

Efficient local VLM with strong OCR, multi-image, and video understanding in an 8B-class footprint.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Private visual document analysis, Multimodal local assistant workflows

Ministral 3 8B logo

Ministral 3 8B

Apache-2.0 open-weight 8B model tuned for efficient local use with very long context.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Long-document summarization and extraction, Private local assistant workflows

Mistral NeMo logo

Mistral NeMo

Mid-size model line that balances general reasoning, coding support, and local deployability.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Balanced local assistant workloads, Coding and reasoning mixed tasks

Mixtral 8x22B logo

Mixtral 8x22B

Mixture-of-experts model family offering strong quality with favorable active-parameter efficiency.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: High-end local inference setups, Long-context reasoning workflows

Molmo logo

Molmo

Open vision-language family from AI2 focused on strong multimodal quality with Apache-2.0 licensing.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Multimodal document understanding, Private visual document analysis

MuseTalk logo

MuseTalk

Open-source real-time lip-sync framework for talking avatar and portrait video workflows.

  • Free
  • avatar-video
  • local-inference
  • open-source

Best for: Free local talking-head generation

NVIDIA Nemotron logo

NVIDIA Nemotron

Open model family for agentic AI with reasoning-focused releases across edge, single-GPU, and multi-GPU tiers.

  • Free
  • open-weights
  • reasoning
  • agentic-ai

Best for: Agentic AI prototyping, Reasoning-heavy developer workflows

Ollama logo

Ollama

Local LLM runtime for running open models on your own machine with simple CLI and API workflows.

  • Free
  • local-inference
  • self-hosted
  • offline

Best for: Local model serving and testing, Privacy-first AI workflows

Phi-3 Mini logo

Phi-3 Mini

Lightweight Phi model family for fast local inference on modest hardware.

  • Free
  • local-inference
  • open-weights
  • on-device

Best for: Low-latency local chat and coding help, Entry-level local LLM deployments

Phi-3.5 Mini Instruct logo

Phi-3.5 Mini Instruct

MIT-licensed small model with long context, optimized for practical local and on-device use.

  • Free
  • local-inference
  • open-weights
  • on-device

Best for: Private drafting and summarization on modest hardware, Lightweight offline content automation

Phi-3.5 Vision Instruct logo

Phi-3.5 Vision Instruct

Compact MIT-licensed multimodal model for local image, OCR, chart, and multi-image reasoning tasks.

  • Free
  • local-inference
  • open-weights
  • on-device

Best for: Multimodal document understanding, Private visual document analysis

Phi-4 logo

Phi-4

Higher-capability Phi model for instruction-following and reasoning-heavy local tasks.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Reasoning-heavy local workflows, Structured instruction and planning tasks

Phi-4 Reasoning logo

Phi-4 Reasoning

Reasoning-tuned Phi-4 variant for complex chain-of-thought style local workloads.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Complex reasoning and analytical tasks, Local private inference with explicit step logic

Piper TTS logo

Piper TTS

Fast local neural text-to-speech engine for offline voice generation.

  • Free
  • text-to-speech
  • voiceover
  • local-inference

Best for: Local private text-to-speech pipelines

PixArt-Σ logo

PixArt-Σ

Open-weights text-to-image model line focused on efficient high-resolution generation.

  • Free
  • image-generation
  • text-to-image
  • open-weights

Best for: Thumbnail and visual concept generation, Faceless content production

Qwen2.5 logo

Qwen2.5

Versatile multilingual open model family with strong long-form writing and instruction-following behavior.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Multilingual content generation, Long-form drafting and rewriting

Qwen2.5 VL logo

Qwen2.5 VL

Multimodal Qwen model family for local vision-language workflows.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Multimodal local assistant workflows, Private visual document analysis

Qwen3 8B logo

Qwen3 8B

Apache-2.0 open-weight 8B model with 128K context, local-first deployment, and optional cloud API access.

  • Free
  • local-inference
  • open-weights
  • self-hosted

Best for: Private local writing and rewriting, Multilingual content transformation

SadTalker logo

SadTalker

Open-source audio-driven talking-face generator for creating avatar-style clips from still portraits.

  • Free
  • avatar-video
  • local-inference
  • open-source

Best for: Free local talking-head generation

Stable Diffusion logo

Stable Diffusion

Open model family for text-to-image generation, spanning v1.x, v2.x, SDXL, and SD3/SD3.5.

  • Free
  • image-generation
  • design
  • thumbnails

Best for: Faceless content production, Solopreneur operations

SwarmUI logo

SwarmUI

Local-first Stable Diffusion UI focused on multi-model orchestration and scalable generation queues.

  • Free
  • image-generation
  • stable-diffusion
  • local-inference

Best for: Local Stable Diffusion workflows with larger job queues

Voicebox logo

Voicebox

Local-first open-source voice cloning studio powered by Qwen3-TTS.

  • Free
  • text-to-speech
  • voice-cloning
  • local-inference

Best for: Local custom voiceover pipelines, Advanced local text-to-speech pipelines

Z-Image logo

Z-Image

Z-Image text-to-image family for high-fidelity generation and fast iterative visual production.

  • Free
  • image-generation
  • text-to-image
  • image-editing

Best for: Thumbnail and visual concept generation, Fast style exploration for creator content

Related categories

View all categories · View all tools

Alternatives to explore

Share This Page