| Best first install | qwen2.5-coder:7b | Best quality/speed balance for coding on 16GB RAM + RTX 3050 Ti | Usually smooth for chat-style coding and medium files |
| Fastest coding helper | deepseek-coder:6.7b | Lighter footprint and fast responses for practical code edits | Good speed for short iterations and autocomplete-like tasks |
| Very lightweight | starcoder2:3b | Low memory pressure, easy to keep responsive | Fastest option, but weaker on complex reasoning |
| Solid alternative | starcoder2:7b | Reasonable quality without jumping to heavy model sizes | Balanced for refactors and medium complexity tasks |
| Solid alternative | codellama:7b | Mature coding model family with stable behavior | Works well for common coding workflows |
| Bigger but slower | qwen2.5-coder:14b | Can run, but often spills to system RAM on this hardware class | Noticeably slower token speed than 7B |