cheese_greater@lemmy.world to Ask Lemmy@lemmy.world · 27 days ago
What's a good local and free LLM model for Windows?
sobanto@feddit.org · 27 days ago:
Toes♀@ani.social · 27 days ago:
Do you have a recommendation for an Nvidia RTX 3070 Ti 8 GB, Ryzen 5 5600X + 16 GB DDR4? Does it even make sense to use it? Last time I tried, the results were pretty underwhelming.
Try it with this model, using the Q4_K_S version.
https://huggingface.co/bartowski/mlabonne_gemma-3-12b-it-abliterated-GGUF
You’ll probably need to play with the context window size until you get an acceptable level of performance (likely 4096).
Ideally you’d have more RAM, but I want to say this smaller model should work. KoboldCpp will try to use both your GPU and CPU to run the model.
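For reference, a KoboldCpp launch for that kind of setup might look something like this. The model filename, layer count, and context size are assumptions for an 8 GB card, not exact requirements; lower --gpulayers if you run out of VRAM:

```shell
# Launch KoboldCpp with partial GPU offload on an 8 GB card.
# Flag values here are illustrative, not exact:
#   --usecublas    use the CUDA backend (for the RTX 3070 Ti)
#   --gpulayers    how many layers to offload to VRAM (reduce if you OOM)
#   --contextsize  smaller context window = less memory used
python koboldcpp.py \
  --model mlabonne_gemma-3-12b-it-abliterated-Q4_K_S.gguf \
  --usecublas --gpulayers 30 --contextsize 4096
```

Whatever doesn't fit in VRAM stays in system RAM and runs on the CPU, which is why more RAM helps even with a fixed 8 GB card.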