Avieshek@lemmy.world to Technology@lemmy.world, English · 3 months ago
Edward Snowden slams Nvidia's RTX 50-series 'F-tier value,' whistleblows on lackluster VRAM capacity (www.tomshardware.com)
Jeena@piefed.jeena.net · 3 months ago
Exactly, I'm in the same situation now, and the 8 GB in those cheaper cards won't even let you run a 13B model. I'm trying to research whether I can run a 13B one on a 3060 with 12 GB.
The Hobbyist@lemmy.zip · 3 months ago
You can. I'm running a 14B DeepSeek model on mine. It achieves 28 t/s.
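A quick sanity check on why a 14B model fits in 12 GB: it comes down to quantization. The figure of ~4.5 bits per parameter below is an assumption (roughly what a Q4_K_M GGUF works out to), not something stated in the thread.

```python
# Back-of-envelope: does a 14B-parameter model fit in 12 GB of VRAM?
# Assumption: weights quantized to ~4.5 bits/param (typical 4-bit GGUF).

def weights_gib(n_params: float, bits_per_param: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    return n_params * bits_per_param / 8 / 2**30

q4 = weights_gib(14e9, 4.5)   # ~7.3 GiB -> fits, with headroom for the cache
fp16 = weights_gib(14e9, 16)  # ~26 GiB  -> does not fit on a 12 GB card
print(f"14B @ ~4.5 bpw: {q4:.1f} GiB, @ fp16: {fp16:.1f} GiB")
```

So the quantized weights leave a few GiB free on a 12 GB 3060, which is what the KV cache and activations then eat into.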
levzzz@lemmy.world · 3 months ago
You need a pretty large context window to fit all the reasoning; Ollama forces 2048 by default, and a larger context uses more memory.
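The memory cost of a larger context comes from the KV cache, which grows linearly with context length. A rough sketch below; the architecture numbers (48 layers, 8 KV heads via GQA, head dim 128, fp16 cache) are assumptions for a typical 14B-class model, not values from the thread.

```python
# Why raising the context window costs VRAM: the KV cache stores keys and
# values for every layer at every position, so it scales linearly with
# context length. Architecture numbers here are illustrative assumptions.

def kv_cache_gib(ctx_len: int, n_layers: int = 48, n_kv_heads: int = 8,
                 head_dim: int = 128, bytes_per_elem: int = 2) -> float:
    """Approximate KV cache size in GiB for a given context length."""
    per_token = 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem  # K + V
    return ctx_len * per_token / 2**30

for ctx in (2048, 8192, 32768):
    print(f"num_ctx={ctx:>5}: ~{kv_cache_gib(ctx):.2f} GiB of KV cache")
```

In Ollama the context length is set with the `num_ctx` parameter, e.g. `PARAMETER num_ctx 8192` in a Modelfile or as an option on an API request; under these assumptions that particular jump from 2048 to 8192 costs roughly an extra gigabyte of VRAM.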
Jeena@piefed.jeena.net · 3 months ago
Oh nice, that's faster than I imagined.