☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to Technology@lemmygrad.ml · English · 1 year ago
DeepSeek-V3 now runs at 20 tokens per second on Mac Studio, and that's a nightmare for OpenAI (venturebeat.com)
cross-posted to: [email protected], [email protected]
CriticalResist8@lemmygrad.ml · 1 year ago
But imagine what you'll be able to run it on in four more months. But yeah, it's stretching the definition of consumer hardware a bit.
pinguinu [any]@lemmygrad.ml · 1 year ago
You can already run the smaller models on (beefy) consumer hardware. That's something, right? 😅
CriticalResist8@lemmygrad.ml · 1 year ago
I want the full 1TB model running on my 10-year-old Linux laptop
pinguinu [any]@lemmygrad.ml · 1 year ago
Just put your persistent memory as swap. Easy
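Tongue-in-cheek, but the underlying trick is real: on Linux you can back swap with a file on persistent storage, so a model larger than RAM can page in and out (very slowly). A minimal sketch, assuming root access and a filesystem with enough free space; the 64G size is purely illustrative:

```shell
# Create a 64 GiB swap file on disk (illustrative size; a full
# DeepSeek-V3 checkpoint would need far more, and run glacially).
sudo fallocate -l 64G /swapfile
sudo chmod 600 /swapfile   # swap files must not be world-readable
sudo mkswap /swapfile      # format the file as swap space
sudo swapon /swapfile      # enable it immediately
swapon --show              # verify the new swap area is active
```

Paging model weights through disk-backed swap is orders of magnitude slower than RAM or unified memory, which is the joke: it technically runs, at a crawl.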