A note that this setup runs the 671B model at Q4 quantization at 3-4 tokens per second; running it at Q8 would need something beefier. To run the 671B model in the original Q8 at 6-8 TPS you'd need a dual-socket EPYC server motherboard with 768GB of RAM.
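Rough napkin math on where those RAM figures come from, assuming roughly 4.5 and 8.5 effective bits per weight for typical Q4/Q8 quants (exact sizes vary by quant scheme, and this ignores KV cache and runtime overhead):

```python
# Back-of-the-envelope weight-memory estimate for a 671B-parameter model
# at different quantization widths. Approximate figures only.

PARAMS = 671e9  # parameter count

def weights_gb(bits_per_weight: float) -> float:
    """Approximate weight storage in GB for a given quantization width."""
    return PARAMS * bits_per_weight / 8 / 1e9

for name, bits in [("Q4", 4.5), ("Q8", 8.5)]:
    print(f"{name}: ~{weights_gb(bits):.0f} GB for the weights alone")

# Q4: ~377 GB of weights alone
# Q8: ~713 GB, hence the dual-socket EPYC board with 768 GB of RAM
```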
The “wall” they’re talking about is orthodox AI not getting better despite being fed ever more data. DeepSeek sidesteps this by using many smaller models that can be switched between for different tasks, instead of the orthodox method of trying to make one “general intelligence” model that works for everything.
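For illustration only, here's a toy sketch of that switching idea, in the spirit of a mixture-of-experts router. The sizes, names, and routing rule are made up and are not DeepSeek's actual architecture; the point is just that only a few small experts do work for any given input:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "experts": small independent weight matrices standing in for
# smaller specialised models. Sizes are arbitrary, purely illustrative.
NUM_EXPERTS, D_IN, D_OUT, TOP_K = 4, 8, 8, 2
experts = [rng.normal(size=(D_IN, D_OUT)) for _ in range(NUM_EXPERTS)]
router_w = rng.normal(size=(D_IN, NUM_EXPERTS))  # learned in a real system

def route(x: np.ndarray) -> np.ndarray:
    """Send the input only to the top-k experts and mix their outputs."""
    scores = x @ router_w                      # one score per expert
    top = np.argsort(scores)[-TOP_K:]          # pick the best-matching experts
    weights = np.exp(scores[top])
    weights /= weights.sum()                   # softmax over the chosen experts
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

y = route(rng.normal(size=D_IN))
print(y.shape)  # (8,) -- only 2 of the 4 experts did any work
```

Because only a fraction of the total parameters is active for any given input, a very large model can stay comparatively cheap to run.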
Just reasoning alone almost destroyed the Western AI industry overnight. They scrambled to make their own reasoning models in less than a week, but it changed everything.
I think the next big idea could be models dynamically training sub-models on demand. Approaches like HRM, which require far less training data and far fewer parameters, are already being explored. Another avenue focuses on creating reusable memory components, as seen with MemOS. That blurs the line between training and operation, with the model just continuously learning. What we might see is models that create an agent to learn a new task, and once it's learned, that agent can be reused and shared going forward.
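Purely as a hypothetical sketch of that last idea (none of this is HRM's or MemOS's actual API), the "train a sub-model on demand, then reuse and share it" pattern boils down to something like a task-keyed registry:

```python
from typing import Callable, Dict

# Hypothetical sketch: a parent system trains a task-specific sub-model the
# first time a task shows up, then reuses the cached one afterwards.

class SubModelRegistry:
    """Keeps task-specific sub-models around so they can be reused or shared."""

    def __init__(self, train_fn: Callable[[str], Callable[[str], str]]):
        self._train_fn = train_fn          # how the parent trains a new sub-model
        self._models: Dict[str, Callable[[str], str]] = {}

    def solve(self, task: str, query: str) -> str:
        if task not in self._models:       # first encounter: train a sub-model
            self._models[task] = self._train_fn(task)
        return self._models[task](query)   # afterwards: just reuse it

# Stand-in "training" that returns a trivial task-specific function.
registry = SubModelRegistry(lambda task: (lambda q: f"[{task} model] answer to {q!r}"))
print(registry.solve("translation", "hello"))    # trains once
print(registry.solve("translation", "goodbye"))  # reuses the cached sub-model
```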
From what we know, human intelligence is also structured hierarchically: the brain has regions responsible for specific tasks like vision processing, with a higher-level reasoning system built on top of that.