Chinese AI lab DeepSeek’s last model release was V3.2 (and V3.2 Speciale) last December. They just dropped the first of their hotly anticipated V4 series in the shape of two …
Really seems like DeepSeek is one of the only vendors actually focusing on performance per unit of compute rather than just throwing infinite compute at the problem. Calling it now: when the bubble bursts, they'll be one of the few to make it out with a usable product.
For sure, they've probably dropped more significant papers in the past year than any other group. It does seem like the mindset in China is very different overall, though. In the States, it's basically a cult at this point where they're trying to build a god with AGI. In China, it's treated as just another tool for automation, and companies see it as common infrastructure, akin to Linux, that people will build interesting things on. Hence why pretty much all the models in China are developed on an open basis. Everybody there seems to realize that there's no real path toward monetizing the models themselves.
Gary Marcus has published articles theorizing that this is why LLM/neural-network models are so appealing to American capitalists: they at least have the appearance of something that can be infinitely scaled with investment (screw diminishing returns, right?).
For sure. I think they genuinely believe that if AI got good enough, they could just cull all the pesky workers at that point and live like gods.