The chatbot remains the most popular AI assistant worldwide, with over 1.1 billion monthly users, followed by Gemini with 662 million and Claude with 245 million.
So, people are falling for Anthropic’s marketing scheme?
What scheme, their last models have been so much better than OpenAI’s it’s no surprise people are moving to Claude.
The company taking most of ChatGPT’s market share is actually Gemini, which I think is cheating because it’s basically just padding the numbers with random Google searches.
Personally, I find myself using the Chinese alternatives more and more as they are just way cheaper.
I’ve been loving minimax-m3 since its release
The good thing is that DeepSeek can be run locally relatively well on consumer hardware. I trust Chinese companies as much as I trust American companies with my data and my prompts.
You have 170+ GB VRAM at home? (:
I mainly use DeepSeek v4 Flash now; it’s the cheapest around and the quality is high enough for coding. At work we’re throwing tons of money at Claude, but even there I usually stick to Sonnet (as Opus is burning money).
You don’t need 170+ GB of VRAM. The whole model can run at around 1 token/second on modern hardware straight from an SSD. That is slow, don’t get me wrong, but it’s still somewhat usable.
There are a lot of good models free online that are open source/Chinese, and the only cost is a bit of slowness on my rig. No token limit or whatever. Totally agree.
It’s about as good as commercial products now…
Can’t be Europeans, as they’re locked out of the new models LOL
As is everyone?
I switched to opencode and DeepSeek