

For sure. I was wondering what people are doing beyond LLMs. Is there some next-gen take that’s coming out now?
Reddit sucks




Got it. Thanks!


Oh, that’s really interesting! I’m also interested in the classification case. Can you tell me more, or point me to where I can learn more about DeBERTa? Do you train it the same way, with prompt and response sets? Does it work with any open-source model? I can only run models up to 4B right now.


Thanks! Good to know.


Yeah, big move towards agents. They’re based on LLMs, no?


Do you have screenshots to prove this? How do you use LLMs, and which ones?


For sure. Thanks!


Thanks! So any GGUF file should be safe? I’ve been downloading them from Hugging Face.
Yeah, it’s wild what some people are letting models do with MCP. Really the Wild West.


Thanks! Is Qwen considered trustworthy?
I’ll check out a network sniffer.


For sure. You would need a model that wasn’t censored during training.


The reasoning for “hello” is crazy haha. I’ve experienced the same, but I’ve had some success when I turn off reasoning at launch and explicitly state the rules I want it to break. I was trying to get it to tell me a story about llamas having sex, and it went on forevvver reasoning about why it shouldn’t say things and how to rephrase to avoid breaking rules. The funniest part of the reasoning was “llamas don’t have penises (obviously, they’re mammals)”. Haha, it reasoned itself into thinking llamas, and mammals, don’t have penises.


How does the model connect to the internet if I don’t give it a tool to do so? What if I’m not connected to the internet while using it? Does it then send the packets after I connect? Is this documented somewhere? What’s a better model that doesn’t do this?


Thanks! I don’t think I can run an 8B yet. Need to invest in a better machine. I’m stuck on 4B Q4.
The uncensored Qwen that I’m using started throwing infinite question marks at me one time. I had to restart it, and it has been fine since.


Jesus that MoE wiki is a fucking rabbit hole.
Thanks for sharing! Unfortunately I haven’t invested in a decent computer yet. I’m using a 16GB GPU, so I’ve been stuck on 4B Q4s.
I’m not particularly interested in ERP, but I have obviously been using it for testing models. I’m more curious about other topics with guardrails.
I noticed that Qwen 3.5 uncensored is good if I turn off reasoning and explicitly say I want it to break the rules.
I’ll check out SillyTavern tho. Thanks!


Thanks for the explanation!
The use case is writing marketing communications to match a library of content that a company has already written.
We’re currently using RAG and it’s okay, but I’m wondering how much better it would be if the model were fine-tuned.


Thanks! I’ll check out that model. Is it actually usable or just good at being uncensored?


Thanks! I’ll try it out. I’m on an old phone and resistant to switch to a bigger one.


Thanks! I’ll do some research.
I want to, but I don’t think I have the hardware to support it. I’d need at least a decent GPU, right?