• ☆ Yσɠƚԋσʂ ☆@lemmygrad.mlOP
      28 days ago

      If you have the memory, I can highly recommend Qwen3.6-35B-A3B-Q8. It's hands down the best local model I've tried. It only activates about 3B params per token too, so it should run with 16 GB, or you can drop to a lower quant.

    • CriticalResist8@lemmygrad.ml
      28 days ago

      DeepSeek V4 Pro! You top up your credit as you go, and they're having a sale until May 31st, but even without the sale 1M output tokens cost "only" 3.48. Flash is only 0.28 per 1M output tokens.
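
      For a rough sense of what that means per job, here's a minimal sketch of the arithmetic. It assumes the non-sale output prices quoted above, in whatever currency you're billed in, and ignores input tokens and the sale discount. The function and tier names are just illustrative, not anything from the provider's API.

```python
# Rough output-token cost estimate. Prices are the non-sale figures quoted
# in this comment, per 1M output tokens; input-token costs are not included.
PRICE_PER_1M_OUTPUT = {"pro": 3.48, "flash": 0.28}


def output_cost(tokens: int, tier: str) -> float:
    """Return the cost of generating `tokens` output tokens on `tier`."""
    return tokens / 1_000_000 * PRICE_PER_1M_OUTPUT[tier]


# Example: compare both tiers for a job that emits 250k output tokens.
for tier in PRICE_PER_1M_OUTPUT:
    print(f"{tier}: {output_cost(250_000, tier):.2f}")
```

      Swap in your own token counts; real bills will also include input tokens, which aren't covered here.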

      • Che's Motorcycle@lemmygrad.ml
        28 days ago

        Not sure if I could swing DeepSeek at my job tho. Surprisingly, Cursor still comes with Kimi2 as a model option, so there's that.