Pro@programming.dev to Technology@lemmy.worldEnglish · 9 hours agoAnthropic tested Claude's(LLM, AI Chatbot) ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory managementwww.anthropic.comexternal-linkmessage-square8fedilinkarrow-up142arrow-down14
arrow-up138arrow-down1external-linkAnthropic tested Claude's(LLM, AI Chatbot) ability to manage a physical “storefront” to mixed results, as the AI struggled with pricing strategy and inventory managementwww.anthropic.comPro@programming.dev to Technology@lemmy.worldEnglish · 9 hours agomessage-square8fedilink
minus-squareWomble@lemmy.worldlinkfedilinkEnglisharrow-up11arrow-down1·8 hours agoI doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (halucinations and sycophancy)
minus-squareA_norny_mousse@feddit.orglinkfedilinkEnglisharrow-up4arrow-down2·8 hours agoTrue; I just hate headlines that ask stupid questions. But then again, there’s always the premise that it could work, in such attempts, which annoys me no less.
I doubt anyone expected it to work completely, but it is interesting to see to what extent it worked and how it failed (halucinations and sycophancy)
True; I just hate headlines that ask stupid questions.
But then again, there’s always the premise that it could work, in such attempts, which annoys me no less.