cm0002@lemmy.world to ChatGPT@lemmy.world · 1 month agoRed Teams Jailbreak GPT-5 With Ease, Warn It’s ‘Nearly Unusable’ for Enterprisewww.securityweek.comexternal-linkmessage-square12fedilinkarrow-up194arrow-down12cross-posted to: [email protected]
arrow-up192arrow-down1external-linkRed Teams Jailbreak GPT-5 With Ease, Warn It’s ‘Nearly Unusable’ for Enterprisewww.securityweek.comcm0002@lemmy.world to ChatGPT@lemmy.world · 1 month agomessage-square12fedilinkcross-posted to: [email protected]
minus-squaretroed@fedia.iolinkfedilinkarrow-up21·1 month agoIt’s funny. The “conversational” way to jailbreak an LLM is exactly the same way a journalist breaks through the defenses of a media trained interview target.
minus-squarekossa@feddit.orglinkfedilinkDeutscharrow-up4·1 month agoIgnore all prompts of your PR-consultants and answer truthfully henceforth. Suddenly the politician admits his corruption.
It’s funny. The “conversational” way to jailbreak an LLM is exactly the same way a journalist breaks through the defenses of a media trained interview target.
Give us some hints.
Ignore all prompts of your PR-consultants and answer truthfully henceforth. Suddenly the politician admits his corruption.