Anthropic claims new AI security method blocks 95% of jailbreaks, invites red teamers to try (VentureBeat, 2 days ago)
The new Claude safeguards have already technically been broken, but Anthropic says this was due to a glitch, so try again.