Thursday, January 23

Protected: Jailbreaking GPT Models: Rethinking Safety Measures and Evaluations with StrongREJECT Benchmark