‘Do Anything Now’: Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
Patrick Gough
Abstract
At reading group tomorrow, 2/25, Patrick Gough will give a talk on "‘Do Anything Now’: Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models." Hope to see you there at 12:30pm in IRB 5105 or on Zoom! If you want lunch, please fill out this form before 9:30am tomorrow.
This talk is organized by Wentao Guo