Limited Capacityfull Adding this to your schedule will put you on the waitlist.
I work as AI Governance Lead at Decathlon with a backgrouns in responsible AI and AI safety research engineering. In this talk, I'll present a paper that I've published with NeurIPS and AAAI "Empirical Evidence for Alignment Faking in a Small LLM and Prompt-Based Mitigation Techniques". I have presented a similar talk at the AAAI Fall Symposium Series last year. Given the audience at this summit, I can also spend some time diving into the importance of AI safety in multinational organisations and how we can go beyond policy, to include technical AI safety measures.