Open Philanthropy recommended a grant of $20,000 to ETH Zurich to support a project, led by Xin Chen and Professor Florian Tramèr, to develop reinforcement learning techniques for generating prompt injection attacks against LLM agents.
This falls within Open Philanthropy’s focus area of potential risks from advanced artificial intelligence.