Can AI follow your company’s rules in long chats? Meet the Pluralistic Behavior Suite
AI assistants are not used in a vacuum—they operate inside hospitals, banks, classrooms, and brands, each with its own rules. This paper introduces Pluralistic Behavior Suite (PBSUITE), a testbed to see whether language models can stick to your custom policies over multi-turn conversations. * 300 realistic behavioral policies across 30