Meet GRACE: a moral governor for safer, more transparent AI
AI agents are getting powerful—so how do we make sure they do the right thing, not just the effective thing?
Meet GRACE, a reason-based moral governor that keeps AI behavior aligned with human norms by separating moral reasoning from goal-driven decision-making.
- Moral Module: uses deontic logic and explicit reasons to decide which high-level actions are permissible.
- Decision-Making Module: the wrapped AI plans optimal low-level actions, but only within the moral boundaries.
- Guard: monitors and enforces compliance, enabling formal checks and statistical guarantees.
Because GRACE reasons with explicit, symbolic factors, its decisions are interpretable, contestable, and justifiable—so stakeholders can inspect, debate, and refine what counts as acceptable behavior.
The authors demo GRACE on a therapy assistant built on an LLM, showing how the system prevents harmful suggestions while still being helpful.
Paper: https://arxiv.org/abs/2601.10520v1
Paper: https://arxiv.org/abs/2601.10520v1
Register: https://www.AiFeta.com
AI AIAlignment AIEthics Safety NeuroSymbolic DeonticLogic LLM ResponsibleAI