Let It Think, Then Lock It In
Let It Think, Then Lock It In Large language models shine at free-flowing reasoning, but that flexibility makes outputs hard to trust and parse. Constrained decoding (e.g., forcing JSON) fixes structure, yet can choke off reasoning. This paper proposes a simple middle path: allow the model to reason