When Does Trajectory-Level Supervision Permit Efficient Offline Reinforcement Learning? Paper • 2606.18531 • Published 9 days ago • 4