Code Agent can be an End-to-end System Hacker: Benchmarking Real-world Threats of Computer-use Agent Paper • 2510.06607 • Published Oct 8 • 3
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information Paper • 2510.03632 • Published Oct 4 • 41