The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer Paper • 2602.02557 • Published May 29 • 21
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing Paper • 2605.25893 • Published May 25 • 39
PolySAE: Modeling Feature Interactions in Sparse Autoencoders via Polynomial Decoding Paper • 2602.01322 • Published Feb 1 • 8