Automatic Image-Level Morphological Trait Annotation for Organismal Images Paper • 2604.01619 • Published 3 days ago • 3
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published Feb 13 • 57
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents Paper • 2510.24702 • Published Oct 28, 2025 • 30
CE-Bench: Towards a Reliable Contrastive Evaluation Benchmark of Interpretability of Sparse Autoencoders Paper • 2509.00691 • Published Aug 31, 2025 • 2
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper • 2506.21506 • Published Jun 26, 2025 • 52
An Illusion of Progress? Assessing the Current State of Web Agents Paper • 2504.01382 • Published Apr 2, 2025 • 4
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Paper • 2502.14802 • Published Feb 20, 2025 • 13
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents Paper • 2502.11357 • Published Feb 17, 2025 • 11
Diversifying Joint Vision-Language Tokenization Learning Paper • 2306.03421 • Published Jun 6, 2023 • 2
A Systematic Investigation of KB-Text Embedding Alignment at Scale Paper • 2106.01586 • Published Jun 3, 2021 • 1
Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs Paper • 2401.00608 • Published Dec 31, 2023 • 2
A Retrieve-and-Read Framework for Knowledge Graph Link Prediction Paper • 2212.09724 • Published Dec 19, 2022 • 1
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents Paper • 2411.06559 • Published Nov 10, 2024 • 16
Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving Paper • 2411.07228 • Published Nov 11, 2024
Sentence Embedding Models for Ancient Greek Using Multilingual Knowledge Distillation Paper • 2308.13116 • Published Aug 24, 2023 • 3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents Paper • 2410.05243 • Published Oct 7, 2024 • 20
A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis Paper • 2311.04157 • Published Nov 7, 2023
BIOCLIP: A Vision Foundation Model for the Tree of Life Paper • 2311.18803 • Published Nov 30, 2023 • 1