AI & ML interests

None defined yet.

Recent Activity

mucai submitted a paper 1 day ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

HanSolo9682 authored a paper 5 days ago

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

HanSolo9682 authored a paper about 1 month ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

View all activity

mucai

submitted a paper to Daily Papers 1 day ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published 1 day ago • 6

HanSolo9682

authored a paper 5 days ago

Unified Spatio-Temporal Token Scoring for Efficient Video VLMs

Paper • 2603.18004 • Published 10 days ago • 12

HanSolo9682

authored 2 papers about 1 month ago

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 32

Reasoning-Augmented Representations for Multimodal Retrieval

Paper • 2602.07125 • Published Feb 6

HanSolo9682

submitted a paper to Daily Papers about 2 months ago

Reasoning-Augmented Representations for Multimodal Retrieval

Paper • 2602.07125 • Published Feb 6

HanSolo9682

authored a paper over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

mucai

authored a paper over 1 year ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

HanSolo9682

authored 3 papers over 1 year ago

CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples

Paper • 2402.13254 • Published Feb 20, 2024

VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation

Paper • 2407.10972 • Published Jul 15, 2024 • 1

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

mucai

authored 2 papers over 1 year ago

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3, 2024 • 7

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28, 2024 • 18

mucai

authored a paper almost 2 years ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27, 2024 • 34

HanSolo9682

updated 6 models about 2 years ago

AI & ML interests

Recent Activity

Team members 2

CounterCurate's activity