Bozhou Li's picture

Bozhou Li

zooblastlbz

·

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Are Bigger Encoders Always Better in Vision Large Models?

authored a paper 6 days ago

The First Prompt Counts the Most! An Evaluation of Large Language Models on Iterative Example-based Code Generation

authored a paper 6 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

View all activity

Organizations

authored 9 papers 6 days ago

Are Bigger Encoders Always Better in Vision Large Models?

Paper • 2408.00620 • Published Aug 1, 2024

The First Prompt Counts the Most! An Evaluation of Large Language Models on Iterative Example-based Code Generation

Paper • 2411.06774 • Published Nov 11, 2024

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published Dec 17, 2025 • 25

The Unseen Bias: How Norm Discrepancy in Pre-Norm MLLMs Leads to Visual Information Loss

Paper • 2512.08374 • Published Dec 9, 2025

DiaDem: Advancing Dialogue Descriptions in Audiovisual Video Captioning for Multimodal Large Language Models

Paper • 2601.19267 • Published Jan 27

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

Paper • 2602.01630 • Published Feb 2 • 50

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published Feb 4 • 50

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

Paper • 2602.03510 • Published Feb 3 • 27

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 8 days ago • 200

upvoted 2 papers 7 days ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published 8 days ago • 115

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 8 days ago • 200

upvoted a paper 11 days ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 18 days ago • 350

upvoted a paper about 1 month ago

Kling-MotionControl Technical Report

Paper • 2603.03160 • Published Mar 3 • 26

upvoted a paper about 2 months ago

GENIUS: Generative Fluid Intelligence Evaluation Suite

Paper • 2602.11144 • Published Feb 11 • 55

upvoted 2 papers 2 months ago

OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models

Paper • 2602.04804 • Published Feb 4 • 50

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

Paper • 2602.03510 • Published Feb 3 • 27

submitted a paper to Daily Papers 2 months ago

Semantic Routing: Exploring Multi-Layer LLM Feature Weighting for Diffusion Transformers

Paper • 2602.03510 • Published Feb 3 • 27

upvoted a paper 2 months ago

Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks

Paper • 2602.01630 • Published Feb 2 • 50

upvoted 2 papers 3 months ago

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published Jan 15 • 32

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published Dec 17, 2025 • 25