DySCO: Dynamic Attention-Scaling Decoding for Long-Context Language Models (arXiv:2602.22175, published 11 days ago, 1 upvote)
Attention Editing: A Versatile Framework for Cross-Architecture Attention Conversion (arXiv:2604.05688, published 20 days ago, 1 upvote)
NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training (arXiv:2603.03597, published Mar 4, 1 upvote)
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers (arXiv:2602.06079, published Feb 4, 20 upvotes)
RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs (arXiv:2602.05367, published Feb 5, 8 upvotes)
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math (arXiv:2602.06291, published Feb 6, 24 upvotes)
Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization (arXiv:2506.13331, published Jun 16, 2025, 2 upvotes)
The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling (arXiv:2603.07461, published Mar 8, 2 upvotes)
JEPA-Reasoner: Decoupling Latent Reasoning from Token Generation (arXiv:2512.19171, published Dec 22, 2025, 3 upvotes)
A Neuroscience-Inspired Dual-Process Model of Compositional Generalization (arXiv:2507.18868, published Jul 25, 2025, 2 upvotes)
Embarrassingly Simple Self-Distillation Improves Code Generation (arXiv:2604.01193, published 25 days ago, 46 upvotes)
H2LooP Spark Preview: Continual Pretraining of Large Language Models for Low-Level Embedded Systems Code (arXiv:2603.11139, published Mar 13, 1 upvote)
Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi (arXiv:2603.03508, published Mar 3, 4 upvotes)