Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Paper β’ 2403.09029 β’ Published Mar 14, 2024 β’ 56
SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios Paper β’ 2512.18470 β’ Published 11 days ago β’ 9
To Code, or Not To Code? Exploring Impact of Code in Pre-training Paper β’ 2408.10914 β’ Published Aug 20, 2024 β’ 45
OctoPack: Instruction Tuning Code Large Language Models Paper β’ 2308.07124 β’ Published Aug 14, 2023 β’ 31
π« StarCoder2 Collection StarCoder2 models and datasets! β’ 8 items β’ Updated Mar 1, 2024 β’ 89
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar Paper β’ 2510.14972 β’ Published Oct 16 β’ 34
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Paper β’ 2510.14943 β’ Published Oct 16 β’ 39
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Paper β’ 2510.08002 β’ Published Oct 9 β’ 23
LongCodeZip: Compress Long Context for Code Language Models Paper β’ 2510.00446 β’ Published Oct 1 β’ 106
SoundStorm: Efficient Parallel Audio Generation Paper β’ 2305.09636 β’ Published May 16, 2023 β’ 13