Libor Burian
BurnyCoder
AI & ML interests
deep learning, LLMs, science, physics
Recent Activity
liked
a model
about 1 hour ago
allenai/OLMo-7B-0724-hf
updated
a model
about 13 hours ago
BurnyCoder/Qwen2.5-0.5B-Capybara
upvoted
a
paper
about 23 hours ago
Back to Basics: Revisiting REINFORCE Style Optimization for Learning
from Human Feedback in LLMs