arxiv:2605.20552
Michal Valko
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
liked a dataset about 4 hours ago
ulamai/verified-research-reasoning-trajectories authored a paper 28 days ago
Spectral bandits for smooth graph functions with applications in recommender systems updated a dataset 28 days ago
misovalko/my-research-papers