Growing India News, world news, nation news, our news, people's news, grow news, entertainment, fashion, movies, tech, automobile and many more..
Wednesday, May 7, 2025
Show HN: Kevin-32B – how to do multi-turn RL on writing CUDA kernels https://ift.tt/kXTPjpD
Show HN: Kevin-32B – how to do multi-turn RL on writing CUDA kernels Hey – we just published a blog post about Kevin-32B = K(ernel D)evin. It's to our knowledge the first open-source model that's RL-trained on CUDA kernels. Our goal was to demonstrate multi-turn RL using GRPO. We used 180 Python->CUDA conversion tasks from the KernelBench dataset. The results were surprisingly strong! We were able to outperform top reasoning model like o3 & o4-mini. We're sharing our training setup and learnings in the blogpost. Also the model is on HuggingFace: https://ift.tt/aqge680 https://ift.tt/8m2LWQ0 May 7, 2025 at 01:18AM
Subscribe to:
Post Comments (Atom)
Show HN: Yet Another Memory System for LLM's https://ift.tt/0oZIwAv
Show HN: Yet Another Memory System for LLM's Built this for my LLM workflows - needed searchable, persistent memory that wouldn't bl...
-
Show HN: An AI logo generator that can also generate SVG logos Hey everyone, I've spent the past 2 weeks building an AI logo generator, ...
-
Breaking #FoxNews Alert : Number of dead rises after devastating tornadoes, Kentucky governor announces — R Karthickeyan (@RKarthickeyan1)...
-
Show HN: Snap Scope – Visualize Lens Focal Length Distribution from EXIF Data https://ift.tt/yrqHZtDShow HN: Snap Scope – Visualize Lens Focal Length Distribution from EXIF Data Hey HN, I built this tool because I wanted to understand which...
No comments:
Post a Comment