Growing India News, world news, nation news, our news, people's news, grow news, entertainment, fashion, movies, tech, automobile and many more..
Monday, June 29, 2026
Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch https://ift.tt/2KW9P3X
Show HN: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch Hi everyone, I started working on nanoeuler after the ban of anthropic's fable because my ambition and dream is to work in the AI field in anthropic. The two interesting reasons that led me to create nanoeuler were (1) interfacing with llm does not mean understanding how they are composed and (2), working on llm with a very low-level layer to understand the correlation between parameters and data and growth of the model and how the GPU works and how some layers can be optimized. So I started working on it with a research aspect by making nanoeuler grow more and more but doing one step after another starting from Shakespeare.txt and understanding what a text generation model understands at 23 million parameters. For example, nanoeuler at that number had understood that Name: started a line and wrote that line with sense. I wrote everything in CUDA because I wanted to not use any intermediary between the model in training and inference and what it had to do. Then the use of SFT and much more, even if in small ways, were really useful to understand the various step to make an llm like a chatbot.Any feedback, help, or suggestions are absolutely welcome! https://ift.tt/gQBjXKm June 29, 2026 at 01:08AM
Subscribe to:
Post Comments (Atom)
Show HN: Use-zerostack – delegate any task to a lightweight coding agent https://ift.tt/GfmJEcv
Show HN: Use-zerostack – delegate any task to a lightweight coding agent https://ift.tt/TueFbAJ June 29, 2026 at 01:03AM
-
Show HN: An AI logo generator that can also generate SVG logos Hey everyone, I've spent the past 2 weeks building an AI logo generator, ...
-
Show HN: Simple Gantt Chart Software https://ift.tt/sa3dQKF May 7, 2022 at 12:39PM
-
Show HN: Jsonnet Course Online Hi HN! I'm usually a lurker here, but I wanted to share this: I'm an enthusiastic user of Jsonnet[1] ...
No comments:
Post a Comment