Growing India News, world news, nation news, our news, people's news, grow news, entertainment, fashion, movies, tech, automobile and many more..
Thursday, April 18, 2024
Show HN: Trained Tiny Tales GPT(30M model)from scratch and deployed in $15 https://ift.tt/dcjRweK
Show HN: Trained Tiny Tales GPT(30M model)from scratch and deployed in $15 For the last few weeks, I have been working on training an LLM from scratch and deploying it in production on Google Cloud Platform. Finally, I trained a 30 million parameter model on 1 billion tokens and deployed it as a web service. You can access the LLM using this site - https://ift.tt/0aBXfI3 The following steps were taken to build Tiny Tales GPT 1. Downloaded and preprocessed 8GB of dataset using multiprocessing library. 2. Tokenized the data using byte pair encoding to create 1 billion tokens sharded in different bin files. 3. Defined a training setup and trained the model on a small version of the LLaMA model architecture with 30 million parameters. 4. The training was done using Distributed Data-Parallel on two A-100 GPUs provided by JarvisLabs.ai (they are most cost-optimized) 5. After the training is done, an inference script is created to predict the tokens from the trained model given the input context vector. 6. Developed REST-based API service using Flask framework to interact with the inference service to the end user. 7. Finally used GCP's virtual machines, instance groups, load balancers, and DNS services to deploy the service on the internet. https://ift.tt/RfqQg8z April 18, 2024 at 02:25AM
Subscribe to:
Post Comments (Atom)
Show HN: Pocket2Linkding – Migrate from Mozilla Pocket to Linkding https://ift.tt/IwYJfju
Show HN: Pocket2Linkding – Migrate from Mozilla Pocket to Linkding With the Mozilla Pocket shutdown coming up in about two weeks, I thought ...
-
Show HN: An AI logo generator that can also generate SVG logos Hey everyone, I've spent the past 2 weeks building an AI logo generator, ...
-
Show HN: Snap Scope – Visualize Lens Focal Length Distribution from EXIF Data https://ift.tt/yrqHZtDShow HN: Snap Scope – Visualize Lens Focal Length Distribution from EXIF Data Hey HN, I built this tool because I wanted to understand which...
-
Show HN: Federated IndieAuth Server implemented as a notebook https://ift.tt/32IC633 April 27, 2021 at 04:37PM
No comments:
Post a Comment