Wednesday, August 9, 2023

Show HN: Ggml.js – Serverless AI Inference on Browser with Web Assembly https://ift.tt/fCirBUv

Show HN: Ggml.js – Serverless AI Inference on Browser with Web Assembly What is ggml.js? Run any ggml ported ML models directly on your web browser with ggml.js This project provides JavaScript bindings on ggml models, so that you can embed GGML models to your web apps to build serverless AI inferencing. Currently the framework supports following model types: - Dolly v2 - GPT2 - GPT J - GPT NEO X - MPT - Replit - StarCoder You can find more details about the framework: https://ift.tt/Gnj734Q For live demo examples, you can visit here: https://ift.tt/y76g2SV Source Code: https://ift.tt/m6L8djl Please do provide any feedback/questions/suggestions in the comments. I'll be happy to take it up :) https://ift.tt/Gnj734Q August 9, 2023 at 12:50AM

No comments:

Post a Comment

Show HN: Yet Another Memory System for LLM's https://ift.tt/0oZIwAv

Show HN: Yet Another Memory System for LLM's Built this for my LLM workflows - needed searchable, persistent memory that wouldn't bl...