Growing India News, world news, nation news, our news, people's news, grow news, entertainment, fashion, movies, tech, automobile and many more..
Monday, February 26, 2024
Show HN: Continuous-eval – Granular evaluation of GenAI pipelines https://ift.tt/j9mZ2gT
Show HN: Continuous-eval – Granular evaluation of GenAI pipelines Hi HN - we are the creators of “continuous-eval”, an open-source tool to test and evaluate generative AI apps. "Continuous-eval" came from our efforts to measure, validate and improve the reliability of a finance AI copilot we were developing for banks. End-to-end evaluation was not enough for us. We wanted to have granular evaluations that help pinpoint the bottlenecks and identify what / how to improve. We’ve since developed more metrics and made the framework more flexible so it can evaluate components like agent tool use, code change, retrieval steps, etc. Let us know what you think of our approach to GenAI App evaluation. https://ift.tt/iHl2P8I February 26, 2024 at 12:11AM
Subscribe to:
Post Comments (Atom)
Show HN: The Cascade Graph – An interactive map of AI and energy constraints https://ift.tt/O47EcUS
Show HN: The Cascade Graph – An interactive map of AI and energy constraints Hello, I wanted to share with you all a interactive map of the ...
-
Show HN: An AI logo generator that can also generate SVG logos Hey everyone, I've spent the past 2 weeks building an AI logo generator, ...
-
Show HN: Simple Gantt Chart Software https://ift.tt/sa3dQKF May 7, 2022 at 12:39PM
-
Show HN: Jsonnet Course Online Hi HN! I'm usually a lurker here, but I wanted to share this: I'm an enthusiastic user of Jsonnet[1] ...
No comments:
Post a Comment