Found 6 items with this tag.
Weave is a lightweight toolkit designed to track and evaluate LLM applications, enhancing…
Galileo is a platform designed for building AI applications, focusing on reducing halluci…
Kolena is an AI evaluation platform that automates the assessment of large language model…
Encord is a leading data development platform that streamlines data management and enhanc…
Patronus AI is an innovative automated evaluation platform that helps enterprises identif…
Flow AI offers advanced tools for evaluating and merging language models, enhancing the d…