Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Researchers at the University of Electro-Communications in Japan have shown that agents built on large language models (LLMs) can develop distinct personalities when they interact freely in a virtual ...