Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.
Plus, in this week’s Installer: a new Mario Tennis, Sony’s great new buds, a wild time-travel movie, and much more.
一些您可能无法访问的结果已被隐去。
显示无法访问的结果