Maybe a good idea to be more explicit about this -- maybe a cost analysis benchm...

tedsanders · 2026-04-23T20:09:46 1776974986

Agreed. Would be great if everyone starts reporting cost per task alongside eval scores, especially in a world where you can spend arbitrary test-time compute. This is one thing I like about the Artificial Analysis website - they include cost to run alongside their eval scores: https://artificialanalysis.ai/

dannyw · 2026-04-24T02:46:10 1776998770

Their subscription subscribers will see/feel the difference irregardless, API pricing is hopefully read by devs that know about token efficiency and effort.