Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That M4 Max is really something else, I get also 70 tokens/second on eval on a RTX 4000 SFF Ada server GPU.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: