Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Thanks! Funny thing, we did not use Vision models but text only with the HTML of the current page. However, we intend to add it to boost performance


Interesting that it’s not vision based, I suspect you will get much better performance once vision is incorporated, using e.g LLaVa style models




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: