Thanks! Funny thing, we did not use Vision models but text only with the HTML of... | Hacker News


DanyWin on March 14, 2024 | on: LaVague: Open-source Large Action Model to automat...

Thanks! Funny thing: we did not use vision models, only text, with the HTML of the current page as input. However, we intend to add vision to boost performance.

jerpint on March 14, 2024

Interesting that it’s not vision-based. I suspect you will get much better performance once vision is incorporated, e.g. using LLaVA-style models.
