Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Intent matters I guess.

Did they accidentally train on that public piece of info they scraped anyway because they are scraping the whole web?

Or did they intentionally scrape chatgpt output to see if that would help?



They could have trained, then modified code, repeat, to better enhance training in the current version.

Then after, train on raw data.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: