Did they accidentally train on that public piece of info they scraped anyway because they are scraping the whole web?
Or did they intentionally scrape chatgpt output to see if that would help?
Then after, train on raw data.
Did they accidentally train on that public piece of info they scraped anyway because they are scraping the whole web?
Or did they intentionally scrape chatgpt output to see if that would help?