

If they scrape the updated comments again and ingest copyrighted text, you are poisoning the data.
I think you missed the part where it was strongly suggested that you “not” use copyrighted text.
The point is not to get rid of the original text. It’s to “poison” the training data.
Hear, hear!