Google is cannibalizing the web to feed AI

Sahwa@reddthat.com · 2 months ago

Grandwolf319@sh.itjust.works · 2 months ago

Our key finding is that by injecting information through an external synthetic data verifier, whether a human or a better model, synthetic retraining will not cause model collapse.

Yeah, if you have a source of truth, then your model is basically getting trained on that.

It’s like already having the answer.

chunes@lemmy.world · 2 months ago

The point is that it only needs to comprise a very small part of the model.

Grandwolf319@sh.itjust.works · 2 months ago

My point was that having a verifier means you’re not really training a model on another model’s data; it’s basically as if you’re getting new raw data from a non-AI source.