• Grandwolf319@sh.itjust.works
    3 hours ago

    Our key finding is that by injecting information through an external synthetic data verifier, whether a human or a better model, synthetic retraining will not cause model collapse.

    Yeah, if you have a source of truth, then your model is basically being trained on that.

    It’s like already having the answer.

    • chunes@lemmy.world
      3 hours ago

      The point is that it only needs to comprise a very small part of the model.

      • Grandwolf319@sh.itjust.works
        1 hour ago

        My point was that having a verifier means you’re not really training a model on another model’s data; it’s basically as if you were getting new raw data from a non-AI source.