• Tetsuo@jlai.lu
    1 day ago

    Isn’t AI training on itself a well-known thing to avoid? If I remember correctly, the “performance” goes to shit very quickly when you train a model on its own output.

    I doubt serious AI actors will make that mistake.

    But on its own, the way they just opened their torrent client and started downloading made me furious.

    In France you can still get caught downloading illegally, and it can have serious consequences. But for AI businesses, copyright holders seem to look the other way. Businesses get rights that ordinary citizens don’t, and it’s completely unfair.

      • Tetsuo@jlai.lu
        1 day ago

        I read this article carefully, but I don’t think it is about the issue I was mentioning.

        I was talking about “model collapse”, and this seems to be about multiple models training on similar datasets (shared learning resources) instead.

        • DrakeAlbrecht@lemmy.world
          21 hours ago

          Yeah, I think we’re talking about the same thing. I thought the article I linked was the one I had read about model inbreeding, but now that I look at it a bit closer, it’s probably the product of model inbreeding itself. ;) I thought there was an article published this year about the problem, but now I can’t find it to save myself. It’s possible that I’m hallucinating. My memory is worse than ChatGPT’s.