• getFrog@piefed.social
    link
    fedilink
    English
    arrow-up
    23
    ·
    7 hours ago

    you literally have access to all the code in the world

    I’d like to believe that they were honorable enough to not secretly train on code without people’s permission. But realistically they totally did exactly that, but just made the AI Model this incompetent through some other engineering blunder.

    Also, random side thought - training only on public repos probably yields you way higher code quality as opposed to training on both public and private repos? I assume we all have some very messy private repos that we’re too embarrassed to publish because the code quality is absolute shit … right?

    • drath@lemmy.drath.ru
      link
      fedilink
      arrow-up
      7
      ·
      3 hours ago

      They didn’t check licenses in any way, as it did reproduce the famous quake fast inverse square root function, comments included. And quake, like majority of github projects, is published under GPL, which requires all copies and modifications to be published under GPL as well, after which all sane enterprises have banned copilot usage.

      Though, we’re not living in sane times anymore. Chatgpt, gemini, deepseek, claude, all reproduce copylefted code left and right. Realistically, Stallman should’ve been rolling in cash by now…

    • Azzu@lemmy.dbzer0.com
      link
      fedilink
      arrow-up
      11
      ·
      edit-2
      6 hours ago

      I’m always so extremely confused about the trope of the personal project having shit quality… Like, if I’m doing something for myself, that’s exactly the place where I wanna do something amazing, like literally all my private projects have much higher quality than my work ones - because in the work ones I’m forced to use stupid conventions, old tools, am not supposed to touch “legacy” code, etc etc etc

      As such, since companies have their private code on GitHub, that’s where I would expect the shittiness to come from, not personal private projects.

      • getFrog@piefed.social
        link
        fedilink
        English
        arrow-up
        8
        ·
        5 hours ago

        Like, if I’m doing something for myself, that’s exactly the place where I wanna do something amazing,

        That’s always my intention with my personal projects too! But that always results in “Wow I just learned how to do this thing much better, let me refactor the whole project to do it perfectly everywhere” followed by my Adderall running out. So there’s just so many half-done refactors I either forget about or abandon because I get a new idea the next day, but that’s totally just a skill issue.

        You’re right though, the code I write at work is much worse, but my Company hosts their own GitLab instance so the code we write can’t even be used to poison Copilot :(

      • Bluescluestoothpaste@sh.itjust.works
        link
        fedilink
        arrow-up
        6
        ·
        5 hours ago

        I would love my personal projects to be of the highest quality but unfortunately i need to pay bills so i have to prioritize my work projects that get me paid

      • vanillama@programming.dev
        link
        fedilink
        arrow-up
        3
        ·
        6 hours ago

        Maybe they meant abandoned projects that never quite got through the todo list but you’re right. Even my abandoned projects are generally better than the legacy I’ve seen lol