you literally have access to all the code in the world
I’d like to believe that they were honorable enough to not secretly train on code without people’s permission. But realistically they totally did exactly that, but just made the AI Model this incompetent through some other engineering blunder.
Also, random side thought - training only on public repos probably yields you way higher code quality as opposed to training on both public and private repos? I assume we all have some very messy private repos that we’re too embarrassed to publish because the code quality is absolute shit … right?
They didn’t check licenses in any way, as it did reproduce the famous quake fast inverse square root function, comments included. And quake, like majority of github projects, is published under GPL, which requires all copies and modifications to be published under GPL as well, after which all sane enterprises have banned copilot usage.
Though, we’re not living in sane times anymore. Chatgpt, gemini, deepseek, claude, all reproduce copylefted code left and right. Realistically, Stallman should’ve been rolling in cash by now…
I’m always so extremely confused about the trope of the personal project having shit quality… Like, if I’m doing something for myself, that’s exactly the place where I wanna do something amazing, like literally all my private projects have much higher quality than my work ones - because in the work ones I’m forced to use stupid conventions, old tools, am not supposed to touch “legacy” code, etc etc etc
As such, since companies have their private code on GitHub, that’s where I would expect the shittiness to come from, not personal private projects.
Like, if I’m doing something for myself, that’s exactly the place where I wanna do something amazing,
That’s always my intention with my personal projects too! But that always results in “Wow I just learned how to do this thing much better, let me refactor the whole project to do it perfectly everywhere” followed by my Adderall running out. So there’s just so many half-done refactors I either forget about or abandon because I get a new idea the next day, but that’s totally just a skill issue.
You’re right though, the code I write at work is much worse, but my Company hosts their own GitLab instance so the code we write can’t even be used to poison Copilot :(
I would love my personal projects to be of the highest quality but unfortunately i need to pay bills so i have to prioritize my work projects that get me paid
Maybe they meant abandoned projects that never quite got through the todo list but you’re right. Even my abandoned projects are generally better than the legacy I’ve seen lol
I’d like to believe that they were honorable enough to not secretly train on code without people’s permission. But realistically they totally did exactly that, but just made the AI Model this incompetent through some other engineering blunder.
Also, random side thought - training only on public repos probably yields you way higher code quality as opposed to training on both public and private repos? I assume we all have some very messy private repos that we’re too embarrassed to publish because the code quality is absolute shit … right?
Lol. Lmao
They didn’t check licenses in any way, as it did reproduce the famous quake fast inverse square root function, comments included. And quake, like majority of github projects, is published under GPL, which requires all copies and modifications to be published under GPL as well, after which all sane enterprises have banned copilot usage.
Though, we’re not living in sane times anymore. Chatgpt, gemini, deepseek, claude, all reproduce copylefted code left and right. Realistically, Stallman should’ve been rolling in cash by now…
I’m always so extremely confused about the trope of the personal project having shit quality… Like, if I’m doing something for myself, that’s exactly the place where I wanna do something amazing, like literally all my private projects have much higher quality than my work ones - because in the work ones I’m forced to use stupid conventions, old tools, am not supposed to touch “legacy” code, etc etc etc
As such, since companies have their private code on GitHub, that’s where I would expect the shittiness to come from, not personal private projects.
That’s always my intention with my personal projects too! But that always results in “Wow I just learned how to do this thing much better, let me refactor the whole project to do it perfectly everywhere” followed by my Adderall running out. So there’s just so many half-done refactors I either forget about or abandon because I get a new idea the next day, but that’s totally just a skill issue.
You’re right though, the code I write at work is much worse, but my Company hosts their own GitLab instance so the code we write can’t even be used to poison Copilot :(
I would love my personal projects to be of the highest quality but unfortunately i need to pay bills so i have to prioritize my work projects that get me paid
Maybe they meant abandoned projects that never quite got through the todo list but you’re right. Even my abandoned projects are generally better than the legacy I’ve seen lol