My Lemmy Oracle
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
corbin@infosec.pub to Technology@lemmy.worldEnglish · 2 days ago

Wikipedia has banned AI-generated text, with two exceptions

www.howtogeek.com

external-link
message-square
88
fedilink
624
external-link

Wikipedia has banned AI-generated text, with two exceptions

www.howtogeek.com

corbin@infosec.pub to Technology@lemmy.worldEnglish · 2 days ago
message-square
88
fedilink
Begone, AI slop.
  • errer@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    8
    ·
    2 days ago

    Wikipedia probably wants to sell access to LLMs to train. It’s only valuable if Wikipedia remains a high-quality, slop-free source.

    I think even AI zealots think there should be silos of content to train from that are fully human generated. Training slop on slop makes the slop even worse.

    • Grimy@lemmy.world
      link
      fedilink
      English
      arrow-up
      18
      ·
      2 days ago

      Sell licenses of what? It’s already all in the creative commons iirc.

      • Zagorath@quokk.au
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        2
        ·
        1 day ago

        The content is CC licensed, but they are trying to block AI scraping because it overloads their servers. They have a paid API that uses a lot less compute for both Wikipedia and the AI, as well as being a revenue source for Wikipedia.

        • ricecake@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          1
          ·
          16 hours ago

          Yes, but…

          https://en.wikipedia.org/wiki/Wikipedia%3ADatabase_download

          That’s because viewing the page uses server resources, as done API access. If you want the data you can download the database directly.

    • SuspciousCarrot78@lemmy.world
      link
      fedilink
      English
      arrow-up
      12
      ·
      2 days ago

      AI already trains on Wikipedia.

      https://commoncrawl.org/

    • MountingSuspicion@reddthat.com
      link
      fedilink
      English
      arrow-up
      8
      ·
      2 days ago

      This was only done because the editors pushed to minimize AI involvement. There’s a comment here already mentioning that: https://lemmy.world/comment/22826863

Technology@lemmy.world

technology@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


  • @[email protected]
  • @[email protected]
  • @[email protected]
  • @[email protected]
Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 3.48K users / day
  • 9.39K users / week
  • 16.2K users / month
  • 30.8K users / 6 months
  • 1 local subscriber
  • 83K subscribers
  • 19.8K Posts
  • 830K Comments
  • Modlog
  • mods:
  • L3s@lemmy.world
  • enu@lemmy.world
  • Technopagan@lemmy.world
  • L4sBot@lemmy.world
  • L3s@hackingne.ws
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org