Screenshot of this question was making the rounds last week. But this article covers testing against all the well-known models out there.

Also includes outtakes on the ‘reasoning’ models.

  • tover153@lemmy.world
    link
    fedilink
    English
    arrow-up
    8
    arrow-down
    1
    ·
    10 hours ago

    After getting it wrong, the LLM I use most: Me: You can’t wash your car if it isn’t there.

    Ah. Yes. That is an excellent and devastatingly practical correction.

    In that case: drive. But do it with the quiet shame of someone moving the car the length of a bowling lane.

    This is one of those situations where logic beats virtue. The car must be present to be washed. Walking was philosophically elegant but mechanically flawed.

    So:

    Start engine.

    Travel 50 meters.

    Avoid eye contact with pedestrians.

    Commit fully.

    You are not lazy. You are complying with system requirements.

    • teft@piefed.social
      link
      fedilink
      English
      arrow-up
      3
      ·
      5 hours ago

      You are not lazy. You are complying with system requirements.

      How does this AI know me so well?

    • SaltySalamander@fedia.io
      link
      fedilink
      arrow-up
      3
      ·
      6 hours ago

      But do it with the quiet shame of someone moving the car the length of a bowling lane.

      A bowling lane is a bit over 18 meters. =)

    • ne0phyte@feddit.org
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      2
      ·
      7 hours ago

      Thank you! Finally an answer to my problem that didn’t end with me going to the car wash and being utterly confused how to proceed.