Taalas HC1: 17,000 tokens/sec on Llama 3.1 8B vs Nvidia H200’s 233 tokens/sec. 73x faster at one-tenth the power. Each chip runs ONE model, hardwired into the transistors.

  • altphoto@lemmy.today
    link
    fedilink
    arrow-up
    8
    ·
    15 hours ago

    Hopefully the low cost per kill drones get more affordable. Maybe load up Linux into one of those things and just break off the murderous knives.