• De Lancre@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    4 hours ago

    You don’t need 170+ GB of VRAM. Whole model can be run at around 1 token/second on a modern hardware from an ssd. Which is slow, don’t get me wrong, but it still somewhat useable.