My Lemmy Oracle
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to Technology@lemmygrad.mlEnglish · 29 days ago

Granite 4.1: IBM's 8B Model Is Competing With Models Four Times Its Size - Firethering

firethering.com

external-link
message-square
10
fedilink
  • cross-posted to:
  • [email protected]
13
external-link

Granite 4.1: IBM's 8B Model Is Competing With Models Four Times Its Size - Firethering

firethering.com

☆ Yσɠƚԋσʂ ☆@lemmygrad.ml to Technology@lemmygrad.mlEnglish · 29 days ago
message-square
10
fedilink
  • cross-posted to:
  • [email protected]
IBM just released Granite 4.1, a family of open source language models built specifically for enterprise use. Three sizes, Apache 2.0 licensed and trained on 15 trillion tokens with a level of pipeline obsession that's worth understanding. But there's one result in the benchmarks I keep coming back to. The 8B model. Dense architecture, no MoE tricks, no extended reasoning chains. It matches or beats Granite 4.0-H-Small across basically every benchmark they ran. That older model has 32B parameters with 9B active. This one has 8 billion. Full stop. That result is either very impressive or it means the old model was underbuilt. Probably both. Here's how they built it, what the numbers actually say, and whether any of it matters for your use case.
  • ☆ Yσɠƚԋσʂ ☆@lemmygrad.mlOP
    link
    fedilink
    arrow-up
    4
    ·
    29 days ago

    It’s honestly incredible to see because 8b is getting to the point where it will run well on a lot of consumer hardware. If we can get current frontier performance at that size, then you really would be able to solve most tasks locally.

    • CriticalResist8@lemmygrad.ml
      link
      fedilink
      arrow-up
      5
      ·
      28 days ago

      The 4-bit quantized GGUF for granite 4.1 is sub 5GB, so it’s probably going to run on any modern machine even if it’s not particularly built for Vram… 6 gigs is what I had on my old 1080 gpu.

      https://huggingface.co/unsloth/granite-4.1-8b-GGUF/tree/main

      • ☆ Yσɠƚԋσʂ ☆@lemmygrad.mlOP
        link
        fedilink
        arrow-up
        4
        ·
        28 days ago

        🎉

Technology@lemmygrad.ml

technology@lemmygrad.ml

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

A tech news sub for communists

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 81 users / day
  • 185 users / week
  • 325 users / month
  • 776 users / 6 months
  • 1 local subscriber
  • 1.43K subscribers
  • 1.72K Posts
  • 4.54K Comments
  • Modlog
  • mods:
  • Muad'Dibber@lemmygrad.ml
  • burlemarx@lemmygrad.ml
  • egs81t@lemmygrad.ml
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org