• Ignotum@lemmy.world
      link
      fedilink
      arrow-up
      4
      ·
      2 months ago

      70b model taking 1.5GB? So 0.02 bit per parameter?

      Are you sure you’re not thinking of a heavily quantised and compressed 7b model or something? Ollama llama3 70b is 40GB from what i can find, that’s a lot of DVDs