• neidu2@feddit.nl
    link
    fedilink
    arrow-up
    24
    ·
    edit-2
    4 months ago

    Technically possible with a small enough model to work from. It’s going to be pretty shit, but “working”.

    Now, if we were to go further down in scale, I’m curious how/if a 700MB CD version would work.

    Or how many 1.44MB floppies you would need for the actual program and smallest viable model.

      • Ignotum@lemmy.world
        link
        fedilink
        arrow-up
        4
        ·
        4 months ago

        70b model taking 1.5GB? So 0.02 bit per parameter?

        Are you sure you’re not thinking of a heavily quantised and compressed 7b model or something? Ollama llama3 70b is 40GB from what i can find, that’s a lot of DVDs