Yes, I’ve heard it a million times by now. AMD drivers on GNU/Linux are great for anything that’s related to display or gaming. But what about compute? How is the compute experience for AMD GPUs?

Will it be like Nvidia GPUs, where, no matter what I do, as long as a programme supports hardware acceleration, it supports my GPU? Or is there some sort of configuration or trickery I need to do for programmes to recognise my GPU?

For example, how well is machine learning, such as LLMs or image generation, supported on AMD GPUs?

I know from past benchmarks that, for example, Blender’s performance has always been worse on AMD GPUs because the software quality just wasn’t there.

I use my GPU mostly for production tasks: Blender, image editing, and some machine learning inference such as text generation and image generation. And lastly, video games.

With this use case in mind, does it make sense to switch to AMD for a future production-first, video-games-second PC? Or, with that use case, should I just stick with Nvidia’s price gouging and VRAM gimping?

  • BusyBoredom@lemmy.ml · 2 days ago

    I have been doing a bit of compute work on nixos with both AMD and nvidia, and I’d say it depends on what you’re doing.

    If you’re doing your compute via compute shaders, you’ll have a great experience on AMD. Zero hiccups for me, I just wrote my shaders and ran them no problem. Vulkan is incredible.

    If you have to interact with other people’s compute crap though, it might be a bad time. Most folks do GPU compute with CUDA, and that won’t be fun for you on AMD. Yes, there are translation layers, and you can make them work for some use cases, but it’s a bad experience. And yeah, ROCm exists… but does it really? Not many cards actually support ROCm, and software support for it is just as sparse.
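
    A minimal sketch of what framework-level support looks like when it does exist, assuming a ROCm build of PyTorch: the AMD GPU is still addressed through the familiar torch.cuda API (backed by HIP), so CUDA-flavoured framework code can run unchanged, provided the framework actually ships a ROCm build for your card.

    ```python
    # Sketch, assuming a ROCm build of PyTorch: AMD GPUs show up through the
    # same torch.cuda API (HIP under the hood), so device-selection code
    # written "for CUDA" doesn't need to change.
    import torch

    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    if device.type == "cuda":
        print("Compute device:", torch.cuda.get_device_name(0))

    x = torch.randn(4096, 4096, device=device)
    y = x @ x  # matrix multiply on the GPU (HIP on ROCm, CUDA on Nvidia)
    print(y.sum().item())
    ```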

  • Sims@lemmy.ml · 2 days ago

    I have not tested it, but ZLUDA is a drop-in CUDA replacement for non-Nvidia GPUs. The speed should be great. You could check if your go-to card is supported…

  • utopiah@lemmy.ml · 2 days ago

    A friend of mine is a researcher working on large-scale compute (>200 GPUs) who is perfectly aware of ROCm, and sadly he said last month “not yet”.

    So I’m sure it’s not infeasible, but if it’s a real use case for you (not just testing a model here and there, but running frequently), you might unfortunately have to consider alternatives, or be patient.

  • panda_abyss@lemmy.ca · 2 days ago

    ROCm is still a huge pain in the ass to use with PyTorch.

    I’m sure it’s fine for basic stuff, but the AI side is a mess.

    I can run most models, but none of the new attention architectures. Getting diffusion to work reliably is also difficult, though I do have that working.
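
    For context, the diffusion setup roughly looks like the sketch below, assuming a ROCm build of PyTorch (the AMD card is still targeted as "cuda") and the Hugging Face diffusers library; the checkpoint name is only an example.

    ```python
    # Rough sketch of running a diffusion pipeline on a ROCm PyTorch build.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5",  # example checkpoint, swap for your own
        torch_dtype=torch.float16,
    )
    pipe = pipe.to("cuda")  # HIP device on ROCm, CUDA device on Nvidia

    image = pipe("a watercolour fox in a forest").images[0]
    image.save("fox.png")
    ```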

  • 0xf@lemmy.ml · 2 days ago

    Worked for me on Ubuntu. The instructions from AMD are only tailored to one distribution. I think the easiest way to use it is through Docker. I don’t want proprietary driver conflicts with my gaming setup.

  • monovergent@lemmy.ml · 2 days ago

    I think ROCm is fine with more recent cards, but getting it to work on my RX 480, for which ROCm dropped official support a while ago, was a real pain.

  • juipeltje@lemmy.world · 2 days ago

    Well, the good news is that AMD, from my understanding at least, works much better for deep learning on Linux than on Windows, because the ROCm drivers are much better than the OpenCL Windows drivers. ROCm still lags behind Nvidia though, as with most things when it comes to AMD vs Nvidia, so I’d say it depends on how important it is for you to get the better-performing card. Nvidia drivers have been getting better on Linux, so it should be doable to use an Nvidia card. But it sucks, cause I agree with you that Nvidia as a company is ass lol.

  • anon5621@lemmy.ml · 2 days ago

    Unfortunately, the state of ROCm sucks; currently nothing can beat Nvidia’s CUDA stack.

  • OhNoMoreLemmy@lemmy.ml · 2 days ago

    You can get a very good idea of what works by just looking for AMD GPU cloud compute.

    If it were usable and cheaper, everyone would be offering it. As far as I can see, it’s still experimental, and only the smaller players, e.g. IBM and Oracle, are pushing it.

  • c10l@lemmy.world · 2 days ago

    I can run Ollama. I haven’t tried to do much more than that.

    I run a Debian host and honestly can’t recall if I ran it directly or in Docker, but it worked and had pretty good performance on a 7900 XTX.
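
    If anyone wants to reproduce the sanity check, here is a quick sketch against Ollama’s default local HTTP API, assuming it is listening on port 11434 and that a model (the name below is just an example) has already been pulled.

    ```python
    # Sketch: ask a locally running Ollama instance for a short completion
    # via its /api/generate endpoint (default port 11434).
    import json
    import urllib.request

    payload = json.dumps({
        "model": "llama3",                      # example model, e.g. `ollama pull llama3`
        "prompt": "Say hello in one sentence.",
        "stream": False,
    }).encode()

    req = urllib.request.Request(
        "http://localhost:11434/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        print(json.loads(resp.read())["response"])
    ```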