• 1 Post
  • 13 Comments
Joined 9 months ago
Cake day: March 22nd, 2024


  • But I am grateful for independent journalism, which is now my main hope for the future.

    Well guess who’s in control of eyeballs on those journalists?

    Social media companies, who have clear incentives to deprioritize such content and have repeatedly shown they do.

    Let’s reclaim music from the technocrats. They have not proven themselves worthy of our trust.

    While I agree with the article, I take issue with this line. These are not technocrats; they are “leaders” willing to make companies and their products objectively worse in the name of short-term profits. These aren’t ‘technical experts put in charge,’ they are greedy, spineless pigs.


  • They’re kinda already there :(. Maybe even worse than Raspberry Pis.

    Intel has all but said they’re exiting the training/research market.

    AMD has great hardware, but the MI300X is not gaining traction due to a lack of “grassroots” software support, and they were too stupid to undercut Nvidia by selling high-VRAM 7900s to devs, or to even prioritize their support in ROCm. Same with their APUs. For all the marketing, they just didn’t prioritize getting them runnable with LLM software.


  • B580 24GB and B770 32GB

    They would be incredible, as long as they’re cheap. Intel would be utterly stupid not to do this.

    With OpenAPI being backed by so many big names, do you think they will be able to upset CUDA in the future or has Nvidia just become too entrenched?

    OpenAI does not make hardware. Also, their model progress has stagnated, already matched or surpassed by Google, Claude, and even some open-source Chinese models trained on far fewer GPUs… OpenAI is circling the drain, forget them.

    The only “real” competitor to Nvidia, IMO, is Cerebras, which has a decent shot due to a silicon strategy Nvidia simply does not have.

    The AMD MI300X is actually already “better” than Nvidia’s best… but they just can’t stop shooting themselves in the foot, as AMD does. Google TPUs are good, but Google-only, like Amazon’s hardware. I am not impressed with Groq or Tenstorrent.


  • It’s complicated.

    So there’s Intel’s own project/library, which is the fastest way to run LLMs on their IGPs and GPUs. But also the hardest to set up, and the least feature packed.

    There’s more than one Intel-compatible llama.cpp ‘backend,’ including the Intel-contributed SYCL one, another PR for AMX support on CPUs, I think another one branded as ipex-llm, and the Vulkan backend that the main llama.cpp devs seem to be focusing on now. The problem is that each of these backends has its own bugs, incomplete features, installation quirks, and things it doesn’t support, while AMD’s ROCm kinda “just works” because it inherits almost everything from the CUDA backend.

    It’s a hot mess.
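    To make the fragmentation concrete, here is a rough sketch of how the two main llama.cpp backend choices differ at build time. The cmake flag names reflect llama.cpp’s options as I understand them and may have changed; treat this as illustrative, not authoritative, and check the repo’s build docs first.

    ```shell
    # Sketch only: two separate builds for two backends, each with its own
    # toolchain prerequisites. Verify flag names against llama.cpp's docs.

    # SYCL backend (Intel-contributed) — requires the full oneAPI toolkit
    # and Intel's icx/icpx compilers to be installed and sourced first.
    cmake -B build-sycl -DGGML_SYCL=ON \
          -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx
    cmake --build build-sycl --config Release

    # Vulkan backend — only needs the Vulkan SDK, works on Intel, AMD,
    # and Nvidia GPUs alike, but with its own feature gaps and quirks.
    cmake -B build-vulkan -DGGML_VULKAN=ON
    cmake --build build-vulkan --config Release
    ```

    The point isn’t the exact flags; it’s that every backend is a separate build with a separate dependency stack, which is exactly why casual self-hosters struggle to keep up.
    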

    Hardcore LLM enthusiasts largely can’t keep up, much less the average person just trying to self-host a model.

    oneAPI is basically a nothingburger so far. You can run many popular CUDA libraries on AMD through ROCm, right now, but that is not the case with Intel, and no devs are interested in changing that because Intel isn’t selling any “3090-class” GPU hardware worth buying.


  • So what IS their strategy now?

    Some of Pat’s initiatives were good (stay the course with Xe and fabs, which take a long time to pan out), but they kept delaying everything!

    Yet Intel is kind of screwed without good graphics or ML IP.

    If they spin off the fabs, I feel like they are really screwed, as they will be left with nothing but shrinking businesses and no multi year efforts to get out of it.

    Like… Even theoretically, I don’t know what I would do to right Intel as CEO unless they can fix whatever is causing the consistent delays, and clearly that’s not happening. What is their path?


  • As a fervent AI enthusiast, I disagree.

    …I’d say it’s 97% hype and marketing.

    It’s crazy how much FUD is flying around, and it legitimately buries good open research. It’s also crazy what these giant corporations are explicitly saying they’re going to do, and that anyone buys it. TSMC allegedly calling Sam Altman a ‘podcast bro’ is spot on, and I’d add “manipulative vampire” to that.

    Talk to any long-time resident of localllama and similar “local” AI communities who actually digs into this stuff, and you’ll find immense skepticism — not like the crypto-style AI bros you find on LinkedIn, Twitter, and such, who blot everything else out.