Running Llama 70B as an on-demand cloud inference endpoint costs roughly $16,000 per month. Running Llama 8B costs about $734. For teams where an 8B model meets the quality bar for their workload, that gap is very hard to ignore. The question enterprise teams are asking is rarely, "how do we get the most powerful model?" It is almost always, "how do we get a model that's fast enough, accurate enough, and affordable enough to run reliably in our environment?" Those are different questions, and they often lead to different answers, pointing toward smaller models more often than teams expect. The c
5 reasons to go with your team to Red Hat Summit 2026
Red Hat Summit is where the global community comes together to solve the industry's biggest challenges, and there is no better way to navigate that future than with your team by your side. Register today to join us in Atlanta, May 11-14.
Red Hat Further Drives Digital Sovereignty for the AI Era with Red Hat OpenShift on Google Cloud Dedicated
We’re bringing Red Hat OpenShift to Google Cloud Dedicated, providing a sovereign-ready foundation for the AI era. This collaboration empowers organizations in highly regulated industries to
Disruption in the virtualization market has not slowed down. The fallout from industry licensing and packaging changes continues to push organizations into decisions they were not planning to make this year, and for many, the timelines are getting shorter, not longer. Over the past 12 months, we have worked with hundreds of organizations navigating exactly this situation, and at Red Hat Summit 2026 (May 11–14, Atlanta), many of them will share what they have learned. The early conversations were almost entirely about migration: how to move virtual machines (VMs) safely, how to avoid downtime,
Looking at the release notes or changelogs for QEMU upstream, you might notice that there's something new in version 11.0: SEV-SNP and TDX machines can now be reset. This is a feature we at Red Hat helped implement. The motivations and associated challenges have been explained in detail in a FOSDEM 2026 presentation. Before this feature was available, some confidential guests (AMD SEV-based guests) could be reset normally like other non-confidential guests. Other confidential guests (like TDX, SEV-ES and SEV-SNP guests) would terminate if a reset was attempted (for example, when you initiate a r
Extending confidential computing from individual workloads to the entire cluster is a new frontier in cloud-native security. Today, Red Hat is announcing the Developer Preview of confidential clusters for Red Hat OpenShift, a new feature of OpenShift that extends confidential computing to the cluster infrastructure level. Confidential clusters establish hardware-rooted trust across every node in an OpenShift cluster, creating a fully attested, encrypted, and verifiable execution environment from the ground up. This Developer Preview is available today for OpenShift on Microsoft Azure, powered by
Last month Opera released its gaming-focused Opera GX web browser for Linux. It launched with RPM and Debian packages, and for those interested it is now also available in the Flatpak and Snap sandboxed app formats...
In addition to some network drivers on the chopping block due to AI bug reports for obsolete hardware/drivers and Linux 7.1 dropping various drivers for Russia's Baikal CPUs, the Linux 7.1 kernel as of today also dropped some obsolete PCMCIA host controller drivers...
Oracle announced today that it will reduce the frequency of software updates for Solaris 11.4 and its ZFS Storage Appliance software...