Unlocking the Effective Context Length: Benchmarking the Granite-3.1-8b Model
Large language models (LLMs) are evolving rapidly, enabling applications such as chatbots, code generation, and knowledge extraction. One crucial factor influencing their effectiveness is the context length: the number of tokens a model can process at once. While theoretical context lengths continue to grow, the practical, effective context length (ECL) determines real-world usability. In this blog post, we will explore the ECL of the Granite-3.1-8b instruct model and validate its capabilities across various tasks. The study takes its inspiration from the paper "Measuring Effective Context Length
Introducing RHEL AI 1.4: Powering the Next Wave of Generative AI Innovation
The field of generative AI (gen AI) is evolving rapidly, and Red Hat Enterprise Linux AI (RHEL AI) is at the forefront of this transformation. RHEL AI 1.4 introduces support for a new, powerful Granite 3.1 model with multi-language support and a larger context window. The latest version also adds capabilities for evaluating customized and trained models with Document-Knowledge bench (DK-bench), a benchmark designed to measure the knowledge a model has acquired. Finally, we’re introducing the developer preview of our new graphical UI that helps to simplify data ingestion and writing InstructL
Empowering non-technical users to contribute knowledge and enhance model responses
In every release of our products, the User Experience Design team at Red Hat (UXD) prioritizes implementing work that directly relates to customer feedback and insights. For a nascent product like RHEL AI, we have had to be creative about doing so. Feedback channels we used for this upcoming release to help us continuously improve the experience include upstream community engagement and direct feedback from users via Slack, interviews with internal team members, interviews with AI users and enthusiasts who may be interested in adopting RHEL AI, and examining our competitive landscape. This has
Evolving our middleware strategy
Editor’s note: Earlier today, president and chief executive officer of Red Hat, Matt Hicks, shared the following email with Red Hatters. Hi all, Today, we’re sharing that Red Hat and IBM will join forces to secure the future of the Java application ecosystem for our customers. The Red Hat middleware engineering and product teams will join the IBM Data Security, IAM, and Runtimes organization, forming a single team. This move supports and grows our combined customer footprint, and builds a unified product strategy for the future of Java applications and Integration solutions in the era of
Open source AI: Red Hat’s point-of-view
TL;DR - Red Hat views the minimum criteria for open source AI as open source-licensed model weights combined with open source software components. More than three decades ago, Red Hat saw the potential of how open source development and licenses can create better software to fuel IT innovation. Thirty million lines of code later, Linux has become not only the most successful open source software but the most successful software to date. Our commitment to open source principles continues today, not only in our business model, but in our corporate culture as well. We believe that these c
Nominations now open for the OpenShift Superhero Awards
Each member of the OpenShift community is a hero, helping contribute to the project’s success and growth. But some members really stand out - they are the advocates and champions that make the community strong and successful. These are our superheroes. The vibrant OpenShift community is built on the contributions of its members and we are now giving users the opportunity to recognize those who help to advance the community. Nominations are currently open for the OpenShift Superhero Awards, acknowledging users for their contributions, participation and innovation. OpenShift superheroes are ma
The EU Cyber Resilience Act - what you need to know
Today marks a new milestone in European cybersecurity: the Cyber Resilience Act (CRA) has been published in the EU’s Official Journal, bringing significant changes for businesses operating in the EU. But what does this mean for companies and users alike, and how is Red Hat positioned to support your needs in the new landscape? The CRA is a robust new legislative framework aimed at enhancing the cybersecurity of (hardware and software) products with digital elements - everything from smart home devices to complex operating systems in critical national infrastructure. The CRA enters into force
Red Hat to Contribute Comprehensive Container Tools Collection to Cloud Native Computing Foundation
The continued importance of cloud-native applications in an AI and hybrid cloud-centric world demands an open, more accessible ecosystem of development tools. Today, we’re pleased to help drive cloud-native evolution further into the next generation of IT with our intent to contribute a comprehensive set of container tools to the Cloud Native Computing Foundation (CNCF), including bootc, Buildah, Composefs, Podman, Podman Desktop and Skopeo. Upon acceptance by the CNCF, the contributed tools will become hosted projects – alongside technologies like Kubernetes, Prometheus, Helm and many more
The State of Platform Engineering in the Age of AI
Platform engineering has changed how organizations develop, deploy and manage applications by streamlining processes, improving efficiencies and fostering collaboration. As organizations now look to integrate generative AI (gen AI) as quickly and as often as possible, platform engineering is proving to be critical to setting organizations up for success in adopting and implementing these groundbreaking new technologies. Our inaugural State of Platform Engineering in the Age of AI report examines trends, challenges and best practices from industry practitioners to help us to better understand h
How to make generative AI more consumable
Think about some of the past trends in technology, and you’ll start to see some patterns emerge. For example, with cloud computing there’s no one-size-fits-all approach. Combinations of different approaches, such as on-premises and different cloud providers, have led to organizations taking advantage of hybrid infrastructure benefits in deploying their enterprise applications. When we think about the future, a similar structure will be essential for the consumption of artificial intelligence (AI) across diverse applications and business environments. Flexibility will be crucial as no single
FAQ: Red Hat to acquire Neural Magic
What is being announced? On Nov. 12, 2024, Red Hat announced that it has signed a definitive agreement to acquire Neural Magic, subject to regulatory reviews and other customary closing conditions. What is Neural Magic? Neural Magic is a Somerville, Massachusetts-based early-stage company developing technology that helps organizations optimize AI models for greater performance and efficiency. Specifically, Neural Magic is a leader in the vLLM community project with expertise in quantization and sparsification. What does Neural Magic provide? Neural Magic provides
Creating cost effective specialized AI solutions with LoRA adapters on Red Hat OpenShift AI
Picture this: you have a powerful language model in production, but it struggles to provide satisfying answers for specific, niche questions. You try retrieval-augmented generation (RAG) and carefully crafted prompt engineering, but the responses still need to be revised. The next step might seem to be full model fine tuning, updating every layer of the model to handle your specialized cases, but that demands significant time and compute resources. What is LoRA? Low-rank adaptation (LoRA) is a faster and less resource-intensive fine tuning technique that can shape how a large language model re
An Introduction to TrustyAI
TrustyAI is an open source community dedicated to providing a diverse toolkit for responsible artificial intelligence (AI) development and deployment. TrustyAI was founded in 2019 as part of Kogito, an open source business automation community, as a response to growing demand from users in highly regulated industries such as financial services and healthcare. With increasing global regulation of AI technologies, toolkits for responsible AI are an invaluable and necessary asset to any MLOps platform. Since 2021, TrustyAI has been independent of Kogito, and has grown in size and scope amidst the
AI: Red Hat's vision for an open source future
The AI landscape is evolving at an electrifying pace. Just as with any technological leap, the question arises: what path will best shape its future? Red Hat believes the answer is clear: The future of AI is open source. This isn’t just a philosophical stance; it’s an approach focused on unlocking the true value of AI and making it something far more accessible, far more democratized and far more powerful. We have always believed in the power of open source development in driving innovation. We’ve seen this play out in the rise of Linux, KVM, OpenStack, Kubernetes and many other projects th
Introducing Climatik: Power capping AI applications for data center sustainability
As the demand for artificial intelligence (AI) and cloud computing grows, so does the power consumption of data centers. Data centers are the backbone of the modern digital economy, but they are also some of the biggest contributors to global energy usage. With the generative AI (gen AI) explosion and the increased demand for AI workloads and power, there has also been an uptick in technology’s carbon footprint. This presents a critical challenge: how can data centers manage their energy consumption without sacrificing the performance needed to support their AI applications? The new Climatik
Get ready for Red Hat Summit 2025: What past speakers want you to know
We’re excited about Red Hat Summit in Boston, MA from May 19-22, 2025. This event is a chance for IT professionals, enterprise leaders, and open source technologists to come together and explore topics like automation, AI, and security. Join us to hear from industry experts and gain insights from leaders shaping technology's future. You'll be part of a dynamic community eager to share knowledge, spark meaningful conversations, and drive innovation. We’ve gathered insights from past speakers who shared their knowledge and experiences. Their stories reflect the excitement of connecting with a
Getting to know Fran Heeran, Vice President, Global Telecommunications, Red Hat
Heeran has over 30 years of experience in the software industry and has spent the past 25 years in the telecom sector. He joins Red Hat to lead the Global Telecommunications organization to accelerate Red Hat’s open source leadership in the telecommunications network environment by closely collaborating with Red Hat’s communities, customers and partners. Heeran will work to scale Red Hat’s telco vision and strategy within the industry and the wider enterprise space. Prior to joining Red Hat, he served in executive technology and business roles with several globally renowned org
Open to options: How to build your modern virtualization strategy
Virtualization has long been the backbone of IT infrastructure and, at its core, is an exercise in modernisation, efficiency and value. When paired with a hybrid cloud strategy, in which workloads live on-premises, in the cloud, and in edge environments, it has transformed how enterprises manage IT resources. It helps enterprises innovate faster and adapt to increasingly complex IT ecosystems. Right now, we are seeing the virtualization landscape change rapidly, driven by new dynamics and vendor acquisitions. With this, enterprises are experiencing challenges like rising costs and struggling to r
2024 enterprise trends: cloud meets AI
The digital transformation opportunity has never been greater, especially with cloud now being generally embraced as an enabler. The generative AI (gen AI) boom has further sharpened the focus on modernisation as enterprises assess how they can take advantage. They want a cloud environment that offers choice, flexibility and independence to more safely experiment with, and adopt, new technologies. We set out to discover how organisations are approaching their cloud strategy into 2025, their appetite for AI, and the barriers to adoption of emerging innovations. A survey of 609 enterprise IT mana
Red Hat on Red Hat: How we use our AI solutions to power improved customer experience
Red Hat works closely with customers to provide the AI tools they need to build, deploy, monitor and use AI models and applications, but how do we utilize the power of AI for ourselves? Our Experience Engineering (XE) team is on it. The XE team collaborates with experts across Red Hat to deliver AI solutions to help our customers solve in-the-moment challenges via the Red Hat Customer Portal as we speak. Troubleshooting recommendations: Troubleshooting recommendations are powered by a text-embedding AI model developed in collaboration with IBM Research, fine-tuned for re-ranking purposes. More sp