Red Hat Delivers Enhanced AI Inference Across AWS — TradingView News

AI News


Red Hat, the world’s leading provider of open source solutions, announced an expanded collaboration with Amazon Web Services (AWS) to power enterprise-grade generative AI (gen AI) on AWS using Red Hat AI and AWS AI silicon. With this collaboration, Red Hat is focused on providing IT decision makers with the flexibility to run high-performance, efficient AI inference at scale, regardless of the underlying hardware.

The rise of AI and the resulting need for scalable inference is forcing organizations to reevaluate their IT infrastructure. As a result, IDC predicts that “by 2027, 40% of organizations will use custom silicon, including ARM processors and AI/ML-specific chips, to meet the growing demand for performance optimization, cost efficiency, and specialized computing.” 1 This highlights the need for optimized solutions that increase processing power, minimize costs, and enable faster innovation cycles for high-performance AI applications.

The collaboration between Red Hat and AWS enables organizations to realize a full-stack AI strategy by integrating Red Hat’s comprehensive platform capabilities with AWS cloud infrastructure and AI chipsets, AWS Inferentia2 and AWS Trainium3. The main aspects of collaboration are:

  • Red Hat AI Inference Server on AWS AI Chip: Red Hat AI Inference Server, powered by vLLM, can now run on AWS AI chips such as AWS Inferentia2 and AWS Trainium3, providing a common inference layer that can support any generation of AI models, helping customers achieve higher performance, lower latency, and cost efficiency as they scale their production AI deployments, up to 30-40% faster than comparable GPU-based Amazon EC2 instances today. Delivering excellent price performance.
  • Enabling AI on Red Hat OpenShift: Red Hat collaborated with AWS to develop the AWS Neuron Operator for Red Hat OpenShift, Red Hat OpenShift AI, and Red Hat OpenShift Service on AWS, a comprehensive and fully managed application platform on AWS, giving customers a more seamless and supported pathway to run AI workloads using AWS accelerators.
  • Ease of access and deployment: With support for AWS AI Chips, Red Hat provides enhanced and easier access to high-demand high-capacity accelerators for Red Hat customers on AWS. Additionally, Red Hat recently released the amazon.ai Certified Ansible Collection for Red Hat Ansible Automation Platform, which enables orchestration of AI services on AWS.
  • Contribution to the upstream community: Red Hat and AWS are working together to optimize the AWS AI chip plugin that is upstreamed to vLLM. As a leading commercial contributor to vLLM, Red Hat is committed to enabling vLLM on AWS to accelerate AI inference and training capabilities for users. vLLM is also the foundation of llm-d, an open source project focused on delivering inference at scale, and is now available as a commercially supported feature in Red Hat OpenShift AI 3.

Red Hat has a long history of working with AWS to support customers from the data center to the edge. This latest milestone is designed to address the evolving needs of organizations as they integrate AI into their hybrid cloud strategies to achieve optimized and efficient AI-generated outcomes.

availability

For customers using Red Hat OpenShift or Red Hat OpenShift Service on AWS, the AWS Neuron Community Operator is now available on the Red Hat OpenShift OperatorHub. Support for AWS AI chips in Red Hat AI Inference Server is scheduled to be available in developer preview in January 2026.

support quotes

Joe Fernandez, Vice President and General Manager, Red Hat AI Business Unit

“Building on Red Hat’s open source heritage, this collaboration aims to make generative AI more accessible and cost-effective across hybrid cloud environments.”

Colin Brace, Vice President, AWS, Annapurna Labs

“Enterprises are looking for solutions that offer superior performance, cost efficiency, and operational choice for mission-critical AI workloads. AWS designed the Trainium and Inferentia chips to make high-performance AI inference and training more accessible and cost-effective. In collaboration with Red Hat, we’re delivering generative AI by combining the flexibility of open source with AWS infrastructure and purpose-built AI accelerators to accelerate time-to-value from pilot to production. We provide our customers with a supported path to deploy at scale.”

Jean-François Gamache, Chief Information Officer and Vice President, Digital Services, CAE

“Modernizing our critical applications with Red Hat OpenShift Service on AWS is a key milestone in our digital transformation. This platform supports our developers to focus on high-value initiatives, drives product innovation, and accelerates AI integration across our solutions. Red Hat OpenShift provides the flexibility and scalability that enables you to make a real impact, from actionable insights with live virtual coaching to significantly reduced cycle times for user-reported issues.”

Anurag Agrawal, Founder and Chief Global Analyst, Techaisle

“As the cost of AI inference continues to rise, enterprises are prioritizing efficiency along with performance. This partnership embodies Red Hat’s ‘Any Model, Any Hardware’ strategy by combining an open hybrid cloud platform with the clear economic benefits of AWS Trainium and Inferentia. It enables CIOs to move from costly experimentation to sustainable, managed production and operationalize generative AI at scale.”

1IDC FutureScape: Worldwide Cloud 2025 Forecasts, October 28, 2024, Document #US52640724

additional resources

  • Find Red Hat on AWS Marketplace
  • Sign up for a 60-day trial of Red Hat AI Inference Server
  • Learn more about Red Hat AI
  • Explore the benefits of AI inference

Connect with Red Hat

  • Learn more about Red Hat
  • Get more news from the Red Hat Newsroom
  • Read the Red Hat blog
  • Follow Red Hat on X
  • Follow Red Hat on Instagram
  • Watch Red Hat videos on YouTube
  • Follow Red Hat on LinkedIn

About Red Hat Co., Ltd.

Red Hat is a leader in open hybrid cloud technology, providing a reliable, consistent, and comprehensive foundation for breakthrough IT innovation and AI applications. Our portfolio of cloud, developer, AI, Linux, automation, and application platform technologies enables any application anywhere, from the data center to the edge. As the world’s leading provider of enterprise open source software solutions, Red Hat invests in open ecosystems and communities to solve tomorrow’s IT challenges. Red Hat works with partners and customers to help them build, connect, automate, secure, and manage their IT environments, supported by consulting services and award-winning training and certification services.

Forward-looking statements

Except for the historical information and discussion contained herein, statements contained in this press release may constitute forward-looking statements within the meaning of the Private Securities Litigation Reform Act of 1995. Forward-looking statements are based on the Company’s current assumptions regarding future business and financial performance. These statements involve a number of risks, uncertainties and other factors that could cause actual results to differ materially. The forward-looking statements in this press release speak only as of the date on which they are made. We undertake no obligation to update or revise any forward-looking statements, except as required by law.

###

Red Hat, the Red Hat logo, and OpenShift are trademarks or registered trademarks of Red Hat, Inc. or its subsidiaries in the United States and other countries.

For more information, please contact us below.

Orient Planet Group (OPG)

Email: media@orientplanet.com

Website: www.orientplanet.com

Please send your press release to pressrelease.zawya@lseg.com.

Disclaimer: The content of this press release has been provided by an external third-party provider. This website is not responsible for and has no control over such external content. This content is provided “as is” and “as available” and has not been edited in any way. Neither this website nor any of our affiliates guarantee the accuracy of or endorse the views and opinions expressed in this press release.

Press releases are provided for informational purposes only. This content does not provide tax, legal, or investment advice or opinion regarding the suitability, value, or profitability of any particular security, portfolio, or investment strategy. Neither this website nor any of our affiliates will be responsible for any errors or inaccuracies in the content or for any actions you may take based on the content. You expressly agree that your use of the information in this article is at your own risk.

To the maximum extent permitted by applicable law, this Website, its parents, subsidiaries, affiliates, and their respective shareholders, directors, officers, employees, agents, advertisers, content providers and licensors, shall not be liable for any liability whatsoever, whether in negligence, tort, contract or otherwise. Regardless of any other theory of liability, we will not be liable (jointly or severally) to you for any direct, indirect, consequential, special, incidental, exemplary or punitive damages, including, without limitation, lost profits, lost savings, or lost revenue. EVEN IF THE PARTIES HAVE BEEN ADVISED OR CAN HAVE FORESEEED THE POSSIBILITY OF SUCH DAMAGES.



Source link