Cloudflare makes it easy to deploy AI applications globally with one click using Hugging Face

Despite significant advances in AI innovation, there remains a disconnect between its potential and the value it brings to business.

Cloudflare, Inc. (NYSE:NET), the leading connectivity cloud company, today announced that developers can now deploy AI applications to Cloudflare's global network with one click, directly from Hugging Face, the leading open, collaborative platform for AI builders.

With the general availability of Workers AI, Cloudflare becomes the first serverless inference partner integrated with the Hugging Face Hub for deploying models, allowing developers to deploy AI globally in a quick, easy, and affordable way, without managing infrastructure or paying for unused compute capacity.

Despite significant advances in AI innovation, there remains a disconnect between its potential and the value it brings to business. Organizations and their developers need the ability to experiment and iterate quickly and affordably without having to set up, manage, or maintain GPUs or infrastructure. Enterprises need an easy platform that unlocks speed, security, performance, observability, and compliance to quickly deliver innovative, production-ready applications to customers.

“With the recent boom in generative AI, companies across industries are investing huge amounts of time and money into AI. Some of it will work, but the real challenge with AI is that it is easy to demo and incredibly difficult to put into production,” said Matthew Prince, CEO and co-founder of Cloudflare.

“We can solve this problem by abstracting away the cost and complexity of building AI-powered apps. Workers AI is one of the most affordable and accessible solutions for running inference. And because Hugging Face and Cloudflare are deeply aligned in their efforts to democratize AI in a simple and affordable way, we are giving developers the freedom and agility to choose their model and scale their AI apps from zero to global in an instant.”

Workers AI is now generally available on GPUs deployed in over 150 cities around the world

Workers AI is now generally available and provides the end-to-end infrastructure needed to scale and deploy AI models efficiently and affordably for the next generation of AI applications. Cloudflare now has GPUs deployed in more than 150 cities around the world, most recently adding Cape Town, Durban, Johannesburg, and Lagos, its first locations in Africa, as well as Amman, Buenos Aires, Mexico City, Mumbai, New Delhi, and Seoul, to deliver low-latency inference worldwide. Workers AI is also being extended to support fine-tuned model weights, allowing organizations to build and deploy more specialized, domain-specific applications.

In addition to Workers AI, Cloudflare's AI Gateway provides a control plane for AI applications, allowing developers to dynamically evaluate and route requests to different models and providers. Eventually, developers will also be able to create fine-tunes and run the fine-tuning jobs directly on the Workers AI platform.
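
The idea of routing requests across models and providers can be illustrated with a minimal sketch. It does not use the AI Gateway API: the `route_with_fallback` helper and the two toy providers are hypothetical names introduced here, and each provider is a plain Python callable standing in for a real model endpoint. It shows only one simple policy, trying providers in order and falling back on failure.

```python
from typing import Callable, Sequence

# Hypothetical stand-in: a "provider" is any callable that takes a prompt
# and returns text, or raises an exception when it is unavailable.
Provider = Callable[[str], str]


def route_with_fallback(
    prompt: str, providers: Sequence[tuple[str, Provider]]
) -> tuple[str, str]:
    """Try each (name, provider) pair in order.

    Returns (name, response) for the first provider that succeeds and
    raises RuntimeError if every provider fails.
    """
    errors = []
    for name, provider in providers:
        try:
            return name, provider(prompt)
        except Exception as exc:  # a real gateway would distinguish error types
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


def flaky_model(prompt: str) -> str:
    """Toy provider that always fails, simulating an outage."""
    raise TimeoutError("simulated timeout")


def backup_model(prompt: str) -> str:
    """Toy provider that always succeeds."""
    return f"echo: {prompt}"


if __name__ == "__main__":
    chosen, reply = route_with_fallback(
        "hello", [("primary", flaky_model), ("backup", backup_model)]
    )
    print(chosen, reply)
```

A production control plane would typically add caching, rate limiting, and logging around the same loop; those are left out to keep the sketch short.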

Cloudflare powers one-click deployment with Hugging Face

With Workers AI generally available, developers can now deploy AI models with one click directly from Hugging Face, making it the fastest way to access a variety of models and run inference requests on Cloudflare's global GPU network.

Developers can choose one of the popular open source models and deploy it instantly by clicking “Deploy to Cloudflare Workers AI.” There are currently 14 hand-picked Hugging Face models optimized for Cloudflare's global serverless inference platform, covering three task categories: text generation, embeddings, and sentence similarity.

“We are excited to work with Cloudflare to make AI more accessible to developers,” said Julien Chaumond, co-founder and chief technology officer of Hugging Face. “Offering the most popular open models through a serverless API powered by GPUs around the world is a great proposition for the Hugging Face community, and we can’t wait to see what they build with it.”

AI-first companies are building with Workers AI

Companies around the world trust Workers AI and Cloudflare's global network to power their AI applications, including:

  • “Talkmap helps our customers discover and surface real-time conversation intelligence and insights. With millions of customer conversations every day and the need to deliver CX and EX results quickly, Cloudflare's developer platform has helped us keep storage costs and latency low. We chose Cloudflare to scale our generative AI services and simplify our runtime architecture so we can stay focused on adding customer value with conversation insights.” – Jonathan Eisenzopf, Founder and Chief Strategic Research Officer, Talkmap.
  • “ChainFuse transforms the chaos of unstructured data into actionable insights, ensuring every piece of customer feedback, every issue, and every opportunity is heard and evaluated. Using Cloudflare's products, we have successfully analyzed and categorized over 50,000 unique conversations from places like Discord, Discourse, Twitter, and G2. We can access 28 AI models on the fly for any task.” – George Portillo, Co-Founder of ChainFuse.com.
  • “Discourse.org is a modern open source discussion platform that powers over 20,000 online communities, from small hobby groups to forums for some of the world's largest companies. Discourse uses Cloudflare's Workers AI to run embedding models that power the popular ‘Related Topics’ feature. This produces more relevant results within communities and gives community members new opportunities to discover and engage with topics of interest. Workers AI is currently one of the affordable, open source ways to deliver Related Topics using a high-performance embedding model. It gives our customers a way to surface more relevant content and new ways to increase engagement among their community members.” – Saif Murtaza, AI Product Manager, Discourse.org.
  • “Simmer brings the swipe functionality of dating apps to the world of recipes and cooking, empowering couples to enjoy meals together. Simmer has continued to adopt Cloudflare products as its platform grows, and Workers AI was no different. We use Workers AI embeddings and large language models like Mistral 7B to give users a personalized experience in the app, including curated recipes based on their preferences. We go to Cloudflare first to see whether their products fit our use case, because they are very easy to use. Using them also saves us a lot of money.” – Ben Ankiel, CTO, Simmer.
  • “Audioflare uses AI to convert, inspect, compress, and translate short audio files into various languages. We streamline AI-related tasks such as audio file processing, sentiment assessment, and language translation. We're impressed with Cloudflare's ability to simplify the backend operations of our app. We believe in Cloudflare's consistent improvements and dedication, and we are confident in growing with their platform.” – Sean Oliver, creator of the open source LLM repository Audioflare.
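
Several of the customers above mention embeddings, for example to power a "Related Topics" feature. As a rough illustration of the general technique (not Discourse's or Cloudflare's actual implementation), related items can be found by ranking embedding vectors by cosine similarity. In the sketch below, the `related_topics` helper and the three-dimensional toy vectors are made up for illustration; in practice the vectors would come from an embedding model and have hundreds of dimensions.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def related_topics(
    query_id: str, embeddings: dict[str, list[float]], top_k: int = 2
) -> list[str]:
    """Return the top_k topic ids most similar to query_id, best match first."""
    query = embeddings[query_id]
    scored = [
        (cosine_similarity(query, vector), topic_id)
        for topic_id, vector in embeddings.items()
        if topic_id != query_id  # never recommend the topic itself
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [topic_id for _, topic_id in scored[:top_k]]


# Hand-made toy embeddings (hypothetical data, not model output).
TOY_EMBEDDINGS = {
    "sourdough starter": [0.9, 0.1, 0.0],
    "baking bread": [0.8, 0.2, 0.1],
    "router setup": [0.0, 0.1, 0.9],
    "wifi troubleshooting": [0.1, 0.0, 0.8],
}

if __name__ == "__main__":
    print(related_topics("sourdough starter", TOY_EMBEDDINGS, top_k=1))
```

At larger scale the linear scan would usually be replaced by a vector index, but the ranking principle stays the same.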
