— Definition

Workers AI

Workers AI is Cloudflare’s platform for running machine-learning models — language models, embeddings, image and speech models — on its own GPU network, called from inside a Worker. It lets you add inference to an application without managing GPUs or routing every request to a third-party model provider.

For many features Workers AI is the lowest-friction, lowest-cost way to add a model, because the inference runs on the same platform as the rest of the app and the data does not leave it. For others, a frontier model from Anthropic or OpenAI is the better fit; the engineering choice is about cost, quality, and where the data is allowed to go.

We measure that trade-off rather than guess it — see the cost-quality comparison below.

Want this built or fixed properly?

Tell us what you are working on in two paragraphs — we reply in writing within one business day, with a straight answer on whether we can help.