Run open-source models like Llama 3 on Cloudflare's global network. Serverless inference with zero cold starts.
Configure your model parameter.
Workers AI allows you to run machine learning models, on the Cloudflare network, from your code. No need to manage infrastructure or GPUs.