AI is not a distant API call anymore. Cloudflare Workers AI puts Llama, Mistral, ResNet, and embedding models on GPUs inside every major data center on Earth. Your prompts run at the same edge serving this page.
> model: @cf/meta/llama-3.1-8b-instruct > inference: on-edge GPU > openai key: not required > monthly cost: ~$0 at this scale
Three works. Each one talks to a different model.
> awaiting input