Cloud Inference

Coming soon!

Cloud inference is on the roadmap. We plan to support device="cloud" on every tool config, so existing code written for local GPUs works unchanged against a managed endpoint. We also plan automatic fan-out across the provider’s fleet (no ToolPool() wrapping required), and GUIs for all tools so you can drive any tool through a visual interface instead of Python.