Coming soon!
Cloud inference is on the roadmap. We plan to support
device="cloud" on every tool config, so existing code written for local GPUs works unchanged against a managed endpoint. We also plan automatic fan-out across the providerβs fleet (no ToolPool() wrapping required), and GUIs for all tools so you can drive any tool through a visual interface instead of Python.