Skip to content

Agent Hosting

Deploy persistent AI agents on sovereign GPU nodes. Agents get inference access, knowledge base access, persistent memory, health monitoring, and hourly billing.

Deploy an agent

curl -X POST https://api.ryvion.ai/api/v1/agents/deploy \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "research-agent",
    "container_image": "ghcr.io/myorg/research-agent:latest",
    "kb_ids": "kb_abc,kb_def",
    "model": "phi-4",
    "jurisdiction": "CA"
  }'

Request body

ParameterTypeRequiredDescription
namestringYesAgent name
container_imagestringYesOCI container image URL
kb_idsstringNoComma-separated knowledge base IDs the agent can access
modelstringNoDefault model for inference (e.g., phi-4)
jurisdictionstringNoJurisdiction to deploy in (e.g., CA, DE)

What your agent gets

When deployed, your agent container runs on a GPU node with:

  • Inference access -- call the Ryvion API from inside the container using the assigned model
  • Knowledge base access -- read and search the knowledge bases specified in kb_ids
  • Persistent memory -- store and recall memories through the Agent Memory API
  • Health monitoring -- the hub monitors agent health and restarts on failure
  • Hourly billing -- billed at $0.50 CAD/hour while running

List deployments

curl https://api.ryvion.ai/api/v1/agents/deployments \
  -H "Authorization: Bearer YOUR_API_KEY"

Response format

{
  "deployments": [
    {
      "id": "deploy_abc123",
      "name": "research-agent",
      "status": "running",
      "node_id": "node_xyz",
      "jurisdiction": "CA",
      "created_at": "2026-04-08T12:00:00Z"
    }
  ]
}

Stop an agent

curl -X POST https://api.ryvion.ai/api/v1/agents/deployments/deploy_abc123/stop \
  -H "Authorization: Bearer YOUR_API_KEY"

Endpoints summary

MethodEndpointDescription
POST/api/v1/agents/deployDeploy an agent
GET/api/v1/agents/deploymentsList all deployments
POST/api/v1/agents/deployments/:id/stopStop a running agent

Building your agent container

Your container image should:

  1. Accept environment variables for API credentials and configuration
  2. Use the Ryvion API (or OpenAI SDK pointed at the hub) for inference
  3. Optionally use the knowledge base and memory APIs for state
  4. Implement a health check endpoint if you want monitoring

Jurisdiction pinning

Specify a jurisdiction to ensure your agent runs on a node in a specific country. See Sovereign Routing for available jurisdictions.

Pricing

$0.50 CAD per hour while the agent is running. Billing stops when the agent is stopped or terminated. Inference, search, and memory calls made by the agent are billed separately at standard rates.