Zero-config agent · Kubernetes + Docker

Monitor every container from one dashboard

Deploy a lightweight agent in 60 seconds. No cloud credentials required. Get full visibility into your infrastructure (CPU, memory, logs, and alerts) in minutes.

[Dashboard preview: app.kubewatchlabs.com / overview, with live panels for cluster throughput, resource usage, latency by service, active alerts, container status, and cloud spend]
  • 60s to first metric
  • < 0.5% agent CPU overhead
  • 99.9% uptime SLA
  • 365d metric retention
Capabilities

Everything you need to stay on top of your containers

From single Docker hosts to multi-cluster Kubernetes environments.

Docker Monitoring

Full visibility into every container: CPU, memory, network I/O, live log streaming, and restart tracking.

  • Container list & status
  • CPU & memory metrics
  • Network I/O per container
  • Live log tail

Kubernetes Monitoring

Understand your cluster at a glance. Track pods, nodes, services, and resource usage without leaving your browser.

  • Pod & node health
  • Service discovery
  • Restart detection
  • Resource usage per namespace

Real-time Alerts

Know before your users do. Set threshold rules, route to Slack or email, and silence noisy alerts intelligently.

  • Threshold-based rules
  • Slack & email channels
  • Silence detection
  • Alert history & audit log

Auto Scaling

Close the loop from watching to acting. Scale Kubernetes pods and Docker containers on the metrics you already track, from one policy.

  • Native HPA & Karpenter on K8s
  • Orchestrated scaling on Docker
  • Dry-run, cooldowns & approvals
  • One-click rollback

Integrations

Monitor the services around your apps. Connect databases, caches, message queues, CI/CD, and observability tools with live health and deep metrics.

  • Postgres, MySQL, Redis, Kafka
  • Argo CD, Jenkins, Prometheus, Grafana
  • Latency & uptime health checks
  • Per-service deep metrics

Load Testing

Run HTTP load tests against any endpoint and read real latency percentiles, throughput, and a full success and error breakdown.

  • Configurable concurrency & duration
  • P50 / P90 / P99 / max latency
  • Requests per second & error rate
  • Per-status and per-error breakdown
Metrics

Real-time metrics with up to 365 days of history

Every container and node streams CPU, memory, network, and request throughput to high-resolution time-series charts. Zoom from the last minute to the last year without sampling gaps.

  • Sub-second collection interval
  • Per-service request & error rates
  • Powered by VictoriaMetrics
[Chart preview: live cluster throughput in req/min, with request and error series]
Alerting

Catch incidents before your users do

Define threshold rules on any metric, route them to Slack or email, and let intelligent silencing cut the noise. A full audit trail records every alert that fired and when it resolved.

  • Threshold & anomaly rules
  • Slack and email channels
  • Acknowledge & audit history
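
As a rough illustration of what a threshold rule with routing and silencing involves, here is a minimal Python sketch. The `Rule` class, its field names, and the `evaluate` function are hypothetical and are not KubeWatch's rule format or API. The fixed silence window is a simplification chosen for the example and stands in for whatever the product's intelligent silencing actually does.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Rule:
    """Hypothetical threshold rule; field names are illustrative only."""
    metric: str        # e.g. "api_latency_p99_ms"
    threshold: float   # notify when the observed value exceeds this
    channel: str       # e.g. "slack" or "email"
    silence_s: float   # suppress repeat notifications for this many seconds


def evaluate(rule: Rule, value: float, now: float,
             last_notified: Optional[float]) -> Optional[str]:
    """Return a notification string if the rule should notify, else None."""
    if value <= rule.threshold:
        return None  # at or below the threshold: nothing is firing
    if last_notified is not None and now - last_notified < rule.silence_s:
        return None  # still firing, but inside the silence window
    return f"[{rule.channel}] {rule.metric} = {value} > {rule.threshold}"


rule = Rule("api_latency_p99_ms", 800.0, "slack", silence_s=300.0)
print(evaluate(rule, 950.0, now=1000.0, last_notified=None))    # notifies
print(evaluate(rule, 950.0, now=1100.0, last_notified=1000.0))  # None (silenced)
```

Keeping the silence window on the rule itself means a noisy metric can be quieted without touching rules for other metrics.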
[Dashboard preview: active alerts, listing each firing or resolved alert with its service and age]
AI Observability

Monitor AI and ML workloads, by cost and quality

Track every model call: tokens, latency, error rate, and spend, with a built-in price table for OpenAI, Anthropic, and more. Watch GPU utilization and scrape vLLM, Triton, and KServe inference servers from the same agent.

  • Cost and token tracking per model
  • GPU utilization, memory, and power
  • vLLM / Triton / KServe metrics
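
To make per-model cost tracking concrete, the sketch below computes spend from token counts and a price table. It is an assumption-laden illustration: the model names and prices are placeholders, not real vendor prices, and the function names are hypothetical rather than part of KubeWatch.

```python
# Placeholder prices in USD per one million tokens. These are NOT real
# vendor prices; a real price table would be maintained per provider.
PRICE_PER_MILLION = {
    "model-a": {"input": 2.00, "output": 8.00},
    "model-b": {"input": 0.10, "output": 0.00},
}


def call_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost of one model call in USD, from token counts and the price table."""
    price = PRICE_PER_MILLION[model]
    return (input_tokens * price["input"]
            + output_tokens * price["output"]) / 1_000_000


def spend_by_model(calls):
    """Sum (model, input_tokens, output_tokens) records into spend per model."""
    totals = {}
    for model, tokens_in, tokens_out in calls:
        totals[model] = totals.get(model, 0.0) + call_cost(model, tokens_in, tokens_out)
    return totals


# Three made-up calls across two hypothetical models.
calls = [("model-a", 1_000, 500), ("model-a", 2_000, 0), ("model-b", 10_000, 0)]
print(spend_by_model(calls))
```

Pricing input and output tokens separately matters because many providers charge different rates for each.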
[Dashboard preview: live AI observability, showing total spend, tokens, and p95 latency, plus calls, cost, and error rate per model]
Requests & Latency

Request rates, error rates, and latency everywhere

Report application API requests to see throughput, error rate, and p95 latency per route. Synthetic probes measure latency to your services, nodes, and the agent itself, with uptime tracking.

  • Per-route request and error rates
  • p50 / p95 latency breakdowns
  • TCP and HTTP latency probes
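
A TCP latency probe can be approximated by timing how long a connection takes to open. The sketch below shows that idea using only the Python standard library. The function name `tcp_probe_ms` and the target in the usage example are hypothetical, and this is not how the KubeWatch agent is known to implement its probes.

```python
import socket
import time
from typing import Optional


def tcp_probe_ms(host: str, port: int, timeout_s: float = 2.0) -> Optional[float]:
    """Time a TCP connect to host:port; return milliseconds, or None on failure."""
    start = time.perf_counter()
    try:
        with socket.create_connection((host, port), timeout=timeout_s):
            pass  # connection established; close it immediately
    except OSError:
        return None  # refused, unreachable, or timed out: treat as "down"
    return (time.perf_counter() - start) * 1000.0


if __name__ == "__main__":
    # Hypothetical target; substitute a host and port you actually run.
    result = tcp_probe_ms("localhost", 5432)
    print("down" if result is None else f"{result:.1f} ms")
```

Returning `None` instead of raising keeps a failed probe distinct from a slow one, which is what an uptime view needs.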
[Dashboard preview: API requests over the last 24h with per-route latency and error rate, and latency probes for the agent, nodes, and service endpoints]
OpenTelemetry

Bring your own telemetry with OTLP

Point any OpenTelemetry SDK or Collector straight at KubeWatch. We ingest traces, metrics, and logs over OTLP/HTTP, decoding both Protobuf and JSON, so your existing instrumentation works with no rewrites and no vendor lock-in.

  • Traces, metrics, and logs over OTLP/HTTP
  • Protobuf and JSON encoding
  • Drop-in for OpenTelemetry SDKs & Collector
  • Open standard, no vendor lock-in
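
For readers who want to see what OTLP/HTTP with JSON encoding looks like on the wire, here is a minimal sketch that builds a one-span trace export and wraps it in an HTTP POST without sending it. The ingest URL is a placeholder, not a real KubeWatch endpoint, and any authentication header your account needs is left out because the source does not describe one. In practice an OpenTelemetry SDK or Collector produces this payload for you.

```python
import json
import secrets
import time
import urllib.request

# Hypothetical ingest URL; substitute the OTLP endpoint for your account.
# "/v1/traces" is the default OTLP/HTTP path for trace data.
OTLP_TRACES_URL = "https://otlp.example.invalid/v1/traces"


def build_trace_payload(service: str, span_name: str, duration_ms: int) -> dict:
    """Build a one-span OTLP/JSON export request body."""
    end_ns = time.time_ns()
    start_ns = end_ns - duration_ms * 1_000_000
    return {
        "resourceSpans": [{
            "resource": {"attributes": [
                {"key": "service.name", "value": {"stringValue": service}},
            ]},
            "scopeSpans": [{
                "scope": {"name": "manual-example"},
                "spans": [{
                    "traceId": secrets.token_hex(16),  # 32 hex characters
                    "spanId": secrets.token_hex(8),    # 16 hex characters
                    "name": span_name,
                    "kind": 2,  # SPAN_KIND_SERVER
                    "startTimeUnixNano": str(start_ns),
                    "endTimeUnixNano": str(end_ns),
                }],
            }],
        }],
    }


def build_request(payload: dict) -> urllib.request.Request:
    """Wrap the payload in an HTTP POST; pass it to urllib.request.urlopen to send."""
    return urllib.request.Request(
        OTLP_TRACES_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_request(build_trace_payload("checkout", "GET /v1/checkout", 312))
print(request.get_method(), request.full_url)
```

The Protobuf encoding carries the same structure and is sent with a `Content-Type` of `application/x-protobuf` instead.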
[Dashboard preview: OTLP/HTTP ingest counts for traces, metrics, and logs, and a five-span distributed trace of GET /v1/checkout with per-span durations]
Auto Scaling

From watching your workloads to scaling them

Set a per-workload policy and KubeWatch acts on the same metrics you already see. On Kubernetes it writes native HorizontalPodAutoscaler and Karpenter objects and lets the cluster execute them. On standalone Docker, where there is no HPA, KubeWatch is the orchestrator: it picks placement, scales containers, and routes traffic through a managed load balancer.

  • Pods on Kubernetes, containers on Docker, one policy
  • Dry-run first, then go live with asymmetric cooldowns
  • Approval gates and an append-only decision log
  • One-click rollback on either runtime
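
The arithmetic behind a CPU-target policy can be sketched as follows. `desired_replicas` uses the proportional rule documented for the Kubernetes HorizontalPodAutoscaler, clamped to the policy bounds. The `decide` function and its cooldown defaults are hypothetical, shown only to illustrate what asymmetric cooldowns mean, and are not KubeWatch's implementation. The real HPA also applies a tolerance band and stabilization windows that this sketch omits.

```python
import math


def desired_replicas(current: int, metric: float, target: float,
                     min_replicas: int, max_replicas: int) -> int:
    """HPA-style proportional rule, clamped to the policy bounds."""
    raw = math.ceil(current * metric / target)
    return max(min_replicas, min(max_replicas, raw))


def decide(current: int, desired: int, seconds_since_last_change: float,
           up_cooldown_s: float = 60.0, down_cooldown_s: float = 300.0) -> int:
    """Apply asymmetric cooldowns: scale up quickly, scale down cautiously."""
    if desired > current and seconds_since_last_change >= up_cooldown_s:
        return desired
    if desired < current and seconds_since_last_change >= down_cooldown_s:
        return desired
    return current  # inside a cooldown window, or already at the target


# 3 replicas at 84% CPU against a 70% target, bounds 2 to 10.
want = desired_replicas(3, 84.0, 70.0, 2, 10)
print(want)                   # 4
print(decide(3, want, 90.0))  # 4 (scale-up cooldown has elapsed)
print(decide(4, 2, 90.0))     # 4 (scale-down cooldown has not elapsed)
```

A longer scale-down cooldown trades a little spend for stability, since removing capacity too early is usually the costlier mistake.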
[Dashboard preview: live auto scaling, showing replica count, CPU target, and bounds, with recent scaling decisions for Kubernetes and Docker workloads]
Integrations

Watch the services your apps depend on

Your containers are only half the picture. Connect the databases, caches, message queues, CI/CD, and observability tools around them, and KubeWatch tracks their health, latency, and uptime, then pulls deep per-service metrics like connection pools, cache hit rates, and replication lag.

  • Postgres, MySQL, Redis, and Kafka
  • Argo CD, Jenkins, Prometheus, Grafana, and more
  • Continuous latency and uptime health checks
  • Deep metrics, not just up or down
[Dashboard preview: integrations for databases, caches, and CI/CD, each with its type and current latency or down status]
Load Testing

Know how an endpoint holds up before your users do

Point a load test at any URL, pick the concurrency and duration, and KubeWatch runs it and reports the numbers that matter: throughput, latency percentiles, and a full breakdown of every response so you can see exactly what succeeded and what failed.

  • Configurable concurrency and duration
  • P50 / P90 / P99 / max latency, explained
  • Requests per second and error rate
  • Per-status and per-error breakdown
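
To show how the reported figures relate to raw results, the sketch below summarizes a finished run using the nearest-rank percentile method. The sample data is made up, the function names are hypothetical, and KubeWatch may compute its percentiles differently, for example with interpolation.

```python
import math


def percentile(samples_ms, p):
    """Nearest-rank percentile: smallest sample with at least p% of samples at or below it."""
    ordered = sorted(samples_ms)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


def summarize(results):
    """results: list of (status_code, latency_ms) pairs from a finished run."""
    latencies = [ms for _, ms in results]
    errors = sum(1 for status, _ in results if status >= 400)
    by_status = {}
    for status, _ in results:
        by_status[status] = by_status.get(status, 0) + 1
    return {
        "p50": percentile(latencies, 50),
        "p90": percentile(latencies, 90),
        "p99": percentile(latencies, 99),
        "max": max(latencies),
        "error_rate": errors / len(results),
        "by_status": by_status,
    }


# Ten made-up samples: nine successes and one server error.
results = [(200, ms) for ms in (20, 22, 25, 28, 30, 34, 40, 55, 96)] + [(500, 240)]
print(summarize(results))
```

With these ten samples the P50 is 30 ms while the P99 and max are both 240 ms, which is why tail percentiles reveal slow requests that an average would hide.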
[Dashboard preview: load test results for GET /v1/checkout, showing requests per second, average latency, error rate, and P50, P90, P99, and max latency]
Infrastructure

Pods, nodes, containers, and networks in one view

Full visibility across Docker and Kubernetes, from a single node to multi-cluster fleets.

[Dashboard preview: pods by status across namespaces, node CPU and memory, network rx/tx by network type, container status, and latency probes]
Setup

Up and running in minutes

No complex setup. No cloud IAM roles. Just deploy and watch.

01

Sign up in 30 seconds

Create your account with just your email. No credit card, no sales call, no waiting.

02

Deploy the agent with one command

Run a single Docker command on your host or a single Helm command on your cluster. The agent securely streams metrics to your dashboard.

03

See your infrastructure in under 5 minutes

Within minutes you have a live view of every container and node: CPU, memory, logs, and more.

Deployment

Two ways to run KubeWatch

Choose the deployment model that fits your team.

Hosted SaaS

Free to start. We manage the infrastructure.

  • Agent-only deployment
  • We manage the backend
  • Free tier available
  • Scale with your team
Start free

Self-Hosted Enterprise

Enterprise plan. Your data stays in your infrastructure.

  • Full platform deployment
  • Data residency & compliance
  • Docker Compose or Helm install
  • Annual license with updates
Learn more
Pricing

Simple, transparent pricing

Start free. Upgrade when you need more.

Free

$0 · 30-day trial
  • 2 agents
  • 1 user
  • 3-day retention
Start free
Most popular

Pro

$99/mo or $990/yr
  • 20 agents
  • 10 users
  • 30-day retention
Start trial

Enterprise

$299/mo or $2,990/yr
  • Unlimited agents
  • Unlimited users
  • 365-day retention
Start trial
Free forever plan · No credit card

See your whole stack in under 5 minutes

Join engineering teams who replaced their scattered dashboards with one place for Docker and Kubernetes.