Launching Resolve AI Labs backed by new $40M Series A Extension

On-call agent

Delegate on-call to agents.

Your on-call engineers start every shift with answers, not alerts.

01On-call

Participates in every on-call rotation.

Autonomously investigates alerts and builds initial findings before the on-call engineer is paged.

Investigations

Agents on every on-call rotation — triaging and investigating alerts in real time.

Investigations108Alerts1658
All teamsLast 3 days
05/1005/1105/1205/13
108 investigations · all triaged by on-rotation agents in the last 3 days
Today · Wed May 13 2026
just now

Scrape Error Rate High: errors > 2% across 2 orgs on same integration

Triaged· 12sChat originOn:monitoring-alerts-systems-prodrotation:systems-oncall
8:46am today

Demo Org not Replying in Slack

Investigating· 4mChat originOn:defaultrotation:support-oncall
6:24am today

High Alert Reception Latency (p90 > 10 mins over 20 min window)

Concluded· 23mPlatformmonitoring-alerts-platform-prodcluster:app0-clusterrotation:platform-oncall
9:49pm yesterday

RDS High Read IOPS: Instance is experiencing high read IOPS

Auto-resolvedPlatformmonitoring-alerts-platform-prodcluster:app0-clusterrotation:db-oncall
02Triage

Triages and investigates every alert

Correlates signals across your observability stack, assesses severity, and identifies blast radius with evidence.

Scrape Error Rate HighTriage CompleteStart Deep Investigation
InvestigationThreads
Assessed3 More Steps

Transient metrics provider outage (HTTP 500) causing scrape failures

Root cause5 evidence

Alert firing due to stale NoData KeepLast state

Contributing factor1 evidence

2 orgs with persistent UNAUTHORIZED errors due to missing events_read scope

Open lead1 evidence
Runbook conclusionAlert DetailsImpact

Runbook conclusion

The metrics provider API had a transient outage affecting 27 orgs' alert scrapes in orders-prod-cluster. The error spike was brief and self-resolved; the alert fired late because of stale evaluation. No action required — this is provider-side.

Alert Details

  • Breach: up to 4 orgs exceeded the 2% error rate on the metrics-provider integration in orders-prod-cluster.
  • Timeline: spike at 9:54pm, peaked at 4 orgs at 9:59pm, back to baseline by 10:10pm. Alert fired at 10:26pm.

Impact

  • Blast radius: 27 orgs affected by transient API errors on scrapeType=alerts.
  • Customer-facing: none — spike resolved before page-load impact.
03Mitigation actions

Resolves alerts with actions.

Silences noise, executes GitHub Actions, and routes to the right team. Engineers approve or let agents handle known patterns autonomously.

Silence alert: Checkout p95 latency above threshold

pending
Why silence Checkout p95 latency above threshold?

Silence Alert

PlatformGrafanaAlertCheckout p95 latency above thresholdDuration60 min
BeforeAlert "Checkout p95 latency above threshold" is firingAfterSilenced for 60 min
04Interface

Available in your collaboration tools.

Findings, priority lists, and actions surface in Slack, MS Teams, CLI, Resolve AI, or your own agent.

#orders-on-call42 members
Confirmed root cause

Missing schema migrations on orders-db causing transaction rollbacks. Rollback ratio crossed 2% threshold at 08:19Z.

View full report
On-Call› #incidents
Resolve AI

Top 3 priorities

  • Apply orders-db migrationshigh
  • Investigate checkout p95 spikemed
  • Review checkout-v2 revert PRlow
resolve
$resolve actions
Revert checkout-v2-routinghigh
Silence Checkout p95 latencymed
Apply orders-db migrationshigh
3 actions pending review
$

Used and loved by engineers

Removing the toil of investigations, war rooms, and on-call.

We pull fewer engineers into war rooms, on-call is materially better, and that translates directly to advertiser trust and revenue protection.

Shahrooz Ansari

Shahrooz Ansari

Sr. Director of Engineering, DoorDash

I don't need more numbers or more data. What I need is a root cause.

Chris Umbel

Chris Umbel

AIOps Lead & SRE, Zscaler

Resolve AI proved it could deliver real results in a constrained environment. It identified dependencies, surfaced accurate root causes 73% faster than our teams, all while integrating cleanly into our existing stack.

Angelo Marletta

Angelo Marletta

Staff Software Engineer, Coinbase

Resolve AI makes our junior on-call engineers as effective as our seniors, flattening the experience curve. We've seen a 2x productivity lift while eliminating the runbook gap.

A.D.

A.D.

Sr. Director of Engineering, Financial Services Company

We pull fewer engineers into war rooms, on-call is materially better, and that translates directly to advertiser trust and revenue protection.

Shahrooz Ansari

Shahrooz Ansari

Sr. Director of Engineering, DoorDash

I don't need more numbers or more data. What I need is a root cause.

Chris Umbel

Chris Umbel

AIOps Lead & SRE, Zscaler

Resolve AI proved it could deliver real results in a constrained environment. It identified dependencies, surfaced accurate root causes 73% faster than our teams, all while integrating cleanly into our existing stack.

Angelo Marletta

Angelo Marletta

Staff Software Engineer, Coinbase

Resolve AI makes our junior on-call engineers as effective as our seniors, flattening the experience curve. We've seen a 2x productivity lift while eliminating the runbook gap.

A.D.

A.D.

Sr. Director of Engineering, Financial Services Company

Recent updates

Shipping every week.

See all updates
  • May 2026

    Autonomous alert triage

    Every alert investigated automatically, 24/7.

  • May 2026

    Alert resolution

    Agents take action directly, including silencing and GitHub Actions.

  • May 2026

    Deployment monitoring

    Agents watch rollouts and investigate before alerts fire.

  • April 2026

    Adaptive learning

    Triage quality improves with agent teams and engineer corrections.

Frequently asked questions

See on-call agents in your environment.