Rapid incident response procedures for OpenEvidence clinical AI integration outages in healthcare environments. Includes severity classification, triage steps, error-specific remediation, fallback procedures, and postmortem templates.
| Level | Definition | Response Time | Examples |
|---|---|---|---|
| P1 | Complete outage | < 15 min | API unreachable, all queries failing |
| P2 | Degraded service | < 1 hour | High latency, partial failures |
| P3 | Minor impact | < 4 hours | DeepConsult delays, webhook issues |
| P4 | No user impact | Next business day | Alert noise, logging issues |
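
For paging or escalation scripts, the response targets in the table above can be encoded as a small lookup. This is a minimal sketch, not part of any OpenEvidence tooling: the function name `response_target_minutes` is hypothetical, and P4 is returned as a label because "next business day" has no fixed minute count.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: maps a severity level from the table above to its
# response target in minutes. P4 has no fixed deadline, so a label is returned.
response_target_minutes() {
  case "${1:-}" in
    P1) echo 15 ;;
    P2) echo 60 ;;
    P3) echo 240 ;;
    P4) echo "next-business-day" ;;
    *)
      echo "unknown severity: ${1:-<empty>}" >&2
      return 1
      ;;
  esac
}
```

For example, `response_target_minutes P2` prints `60`, which an alerting script could use to set an escalation timer.
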
OpenEvidence outages may affect clinical decision-making. Always communicate clearly with clinical staff, ensure fallback procedures are known, and document any clinical impact.
1. Triage: check the OpenEvidence status page, your integration's health endpoint, error-rate metrics, and recent error logs.
2. Fall back: while OpenEvidence is unavailable, have the integration return a helpful message that directs clinicians to UpToDate, DynaMed, or the relevant clinical guidelines.
3. Communicate: notify clinical staff via Slack/Teams and email, and update the status page.
4. Follow up: collect evidence (logs, metrics, alerts), run a postmortem that includes a clinical impact assessment, and create action items.
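
The postmortem can start from a pre-filled skeleton so that the clinical impact assessment is not forgotten. The sketch below is one possible shape, not a prescribed template: the function name `postmortem_skeleton` and the section headings are assumptions based on the items named above (evidence, clinical impact, action items).

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: prints a Markdown postmortem skeleton to stdout.
# Section names are assumptions derived from the follow-up step above.
postmortem_skeleton() {
  local severity="$1" detected_at="$2"
  cat <<EOF
# Postmortem: OpenEvidence integration incident

- Severity: ${severity}
- Detected at: ${detected_at}

## Summary

## Timeline

## Clinical impact assessment

## Evidence (logs, metrics, alerts)

## Action items
EOF
}
```

A possible invocation is `postmortem_skeleton P2 "$(date -u +%Y-%m-%dT%H:%M:%SZ)" > postmortem.md`, which stamps the file with the current UTC time.
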
| Error Type | Quick Fix |
|---|---|
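| 401/403 Auth | Recreate the API key secret with `kubectl create secret`, then `kubectl rollout restart deployment/clinical-evidence-api` |
| 429 Rate Limit | `kubectl set env deployment/clinical-evidence-api RATE_LIMIT_MODE=queue` |
| 500/503 Server | `kubectl set env deployment/clinical-evidence-api OPENEVIDENCE_FALLBACK=true` |
| Timeout | `kubectl set env deployment/clinical-evidence-api OPENEVIDENCE_TIMEOUT=60000` |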
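
During an incident it can help to have the quick-fix table available as a lookup that prints the suggested command without running it. The following is a minimal sketch under stated assumptions: `suggest_fix` is a hypothetical helper, the deployment name `clinical-evidence-api` is taken from the fallback commands in this runbook, and the secret name is left out because this runbook does not specify it.

```bash
#!/usr/bin/env bash
set -euo pipefail

DEPLOYMENT="${DEPLOYMENT:-clinical-evidence-api}"

# Hypothetical helper: prints (does not run) the quick fix for an error type.
# Accepts an HTTP status code or the literal word "timeout".
suggest_fix() {
  case "${1:-}" in
    401|403) echo "rotate the API key secret, then: kubectl rollout restart deployment/${DEPLOYMENT}" ;;
    429)     echo "kubectl set env deployment/${DEPLOYMENT} RATE_LIMIT_MODE=queue" ;;
    500|503) echo "kubectl set env deployment/${DEPLOYMENT} OPENEVIDENCE_FALLBACK=true" ;;
    timeout) echo "kubectl set env deployment/${DEPLOYMENT} OPENEVIDENCE_TIMEOUT=60000" ;;
    *)       echo "no quick fix listed for: ${1:-<empty>}" >&2; return 1 ;;
  esac
}
```

Printing rather than executing keeps a human in the loop, which seems appropriate when the change affects clinical users.
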
set -euo pipefail
curl -sf https://api.yourhealthcare.com/health/openevidence | jq '.status' || echo "UNHEALTHY"
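
The one-shot check above can be extended into a polling loop for confirming recovery after a fix. This is a sketch, assuming the same health URL as the command above; `check_health` and `wait_for_recovery` are hypothetical names, and the 5-second curl timeout is an arbitrary choice.

```bash
#!/usr/bin/env bash
set -euo pipefail

HEALTH_URL="${HEALTH_URL:-https://api.yourhealthcare.com/health/openevidence}"

# Succeeds if the endpoint answers within 5 seconds without an HTTP error
# (curl -f treats status codes of 400 and above as failures).
check_health() {
  curl -sf --max-time 5 "$HEALTH_URL" > /dev/null
}

# Hypothetical helper: polls check_health up to <attempts> times,
# sleeping <delay_seconds> after each failed check.
wait_for_recovery() {
  local attempts="$1" delay="$2" i
  for (( i = 1; i <= attempts; i++ )); do
    if check_health; then
      echo "HEALTHY after ${i} check(s)"
      return 0
    fi
    sleep "$delay"
  done
  echo "UNHEALTHY"
  return 1
}
```

For instance, `wait_for_recovery 10 30` would poll for up to roughly five minutes before reporting `UNHEALTHY`.
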
set -euo pipefail
kubectl set env deployment/clinical-evidence-api OPENEVIDENCE_FALLBACK=true # Enable
kubectl set env deployment/clinical-evidence-api OPENEVIDENCE_FALLBACK=false # Disable
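
The two toggle commands above can be wrapped so that the operator also waits for the resulting rollout to finish. The sketch below is an illustration rather than the integration's actual tooling: `set_fallback` and the `KUBECTL` override are hypothetical, and the 120-second rollout timeout is an assumed value.

```bash
#!/usr/bin/env bash
set -euo pipefail

DEPLOYMENT="${DEPLOYMENT:-clinical-evidence-api}"
KUBECTL="${KUBECTL:-kubectl}"   # set KUBECTL=echo for a dry run

# Hypothetical helper: toggles fallback mode, then waits for the rollout
# that the environment change triggers.
set_fallback() {
  local value
  case "${1:-}" in
    on)  value=true ;;
    off) value=false ;;
    *)   echo "usage: set_fallback on|off" >&2; return 2 ;;
  esac
  "$KUBECTL" set env "deployment/${DEPLOYMENT}" "OPENEVIDENCE_FALLBACK=${value}"
  "$KUBECTL" rollout status "deployment/${DEPLOYMENT}" --timeout=120s
}
```

Running `KUBECTL=echo` before `set_fallback on` prints the two commands instead of executing them, which is useful for rehearsing the procedure.
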
See detailed implementation for advanced patterns.