Bob the Bot Service Interruption

Affected services: Bob (Bot Service)

Status: Resolved, Feb 17 at 10:06pm CST

Date: Feb 17, 2026 | ⏱ Duration: 27 mins

🧭 Summary

Bob the Bot experienced a brief total service outage followed by a prolonged networking desync. The bot’s internal functions (Discord Shards/Slash Commands) were manually restored and confirmed operational within five minutes of the start of the incident, but Railway’s edge infrastructure continued to return 502 "Bad Gateway" errors for roughly 21 more minutes due to a persistent routing failure.

🛠️ Resolution

Service was restored internally by recovering the application process. Logs confirmed the bot was "Ready" and processing commands at 9:43 PM, but the incident remained open because Railway's proxy failed to route traffic to the container's health-check port. The edge network eventually resynchronized at 10:04 PM, and monitors recovered automatically.

🕑 Timeline

  • 9:38 PM – 🚨 Incident started. Total service failure detected; global 502 errors across all regions.
  • 9:43 PM – 🤖 Application Restored. Internal logs confirm Shards 0-2 connected and Slash Commands registered. Bot functionality fully recovered.
  • 9:46 PM – 🛠️ Incident acknowledged by Zach Goodson. Investigation confirmed a "ghost" networking issue: the bot was functional, but its public URL remained unreachable.
  • 10:04 PM – 📶 Infrastructure Recovery. Railway's edge proxy successfully re-linked the public domain to the container.
  • 10:05 PM – ✅ Incident resolved.
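
The durations quoted in this report can be re-derived from the timeline entries. The following is a minimal sketch for checking that arithmetic; the dictionary keys and the `minutes_between` helper are illustrative names, not part of any incident tooling, and all timestamps are assumed to fall on the same evening.

```python
from datetime import datetime

# Timestamps copied from the timeline above (same evening, CST).
# The key names are hypothetical labels chosen for this sketch.
TIMELINE = {
    "incident_started": "9:38 PM",
    "application_restored": "9:43 PM",
    "acknowledged": "9:46 PM",
    "infrastructure_recovery": "10:04 PM",
    "resolved": "10:05 PM",
}


def minutes_between(start: str, end: str) -> int:
    """Return whole minutes between two same-day 12-hour clock times."""
    fmt = "%I:%M %p"
    delta = datetime.strptime(end, fmt) - datetime.strptime(start, fmt)
    return int(delta.total_seconds() // 60)


# Time the application itself was down.
app_downtime = minutes_between(
    TIMELINE["incident_started"], TIMELINE["application_restored"]
)
# Time the edge kept failing after the application had recovered.
edge_desync = minutes_between(
    TIMELINE["application_restored"], TIMELINE["infrastructure_recovery"]
)
# Total incident duration as reported on the status page.
total_duration = minutes_between(
    TIMELINE["incident_started"], TIMELINE["resolved"]
)

print(app_downtime, edge_desync, total_duration)
```

Splitting the total this way shows how much of the incident was application downtime and how much was the routing desync.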

Root Cause: Application Interruption & Upstream Desync. An initial disruption took the bot offline. A secondary infrastructure failure then kept Railway's load balancer from recognizing the healthy application (upstreamAddress: "") for 21 minutes after the bot had resumed operations and bound its port.
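
The two failure modes described here can be told apart by comparing an internal readiness signal with an external probe of the public URL. The sketch below illustrates that comparison under stated assumptions: `bot_ready` stands in for whatever the application logs or exposes as "Ready", `public_status` is the HTTP status an outside monitor sees (or `None` if the request failed entirely), and the `Diagnosis` names are hypothetical. It does not reflect how Railway or the monitoring in this incident is actually implemented.

```python
from enum import Enum
from typing import Optional


class Diagnosis(Enum):
    """Hypothetical labels for the states discussed in this report."""
    HEALTHY = "healthy"
    APPLICATION_DOWN = "application down"
    EDGE_ROUTING_FAILURE = "edge routing failure"


def diagnose(bot_ready: bool, public_status: Optional[int]) -> Diagnosis:
    """Classify an incident from one internal and one external signal.

    bot_ready: assumed internal signal (e.g. shards connected, commands registered).
    public_status: HTTP status seen by an external probe, or None on no response.
    """
    public_ok = public_status is not None and 200 <= public_status < 300
    if not bot_ready:
        # The application itself is not serving, whatever the edge reports.
        return Diagnosis.APPLICATION_DOWN
    if not public_ok:
        # The application is up but traffic is not reaching it.
        return Diagnosis.EDGE_ROUTING_FAILURE
    return Diagnosis.HEALTHY


# The three phases of this incident, expressed with the assumed signals:
print(diagnose(False, 502))  # 9:38 PM: bot offline, 502 at the edge
print(diagnose(True, 502))   # 9:43 PM: bot ready, edge still returning 502
print(diagnose(True, 200))   # 10:04 PM: edge re-linked to the container
```

A check along these lines would label the 9:43 PM to 10:04 PM window as a routing problem rather than an application outage.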