Rithmic Incident Report
Incident: Partial loss of access to Chicago Area servers
Date of Incident: Tuesday January 18, 2022 03:53:22 CST
Date of Resolution: Tuesday January 18, 2022 07:12:58 CST
Date of Report: Tuesday January 18, 2022
Scope of Failure: Some Rithmic customers found that logins to the Chicago Area could not proceed.
Root Cause Analysis
Failure of a Network Hardware component led to a transient network outage. A Rithmic Software component ran an automated recovery process, but some subsystems failed to recover to an open state. Rithmic Operations remedied this at 07:12:58 CST.
Action Items
- Rithmic Operations has an open issue with the Hardware Vendor. In the meantime, acting on the direction of Vendor representatives, Rithmic Operations has disabled the features of the Hardware components that contributed to the outage, to prevent a recurrence.
- Rithmic Operations is improving its detection of failed software routing subsystems and is liaising with Rithmic Development to improve automated recovery.