We are truly grateful for your continued support and loyalty. We value your feedback and appreciate your patience as we worked to resolve this incident.
Description:
On November 30, 2023, our customer experience team identified an issue with Serraview Live 2 causing a delay between receiving data and publishing it to the Serraview Live Servers.
Upon investigation, our engineering team identified that the root cause of the problem was that we had reached a limit on our servers as our client count had grown.
Type of Event:
Unplanned delay in SVLive 2 data being processed.
Services/Modules Impacted:
Locator, Engage and reports (including Insights).
Remediation:
We tripled the maximum number of server shards, enabling us to scale them for each client and effectively handle the increased load. As a result, we could resume processing SVLive 2 data.
Timeline (AEDT):
30th November
1st December
Total Duration of Event:
~ 11 hours and 30 minutes.
Root Cause Analysis:
To accommodate the growing number of new clients, we needed to increase the maximum number of shards on the AWS server.
Preventative Action: