We recently addressed issues affecting most parts of the Box webapp and public API; including Logins, Uploads and Downloads. We would like to take the opportunity to further explain these issues and the steps we have taken to keep them from happening in the future.
On January 9, 2024, between 11:10 AM PT and 12:15 PM PT, users experienced difficulties while working in Box. During this time, users experienced slowness and failures interacting with most parts of the Box webapp and public API, including Logins, Uploads and Downloads. The issue was triggered by a change that impacted Kubernetes clusters in one of the regions where we run our critical services. At 11:18 AM PT this change was rolled back. After rolling back the change, systems began to recover around 11:27 AM.
Analysis
On January 9th, 2024, we made a configuration change to deploy a new daemonset to the dedicated Kubernetes clusters running our core data services, which caused resource exhaustion and eviction of our core data services on those nodes. The daemonset deployment pipeline erroneously allowed the change to be promoted to an entire region instead of the planned gradual rollout, which resulted in service interruption impacting most of our customers. To address this issue, we rolled back the daemonset deployment. After the daemonset was rolled back, the evicted core data services began to be restored and our systems began to recover; however, in two of our three clusters, this process was delayed by a scheduler limitation, elongating the duration of impact. We adjusted the resource requests for our core data services to speed up recovery of these two clusters, restoring normal operations at 12:15pm.
Corrective Actions
The following corrective actions have been completed or are planned:
We are continuously working to improve Box and want to make sure we are delivering the best product and user experience we can. We hope we have provided some clarity here and we would be happy to answer any questions you may still have regarding this matter.
Sincerely,
The Box Team