» Published on
The complete post mortem is available for review here: https://docs.google.com/document/d/1HOjyYoyIMKEEjHuKGLuCgf7zqV6lwxTXXtks-QZtTY8/edit?usp=sharing
» UpdatedAll Clouds should now be restored to their last backups. Please open a ticket from the MODX Cloud Dashboard if you require assistance with anything related to the paas2.ams platform. We will release a postmortem on this incident early next week and be reaching out to all customers with instances on the paas2.ams platform.
» UpdatedAt this time, production sites on this platform should be back online. If they are not, please contact support. We will continue restoring development clouds next.
» UpdatedAt this point, partition copy is at 95%.
» UpdatedAt this time, it's about 90% complete on the transfer.
» UpdatedWe apologize for a lack of update. Due to a misconfiguration, we had to take the services offline to migrate from one partition to another. At this time, it's about 85% completed.
» UpdatedAll production Clouds should now be restored; non-production Clouds without domains assigned are now being restored. We expect this to continue for the next ~18 hours. If you have any questions about sites that are not working as expected, please let us know by opening a support ticket in the MODX Cloud Dashboard. We will continue monitoring progress through the weekend.
» UpdatedProduction sites with domains are being restored from the most recent backups to paas2.ams. We expect this to continue for the next few hours when sites without domains will start restoring. Please open a ticket from the Dashboard if you have any questions or require assistance.
» UpdatedOver the last 8 hours, IBM Cloud data center engineers performed multiple rounds of hardware replacement including all drives and a RAID controller (which is supposed to handle redundant local storage). Due to the severity of the failures, our team had to reinstall the operating system and configure it as new.
The server is now back online and we have begun the process of restoring all customer sites from backups.
Sites with custom domains will be restored first, followed by development Clouds (with no custom domains activated).
We sincerely apologize for the unusual nature of the downtime. If you have any questions, please use the Help button from the lower right of the MODX Cloud dashboard to ask for assistance.
» UpdatedWe have worked with data center technicians overnight to replace all affected hardware including all disks and the RAID controller. We are preparing the platform for recovering sites and site functionality.
At this time we do not have an estimated time of completion.
To get your site back online quickly, we recommend you create a new Cloud at another location such as London, Frankfurt or Amsterdam 1, and restore your most recent backup or Snapshot into the newly created Cloud. Then add any custom domains and point your A Record to direct traffic to the new location. Once that is done you can install a free SSL certificate if needed and copy over your original web rules.
We sincerely apologize for the extended outage and will continue to update this incident as we have more information.
» UpdatedWe continue to run disk recovery operations with IBM. Due to the nature of this process, we do not have an ETA at this point.
To get your site back online quickly, we recommend you create a new Cloud at another location, and restore your most recent backup or Snapshot into the newly created Cloud. Then add any custom domains and point your A Record to direct traffic to the new location. Once that is done you can install a free SSL certificate if needed and copy over your original web rules.
We sincerely apologize for this outage and will continue to update this incident as we have more information.
We sincerely apologize for this outage and will continue to update this incident as we have more information.
» UpdatedWe're still working to restore service to Amsterdam 2 Platform with IBM Cloud engineers. At this time we expect the outage to continue for some time as we are working to confirm the integrity of hardware and data and identify the cause.
» UpdatedWe are working with our infrastructure partner, IBM Cloud, to restore service to our Amsterdam 2 Platform, as soon as possible.
» UpdatedWe continue to work to bring the Amsterdam 2 Platform back online.
» UpdatedWe're currently investigating the cause of an outage that's occurring on our Amsterdam 2 platform that will affect sites containing the Cloud URL of paas2.ams.modxcloud.com.
We're working with our upstream partner, IBM Cloud to identify the source of the issue and recover normal operations.
» Updated