[Hpc-notice] RCC/HPC outage in Sliger Data Center from April 29 - May 4
Casey Mc Laughlin
cmclaughlin at fsu.edu
Mon Feb 15 10:35:54 EST 2021
Hello FSU Research Community,
As part of the ongoing Sliger Renovation, the contractor (Arbitron-Williams) will be working on the data center's electrical and HVAC systems over the first weekend in May. Everything hosted in the Sliger Data Center will have to be shut off before that happens, including all RCC systems.
Given the complexity of the task, we will need a full day before the actual outage begins to shut down all RCC systems and a full day to bring them back online after the outage is over. The outage will be for all RCC services, including, but not limited to the following:
* The High-Performance Computing cluster,
* The Interactive Computing cluster (Spear),
* GPFS and Archival storage,
* Virtual Machines running on the "SKY" cluster
* The "vpn.fsu.edu/hpc" profile for the FSU VPN
The schedule will be as follows:
* Thursday, April 29; 8am - RCC powers down all RCC systems and will remain offline through Tuesday, May 4
* Tuesday, May 4; noon - Power and HVAC restored and running
* Tuesday, May 4; 6pm - Most RCC systems back online (we will send a notice about what's available and what's not)
* Wednesday, May 5; 5pm - All RCC systems back online
These are the best estimates we can provide at this time, and they may be subject to change between now and when the maintenance occurs. The RCC will make every effort to communicate schedule changes promptly on our website, newsletter, and this email notice list.
What's being done
The reason for this power outage is to complete necessary upgrades to power and cooling infrastructure as part of the Sliger Renovation project scheduled to end in August 2021. The Sliger Building is decades old, and FSU has committed two million dollars to bring the infrastructure up-to-code.
While most of the improvements will be behind the scenes, there are a few notable upgrades; some highlights include:
* Enhanced power and cooling infrastructure to support RCC systems and colocation customers
* Dedicated 10GbE switch for each rack, to decrease network bottlenecks
* Replacement of original fire suppression system
For questions, please reach out to us
Contact us at support at rcc.fsu.edu<mailto:support at rcc.fsu.edu>.
You can also stay up-to-date at this URL on our website: https://fla.st/37fdiCa
Best regards,
The RCC Team
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.fsu.edu/pipermail/hpc-notice/attachments/20210215/238d0352/attachment.html>
More information about the Hpc-notice
mailing list