Outages and Maintenance
-
We are currently experiencing network connectivity problems with the Gautschi community cluster. Engineers are investigating and will provide an update by noon 1/18/2025. Update: This has been resolved.
-
Gautschi EUP Scheduled Maintenance
The Gautschi cluster will be unavailable on Tuesday, January 21st between 8:00am-5:00pm EST for the early user period's scheduled Tuesday maintenance. During this time, we will be modifying some of the Slurm configurations around CPU allocation on th...
-
The Anvil cluster began experiencing issues with electrical power around 2:30 PM EST. RCAC engineers are working with Purdue electricians to safely restore power. Anvil is operating at reduced capacity while a handful of nodes were shut down as a pre...
-
Update: Tuesday, January 21st, 2025 at 3:02pm EST: The situation has been corrected and job scheduling is running again on Negishi. The Negishi cluster began experiencing issues with electrical power around 2:30 PM. RCAC engineers are working with P...
-
From 5:00pm to 7:00pm on Tuesday, January 21st, 2025, the Weber VPN (REED VPN) will be undergoing a regular software upgrade. No actions need to be taken by users. While no cluster downtime is anticipated, users may experience their connection resett...
-
Hammer Slurm Maintenance 1/22/2025
We will be upgrading backend servers on Wednesday, January 22nd. These upgrades will require the Slurm scheduler to be idle and shut down while transferring to the new server. There will be a period of roughly 2 hours where the Slurm scheduler will b...
-
The Fortress storage system began experiencing issues earlier today related to one of its administrative servers. This results in access being denied to users attempting to connect/authenticate; e.g., via hsi or htar command-line tools or the Globus t...
-
The Bell cluster will be unavailable from Monday, January 27th, 2025 at 8:00am EST to Tuesday, January 28th, 2025 at 5:00pm EST for scheduled maintenance. During this time, RCAC staff will reorganize the data center space for Bell and upgrade its Lus...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, February 5, 2025 from 8:00am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any trans...
-
The Purdue GitHub services will be unavailable Wednesday, February 5, 2025 from 3:00pm - 5:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, the GitHub appliances will receive normal software updates, and...
-
The Gautschi cluster began experiencing issues with internal fabrics around 02:30 2025-02-13. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will...
-
Gautschi EUP Scheduled Maintenance
The Gautschi cluster will be unavailable on Tuesday, February 18th between 8:00am-5:00pm EST for the early user period's scheduled Tuesday maintenance. During this time, we will be replacing some network cables and updating firmware on the AI nodes.
-
RCAC License Server Maintenance
Update: Tuesday, February 11th, 2025 at 6:12pm EST: This is a reminder that the RCAC license server will be unavailable today from 11:00 AM to 11:30 AM EST for scheduled maintenance. Licenses hosted by RCAC will be unavailable for checkout for the...
-
We have noticed a discrepancy in allocation usage after the outage, so you may see incorrect usage for your allocation(s) from mybalance. Our engineers are working on a fix. Job scheduling will NOT be impacted. We will provide an update by 5:00p...
-
The Gilbreth cluster is scheduled for maintenance and will be unavailable on Tuesday, February 25th, 2025 at 8:00am EST through Wednesday, February 26th, 2025 at 5:00pm EST. During this maintenance, Gilbreth will have its software stack modernized. W...
-
Gautschi EUP Scheduled Maintenance
The Gautschi cluster will be unavailable on Tuesday, February 25th between 8:00am-5:00pm EST for the early user period's scheduled Tuesday maintenance. During this time, we will be updating system firmware.
-
Unscheduled Gautschi cluster outage
The Gautschi cluster began experiencing issues with its power feed around 6:45am. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will provide an...
-
The Bell cluster is scheduled for maintenance and will be unavailable on Tuesday, March 4th, 2025 at 8:00am EST through Wednesday, March 5th, 2025 at 5:00pm EST. During this maintenance, Bell will have its software stack modernized. What is being upg...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, March 5, 2025 from 8:00am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transfer...
-
At around 11:00am, Bell's scratch filesystem began to show signs of severe performance degradation. We have paused job scheduling on Bell while engineers investigate the slowdown and work to bring things back up to speed. We will provide an upda...