Outages and Maintenance
-
Unscheduled Outage to Multiple Systems
Hammer, Scholar, Snyder, WCERES, WSC Hadoop, and Data Depot began experiencing issues with networking around 10:00am EST. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 1:00pm if the issue...
-
Rice will be transitioning to a new batch scheduler on Tuesday, February 18th, 2020! This is a necessary upgrade and will require faculty and students to modify how they interact with the batch system. Please review the information below prior to t...
-
Preventative Scholar Maintenance
To alleviate gradual performance degradation on Scholar, all front-end login nodes will be restarted shortly after midnight on Sunday (Saturday night). This may be necessary on a weekly basis, although engineers continue to monitor these...
-
Halstead will be transitioning to a new batch scheduler on Tuesday, February 25th, 2020! This is a necessary upgrade and will require faculty and students to modify how they interact with the batch system. Please review the information below prior...
-
Brown will be transitioning to a new batch scheduler on Tuesday, March 3rd, 2020! This is a necessary upgrade and will require faculty and students to modify how they interact with the batch system. Please review the information below prior to this...
-
Snyder will be transitioning to a new batch scheduler on Tuesday, March 10th, 2020! This is a necessary upgrade and will require faculty and students to modify how they interact with the batch system. Please review the information below prior to th...
-
Gilbreth will be transitioning to a new batch scheduler on Tuesday, March 17th, 2020! This is a necessary upgrade and will require faculty and students to modify how they interact with the batch system. Please review the information below prior to...
-
The Research Computing GitHub service will undergo emergency maintenance beginning at 11:00am on March 18, 2020 to address a storage issue. During this time the service will be unavailable. We expect the GitHub service to resume normal operation by...
-
The Data Workbench cluster will be unavailable beginning Tuesday, March 24, at 8:00am for scheduled maintenance. During this time, the cluster will have minor operating system security patches and upgrades applied. Workbench will be returned to servi...
-
The Research Computing GitHub service will undergo maintenance beginning at 6:00PM on April 15, 2020. During this time the service will be unavailable. We expect the GitHub service to resume normal operation by 8:00PM.
-
Unscheduled Data Depot outage on the clusters
The Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, and Workbench clusters began experiencing issues connecting to the Data Depot filesystem around 5:00pm EDT on Friday, April 17th, 2020. Engineers are currently diagnosing the issue and ar...
-
Running Jobs on Community Clusters While Data Depot is Unavailable
Since Friday, April 17, the Research Data Depot filesystem has been unavailable on community cluster systems due to an ongoing filesystem verification. While we don't believe there is any danger of data loss, the filesystem verification will continu...
-
Unscheduled Data Depot Windows Network Drive outage
Since Friday, April 17, the Research Data Depot filesystem has been unavailable on community cluster systems, but remained available through other means of access (such as Windows Network Drive). Around 9:00am EDT on Tuesday, April 21st, 2020, the D...
-
Engineering Computing Network (ECN) has reported an outage on the software license servers for ITaP Research Computing systems that are hosted by ECN. ITaP Research Computing cluster job scheduling is not affected by the outage, but licenses for soft...
-
The Weber cluster began experiencing issues around 10:00am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. We will provide an update by 2:00pm.
-
The Data Depot storage system began experiencing issues with "No space left on device" error messages around 10:30am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling on community clusters has been paus...
-
Unscheduled Brown scratch outage
The Brown cluster began experiencing issues with its scratch filesystem around 4:30pm EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being addressed. We will pro...
-
The Fortress Archive will be unavailable Wednesday, May 6, 2020 at 8:00am EDT for scheduled monthly maintenance. During this time, Fortress will receive normal updates and have some work done on its battery backup system. Any work which requests file...
-
Unscheduled Home Directory Outage
The Brown, Gilbreth, Halstead, Rice, Scholar, Snyder, and Workbench clusters began experiencing intermittently slow home directory access around 2:30pm EDT. The issue has been traced to a high load on one of the filesystem's back-end se...
-
Unscheduled data.rcac Transfer Node Outage
The data.rcac.purdue.edu data transfer node began experiencing issues and was taken down at 3:00pm EDT. Engineers are currently diagnosing the issue. Data may be transferred to/from other clusters using those clusters' login nodes, and for Data Depot...