Outages and Maintenance
-
The Halstead cluster began experiencing issues with its scratch filesystem around 1:15 pm, Sunday 11 Oct. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while this issue is being address...
-
As of about 8:15 pm yesterday (Monday 12 October) the Slurm batch scheduler on Rice entered a degraded state due to a problem with its internal database. Job Scheduling on Rice has been paused while we work on this issue. We will provide an update b...
-
Unscheduled RCAC GitHub outage
The github.rcac server will be briefly unavailable Friday, October 16, 2020 from 7:00pm – 11:59pm for an emergency maintenance. During this time, the server will undergo maintenance tasks that can not be completed with the server in production. Opera...
-
Bell will be unavailable Tuesday night through Thursday
The Bell Cluster will be unavailable Tuesday, October 20, 2020 at 8:00pm EDT. During this time, our Engineering team will be working with vendor representatives to complete benchmarking steps and finalize the cluster's internal configuration. During...
-
Bell cluster will be unavailable Wednesday
The Bell cluster will be unavailable Wednesday, October 28, 2020 at 8:00am EDT for scheduled maintenance. During this time, our Engineering team will be working with vendor representatives to complete benchmarking steps and finalize the cluster's int...
-
Bell, Halstead, Hammer and CMS Clusters Maintenance
The Bell, Halstead, Hammer and CMS clusters will be unavailable Tuesday, November 3, 2020 at 8:00am EST for scheduled maintenance. The clusters will return to full production by %enddatetime%. During this time, the clusters will have their operating...
-
Home and Applications Filesystem Maintenance - All Clusters
Most of the research computing clusters (Brown, Gilbreth, Halstead, Hammer, Rice, Scholar, Snyder, WCERES, Workbench, and WSC Hadoop) as well as some other minor systems will be unavailable beginning at Tuesday, November 3rd, 2020 at 9:00am EST, for...
-
The Brown cluster began experiencing issues with its job scheduler around 4:00pm EST. The problem manifests itself as Slurm-related commands (slist, squeue, sinteractive, sbatch, etc) being slow, unresponsive or timing out. Queue selection dialogs in...
-
Bell cluster will be unavailable Wednesday
The Bell cluster will be unavailable Wednesday, November 11, 2020 at 8:00am EST for scheduled maintenance. During this time, our Engineering team will be working with vendor representatives to fine-tune performance of the Bell scratch filesystem and...
-
Bell cluster will be unavailable Wednesday
The Bell cluster will be unavailable Wednesday, November 25, 2020 at 8:00am EST for scheduled maintenance. During this time, our Engineering team will work on finalizing the cluster's internal configuration. Both cluster front-ends and compute nodes...
-
Monthly RCAC GitHub Maintenance
The Research Computing GitHub service (github.rcac.purdue.edu) will be unavailable Tuesday, December 1, 2020 from 9:00am - 12:00pm EST for scheduled maintenance. The service will return to full production by Tuesday, December 1st, 2020 at 12:00pm EST...
-
The ITaP GitHub service (github.itap.purdue.edu) will be unavailable Tuesday, December 1, 2020 from 1:00pm - 4:00pm EST for scheduled maintenance. The service will return to full production by Tuesday, December 1st, 2020 at 4:00pm EST. During this ti...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, December 2, 2020 from 8:00am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any trans...
-
The Scholar cluster will be taken down for regular inter-semester maintenance and upgrades starting at Wednesday, December 16th, 2020 at 8:00am EST. All jobs which cannot complete before then will be held queued during this time, and no one will be a...
-
The Bell cluster will be unavailable Wednesday, December 16, 2020 at 11:00am EST for scheduled maintenance. During this time, work will be performed on several auxiliary servers. Prior to the maintenance, any SLURM jobs which request a walltime which...
-
Access to RCAC Resources During ITaP Central Authentication Outage
On Sunday, December 27th, 2020, ITaP staff will perform major upgrades to the central authentication infrastructure. All applications that require logging in with BoilerKey or Career Account credentials will be unavailable Sunday, December 27, 2020 f...
-
Fortress Archive Monthly Maintenance
The Fortress Archive will be unavailable Wednesday, January 6, 2021 from 8:30am - 12:00pm EST for scheduled monthly maintenance (first Wednesday of every month). During this time, Fortress will receive normal software and hardware updates. Any transf...
-
The Bell cluster began experiencing issues with metadata on its scratch filesystem around 9:00pm. The problem manifests itself as ls -l command hangs indefinitely, while the plain regular ls (or \ls, or stat FILE) appear to be working. Engineers are...
-
The Bell cluster began experiencing issues with its scratch filesystem around 5:00am EST. Engineers are currently diagnosing the issue and have opened the ticket with the vendor to identify a fix. Job scheduling has been paused while this issue is be...
-
The Bell cluster began experiencing issues with its scratch filesystem around 4:00pm EST. Engineers are currently diagnosing the issue and have opened a ticket with the vendor to identify a fix. Job scheduling has been paused while this issue is bein...