Halstead
-
Scheduling paused on multiple clusters
The Bell, Brown, Gilbreth, Halstead, and Scholar clusters began experiencing issues with their Data Depot mounts around 9:50am EST. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been paused while...
-
Scheduling Paused on Brown, Gilbreth, Halstead, and Hammer
As of 11:30am EDT, the Brown, Gilbreth, Halstead, and Hammer clusters began experiencing issues with their filesystems which may cause login failures. Engineers are currently investigating the root cause, and in the interim, job scheduling has been p...
-
MATH data center cooling outage
The Math building data center began experiencing issues with its cooling system around 1:40pm EDT. To minimize thermal load on the cooling infrastructure, job scheduling has been paused and all idle compute nodes on Anvil, Bell, Geddes, Gilbreth, and...
-
The Halstead cluster began experiencing issues with its scratch file system around 8:00am EDT. The problem manifests as various I/O errors or hangs when reading, writing or listing scratch directories. Engineers are currently diagnosing the issue and...
-
Unscheduled Data Depot Slowdown on Community Clusters
As of 9:00am EDT, users of community clusters may experience slowness while trying to access Data Depot (including loading modules, starting applications, or reading data). The symptoms appear on both login and compute nodes. System engineers are act...
-
Unscheduled Math Data Center Cooling Outage
The Math building data center began experiencing issues with its cooling system around 11:40am EDT. As one manifestation, users may experience issues while logging in to the Anvil, Bell, Gilbreth, and Halstead clusters. To minimize thermal load on...
-
Whole-Floor Cluster Maintenance
The majority of Research Computing computational resources (Bell, Brown, Geddes, Gilbreth, Halstead, Hammer, Scholar, Weber, and Workbench clusters) will be unavailable March 15, 2022 4:00pm - March 16, 2022 12:00pm EDT during Whole-Floor Data Depo...
-
Unscheduled Math data center cooling outage
The Math building data center began experiencing issues with its cooling system around 11:40am EST. As one manifestation, users may experience issues while logging in to the Anvil, Bell, Gilbreth, Halstead, Workbench, and Data Depot clusters. To m...
-
As of 8:00pm EST on Friday, February 11th, 2022, the Data Depot filesystem outage has been resolved and scheduling has been resumed on all clusters. The Bell, Brown, Gilbreth, Halstead, Scholar, Workbench, and Data Depot clusters began experiencing i...
-
Research Computing Holiday Break
Research Computing personnel will observe the university winter break from 5:00pm EST on Wednesday, December 22nd, 2021, and will resume normal business hours on Monday, January 3rd, 2022. During this time, Research Computing services will conti...
-
Unscheduled multiple clusters and Data Depot outage
The Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, and Workbench clusters and Data Depot began experiencing issues with intermittent high load on the Data Depot servers around 4:30pm EDT. Engineers are currently diagnosing the issue and are working to...
-
Unscheduled multiple clusters and Data Depot outage
The Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, and Workbench clusters and Data Depot servers began experiencing issues with Data Depot mounting on Wednesday, September 29th, 2021 around 4:40pm EDT. Engineers are currently diagnosing the issue and...
-
Unscheduled Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, and Data Depot outage
The Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, and Data Depot clusters began experiencing issues with Data Depot mounting around 7:00am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job scheduling has been...
-
Unscheduled Brown, Gilbreth, Halstead, Hammer, and Workbench outage
The Brown, Gilbreth, Halstead, Hammer, and Workbench clusters began experiencing issues with home mounts on Thursday, September 16th, 2021 at around 11:00am EDT. Engineers are currently diagnosing the issue and are working to identify a fix. Job schedul...
-
Unscheduled Data Depot and community clusters outage
At about 9:30am EDT, Data Depot servers started experiencing a ramping high load. Coupled with ongoing scaling issues with the metadata subsystem, this caused Data Depot to become increasingly unresponsive for both community clusters and network d...
-
Unscheduled Data Depot outage on multiple clusters
The Bell, Brown, Gilbreth, Halstead, Scholar, and Workbench clusters began experiencing issues with mounting the old Data Depot filesystem around 12:30am EDT. Multiple nodes are flagged offline by an automatic check, and bioinformatics application suite...
-
RCAC Whole-Floor Downtime and Power Work
The majority of the Research Computing computational resources will be unavailable July 30, 2021 7:00am - August 1, 2021 12:00pm EDT for a whole-floor downtime due to electrical power work in the MATH and POD data centers. Along with a required preven...
-
Scheduling Paused on Multiple Clusters
At about 4:00pm today (Wednesday, July 21, 2021), system engineers found an issue with the schedulers on the Bell, Brown, Gilbreth, Halstead, and Scholar clusters. Job scheduling has been paused while this is being investigated. Symptoms of this pro...
-
Intermittent Access Failures on Data Depot
As of Thursday, June 17th, 2021 at 11:00am EDT, users of community clusters may experience intermittent "permission denied" errors while trying to access their files on Data Depot. Errors may come and go, and may appear on both login and c...
-
Whole-Floor Cluster Maintenance
The majority of Research Computing computational resources (Bell, Brown, Gilbreth, Halstead, Hammer, Scholar, WCERES, Workbench, and WSC Hadoop clusters) will be unavailable Tuesday, May 11, 2021 at 5:00pm EDT for Data Depot migration work. The clust...