<?xml version="1.0" encoding="UTF-8" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:media="http://search.yahoo.com/mrss/">
	<channel>
		<title>RCAC - Outages and Maintenance, Outages, Maintenance</title>
		<link>https://rcac.purdue.edu/news/rss/Outages%20and%20Maintenance</link>
		<description><![CDATA[news::news.feed description]]></description>
		<atom:link href="https://rcac.purdue.edu/news/rss/Outages%20and%20Maintenance" rel="self" type="application/rss+xml" />
		<language>en</language>
		<lastBuildDate>Sun, 05 Apr 2026 13:16:35 EDT</lastBuildDate>
					<item>
				<title><![CDATA[Github monthly maintenance]]></title>
				<link>https://rcac.purdue.edu/news/7652</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7652</guid>
				<description><![CDATA[<p>The Purdue Github services will be unavailable Wednesday, April 1, 2026 from 3:00pm - 5:00pm EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, the Github appliances will receive normal software updates, and you may have difficulties accessing your repositories.</p>
]]></description>
				<pubDate>Wed, 01 Apr 2026 15:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Fortress Archive Monthly Maintenance]]></title>
				<link>https://rcac.purdue.edu/news/7648</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7648</guid>
				<description><![CDATA[<p>The Fortress Archive will be unavailable Wednesday, April 1, 2026 from 8:00am - 12:00pm EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, Fortress will receive normal software and hardware updates. Any transfers which request files sent to or from Fortress will either block or fail until this maintenance is completed.</p>
]]></description>
				<pubDate>Wed, 01 Apr 2026 08:00:00 -0400</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Negishi Cluster Filesystem Interruption (Resolved)]]></title>
				<link>https://rcac.purdue.edu/news/7651</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7651</guid>
				<description><![CDATA[<p>Between approximately 9:30 PM and 10:45 PM EDT on March 30, 2026, the Negishi home file system experienced issues that prevented users from successfully connecting to login nodes.</p>
<p>Service was fully restored at 10:45 PM EDT, and login functionality has returned to normal. No data loss occurred.</p>
]]></description>
				<pubDate>Mon, 30 Mar 2026 21:30:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Temporary Impact to Data Depot Operations]]></title>
				<link>https://rcac.purdue.edu/news/7650</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7650</guid>
				<description><![CDATA[<p>We’re currently investigating an issue affecting IO operations to the Data Depot. Our monitoring has detected that the primary storage disks used for data writes are nearing capacity, which is causing IO failures for jobs writing to the Depot.</p>
<p>To prevent further impact, job submissions to clusters connected to Depot have been temporarily paused while the team works restore normal operation.</p>
<p><strong>Impact:</strong> Users may experience job delays or failures when writing to the Data Depot.</p>
<p><strong>Current Status:</strong> Our engineering team is actively addressing the capacity issue.</p>
<p><strong>Next Update:</strong> We will provide an update once the service is restored or we have more information to share.</p>
]]></description>
				<pubDate>Mon, 30 Mar 2026 09:30:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Weber Cluster Maintenance]]></title>
				<link>https://rcac.purdue.edu/news/7636</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7636</guid>
				<description><![CDATA[<p>As of 11:00am EDT, engineers have completed maintenance and have returned the Weber cluster back to normal service. Please report any issues to <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a></p>
<p>The Weber cluster will be unavailable Wednesday, March 18, 2026 from 8:00am - 5:00pm EDT for scheduled maintenance. The cluster will return to full production by 5:00pm EDT.</p>
<p>During this time, Weber will have its operating system patched.</p>
<p>Any Slurm jobs which request a walltime which would take them past Wednesday, March 18, 2026 at 8:00am EDT will not start and will remain in the queue until after the maintenance is completed.</p>
]]></description>
				<pubDate>Wed, 18 Mar 2026 08:00:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Power Outage Impacting Multiple Clusters — Recovery Underway]]></title>
				<link>https://rcac.purdue.edu/news/7640</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7640</guid>
				<description><![CDATA[<p>At approximately 6:00 AM EDT, a power outage impacted systems in the Math Data Center. Most services have now been restored.</p>
<p>Due to the outage, some jobs on Gilbreth did not requeue automatically. Users should check the status of any jobs that were running early this morning and resubmit them if needed.</p>
]]></description>
				<pubDate>Wed, 18 Mar 2026 06:00:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Negishi experiencing 2nd service disruption]]></title>
				<link>https://rcac.purdue.edu/news/7639</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7639</guid>
				<description><![CDATA[<p>We are again investigating an issue affecting Negishi. The system is currently unresponsive or unavailable for some users, including SSH access. We will provide an update when service is restored or as soon as we have additional information.</p>
]]></description>
				<pubDate>Tue, 17 Mar 2026 14:30:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Negishi experiencing service disruption]]></title>
				<link>https://rcac.purdue.edu/news/7638</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7638</guid>
				<description><![CDATA[<p>We are investigating an issue affecting Negishi. The system is currently unresponsive or unavailable for some users, including SSH access. We will provide an update when service is restored or as soon as we have additional information.</p>
]]></description>
				<pubDate>Tue, 17 Mar 2026 11:00:00 -0400</pubDate>
									<category>Outages and Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Negishi Cluster Service Interruption]]></title>
				<link>https://rcac.purdue.edu/news/7629</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7629</guid>
				<description><![CDATA[<p>We are currently experiencing an unexpected outage affecting the Negishi cluster. We are actively working to restore service and will provide an update once the system is stable.</p>
<ul>
<li>
<p><strong>Impact:</strong> Users are currently unable to access their home directories, and SSH connections or active terminal sessions may freeze or be denied.</p>
</li>
<li>
<p><strong>Current action:</strong> System administrators are performing a system reboot of the affected storage infrastructure to clear the error and restore file access.</p>
</li>
</ul>
]]></description>
				<pubDate>Wed, 11 Mar 2026 15:00:00 -0400</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Weber Authentication Changes]]></title>
				<link>https://rcac.purdue.edu/news/7622</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7622</guid>
				<description><![CDATA[<p>As part of the efforts to certify Purdue as a CMMC Level 2 compliant institution, authentication to Weber, ThinLlinc access, and data egress will change during a maintenance scheduled for 8 AM -12 PM on March 5th, 2026.</p>
<p>** How does this change impact you? **</p>
<p>Login to Weber: The change will simplify logging into Weber. Weber users currently need to authenticate to Luna VPN or VDI with Luna credentials, and then additionally authenticate to Weber with BoilerAD credentials. After the change, users will just need to use their Luna password when logging into Weber resources, including via ThinlLinc, SMB, and SSH. Luna password will be the same password as is currently used to log into the VPN.</p>
<p>ThinlLinc: Anyone accessing Weber via ThinLinc directly from their endpoint will need to switch to using ThinLinc from a Luna Desktop.</p>
<p>Data Egress: File egress will be disabled, and users will need to submit a ticket each time they wish to transfer files outside weber. Egress to Apricon drives will no longer be supported on regular endpoints.</p>
<p>During the maintenance window, job scheduling though Slurm and network drive mounts of Weber storage will be unavailable.</p>
<p>These changes position Purdue to meet the requirements of CMMC Level 2 compliance and will be crucial for continuation of various grants and contracts.</p>
<p>Please reach out to <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a>? if you have any questions.</p>
]]></description>
				<pubDate>Thu, 05 Mar 2026 08:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Github monthly maintenance]]></title>
				<link>https://rcac.purdue.edu/news/7626</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7626</guid>
				<description><![CDATA[<p>Due to prolonged software updates the Purdue Github services will be extending service unavailability today Wednesday, March 4, 2026 from 3:00pm - 7:00pm EST (previously 3:00pm - 5:00pm) for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, the Github appliances will receive normal software updates, and you may have difficulties accessing your repositories.</p>
]]></description>
				<pubDate>Wed, 04 Mar 2026 17:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Scheduling paused on Gautschi CPU cluster due to cooling leak]]></title>
				<link>https://rcac.purdue.edu/news/7627</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7627</guid>
				<description><![CDATA[<p>We have identified a cooling leak in Gautschi cluster that impacts CPU nodes and are actively working to resolve the issue.</p>
<p>**How does this impact you?
**
We have paused job scheduling on the CPU nodes. You can continue to submit jobs to Gautschi nodes, however the jobs will remain queued until after the maintenance is complete.</p>
<p>The cooling leak does not affect Gautschi AI nodes. Therefore, those nodes continue to operate normally.</p>
<p>We will provide additional updates by Thursday, March 5 at 12 pm. Please reach out to <a href="mailto:rcac-help@purdue.edu">rcac-help@purdue.edu</a> if you have any questions or need support.</p>
]]></description>
				<pubDate>Wed, 04 Mar 2026 17:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Github monthly maintenance]]></title>
				<link>https://rcac.purdue.edu/news/7624</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7624</guid>
				<description><![CDATA[<p>The Purdue Github services will be unavailable Wednesday, March 4, 2026 from 3:00pm - 5:00pm EDT for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, the Github appliances will receive normal software updates, and you may have difficulties accessing your repositories.</p>
]]></description>
				<pubDate>Wed, 04 Mar 2026 15:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[MATH Datacenter Cooling issue - Job scheduling paused on Anvil/Gautschi]]></title>
				<link>https://rcac.purdue.edu/news/7625</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7625</guid>
				<description><![CDATA[<p>The MATH datacenter started experiencing issues with cooling systems around 12pm. Job scheduling on the Anvil and Gautschi clusters was paused shortly after and scheduling resumed at 1:30pm.</p>
]]></description>
				<pubDate>Wed, 04 Mar 2026 12:00:00 -0500</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Fortress Archive Upgrade]]></title>
				<link>https://rcac.purdue.edu/news/7617</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7617</guid>
				<description><![CDATA[<p>The Fortress Archive will be unavailable March 4, 2026  8:00am - March 5, 2026  9:00am EST for an software upgrade.  Fortress will be updated to HPSS 10.3.</p>
<p>During this time, Fortress will receivea major upgrade to its software.  This will include some database conversions to add/modify some database tables and columns.   Any transfers which request files sent to or from Fortress will either block or fail until this maintenance is completed.</p>
<p>Note that this outage is entended to the next morning because of full floor power work being done in the data center that Fortress is housed in.  It was deemed better to leave the system unavailable rather than bring it up and then back down for the power outage.</p>
]]></description>
				<pubDate>Wed, 04 Mar 2026 08:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Data Depot Filesystem issue: Scheduling Resumed]]></title>
				<link>https://rcac.purdue.edu/news/7611</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7611</guid>
				<description><![CDATA[<p>An internal portion of the Data Depot filesystem is currently offline, as a result, all scheduling has been paused until this issue is resolved.</p>
<p><strong>Impact to you</strong>
Attempts to read files that are on the affected storage may result in error messages</p>
<p>Our IT team is actively working with the vendor to restore service as quickly as possible. We will send an update as soon as more information is available.</p>
]]></description>
				<pubDate>Wed, 11 Feb 2026 14:30:00 -0500</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Gilbreth Remains Unavailable – Update Expected by 10:00 AM]]></title>
				<link>https://rcac.purdue.edu/news/7608</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7608</guid>
				<description><![CDATA[<p>The scheduled Math Data Center maintenance is complete. The <strong>Gilbreth cluster remains unavailable</strong> as we continue post‑maintenance work. <strong>The next update will be provided by 10:00 AM tomorrow (Friday), or sooner if available.</strong></p>
<p>We appreciate your patience and understanding as we work to restore Gilbreth service.</p>
]]></description>
				<pubDate>Fri, 06 Feb 2026 00:15:00 -0500</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[Unscheduled Data Depot Service Outage]]></title>
				<link>https://rcac.purdue.edu/news/7607</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7607</guid>
				<description><![CDATA[<p>RCAC’s Data Depot storage system is currently unavailable. Our team is actively investigating and working to restore service as quickly as possible.</p>
<p>We will provide updates as the situation develops and once service is fully restored.</p>
<p>We appreciate your patience and understanding.</p>
]]></description>
				<pubDate>Thu, 05 Feb 2026 08:00:00 -0500</pubDate>
									<category>Outages</category>
							</item>
					<item>
				<title><![CDATA[February 5 Maintenance – Math Data Center Upgrades and Service Impact]]></title>
				<link>https://rcac.purdue.edu/news/7591</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7591</guid>
				<description><![CDATA[<p>On Thursday, February 5, RCAC will perform planned maintenance in the MATH data center to support cooling upgrades and capacity improvements as part of the ongoing MATH datacenter renovation project.</p>
<p>During this maintenance window, several clusters will experience a temporary outage so that hardware can be safely powered down while facility work is performed:</p>
<ul>
<li>
<p>Gautschi, Gilbreth, Negishi, Bell, and Anvil cluster nodes will be powered down.</p>
</li>
<li>
<p>The Gilbreth’s legacy V100 GPUs, that are well past their lifetime, will be decommissioned.</p>
</li>
<li>
<p>Hammer (Math nodes) and Geddes: A subset of nodes will be powered down but the services will be available, unless communicated separately.</p>
</li>
</ul>
<h3>How does this maintenance impact you?</h3>
<ul>
<li>
<p>Clusters listed in this message won’t be available to run jobs during the maintenance.</p>
</li>
<li>
<p>Any jobs requesting a walltime which would take them past the start of the maintenance will not start and will remain in the queue until after the maintenance is completed.</p>
</li>
<li>
<p>Users can continue to access their data.</p>
</li>
<li>
<p>GenAI studio will remain available. This maintenance will position Purdue to support growing computational needs. Users should see long‑term benefits in system reliability and our ability to support future computing and AI resources.</p>
</li>
</ul>
<p>If you have questions about how this outage will affect your work or need support, please contact <a href="mailto:rcac%E2%80%91help@purdue.edu">rcac-help@purdue.edu</a>.</p>
]]></description>
				<pubDate>Thu, 05 Feb 2026 07:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
					<item>
				<title><![CDATA[Github monthly maintenance]]></title>
				<link>https://rcac.purdue.edu/news/7606</link>
				<guid isPermaLink="true">https://rcac.purdue.edu/news/7606</guid>
				<description><![CDATA[<p>The Purdue Github services will be unavailable Wednesday, February 4th 2026 from 3:00pm - 5:00pm EST for scheduled monthly maintenance (first Wednesday of every month).</p>
<p>During this time, the Github appliances will receive normal software updates, and you may have difficulties accessing your repositories.</p>
]]></description>
				<pubDate>Wed, 04 Feb 2026 15:00:00 -0500</pubDate>
									<category>Maintenance</category>
							</item>
			</channel>
</rss>