Article #7443: Unscheduled Data Depot Outage
The Data Depot storage system began experiencing issues starting around 4:30pm EDT today. Engineers are currently diagnosing the issue and are working...
The Geddes cluster began experiencing issues with its storage system around 4 AM this morning. Some users may be experiencing no space left on device...
Edit: The Data Depot file system has returned to full service and scheduling has resumed on all clusters. The Data Depot storage system began experie...
Edit: Data Depot functionality has been restored. The Data Depot file system began experiencing issues with writes around 2:30pm EDT. The data migrat...
Anvil is experiencing a problem with new user and allocation propagation. Our engineers are working on the fix, and will keep this updated. The proble...
The Gautschi cluster began experiencing issues with cooling around 2:00pm EST. Engineers are currently diagnosing the issue and are working to identif...
Today, RCAC user management systems sent incorrect email messages to many faculty partners and their resource managers. Please ignore any recent email...
A power outage affected our datacenters at approximately 3:15pm. Cluster services appear to be restored; however, access and authentication may be slowed wh...
The Weber cluster began experiencing issues with logins around 10:00am EDT. Engineers are currently diagnosing the issue and are working to identify a...
Beginning at 10:00 AM EST on June 10th, the Anvil cluster will experience a brief network interruption to fix an issue related to network connectivity. We...
Beginning at 19:00 EST, the Anvil cluster will experience a brief network interruption to fix an issue related to network connectivity. Expected retu...
RCAC systems are experiencing networking-related issues that impact access to some destinations on the Internet. We are actively monitoring the situat...
The Anvil cluster began experiencing permission issues with project directories around noon. We are working on the fix. Job scheduling has...
Due to a campus-wide power outage, the Anvil, Negishi, Rowdy, and Scholar clusters experienced an unscheduled reboot on Friday, April 4th, 2025 at 8:3...
At around 10:45am EDT, Gilbreth scheduling was paused in order to reduce thermal load while emergency plumbing work is performed on the cooling loop t...
At around 1:30 PM, Bell, Negishi, and Gilbreth began to exhibit excessively high temperatures. Job scheduling has been paused while this issue is being...
At around 11:00am, Bell's scratch filesystem began to show signs of a severe performance degradation. We have paused job scheduling on Bell while eng...
The Gautschi cluster began experiencing issues with its power feed around 06:45am. Engineers are currently diagnosing the issue and are working to ide...
We have noticed a discrepancy in the allocation usage after the outage, so you may see incorrect usage for your allocation(s) from mybalance. Our engi...
The Gautschi cluster began experiencing issues with internal fabrics around 02:30 2025-02-13. Engineers are currently diagnosing the issue and are wor...