OCF Mail Down
OCF mail is down; please stay tuned for updates.
The OCF's DNS server just underwent spontaneous massive existence failure and we are trying to get the system to boot. Further bulletins as events warrant.
Hi,
Tonight from 2am to 6am, Eshleman Hall will have a power outage. Stay tuned for updates.
Update 8:44pm: Everything except the OCF webserver, disk array, and authentication servers has now been shut down.
Update 9:00pm: Shutting down MySQL.
Update 9:15pm: Webserver shut down.
Update 9:20pm: Disk array and authentication servers shut down. The only things still up are infrastructure-related servers, which allow us to manage machines remotely. The UPS reports 2:15 of runtime, but the outage is scheduled for 4 hours; let's hope the 4-hour figure is a conservative estimate.
Update 6:36am: Our infrastructure is marginally restored. We are running tests to make sure everything that is up is working. Login servers will be up shortly.
Update 7:04am: We hit a few issues bringing things back up and are working to resolve them ASAP.
Update 7:31am: We had some disk array issues, which seem to be resolved for the time being. All the Windows machines work and printing works. We will boot the login servers once we are sure permissions and such are working properly.
Update 7:56am: Our DNS server is being stubborn and seems to be the root cause of the recent issues. MySQL should be back up.
Update 8:04am: FSCK time. What a fun way to start the morning: fsck'ing broken filesystems.
Update 8:12am: The webserver should be working again.
Update 8:22am: DNS is plodding along. Expect a delay between 9:00 and 10:30, since I have class then.
Update 8:41am: Running fsck on the DNS server, which will likely be down for a while. Login servers should be up and running; docs and webmail should work too.
Update 8:44am: Spoke too soon; disregard the previous post.
Update 10:56am: Still working on getting DNS up.
Update 11:11am: DNS should be up now.
Update 6:40pm: We are reaching hour 30 of this adventure, and most of our services have been restored. Mail is a work in progress, but your stored email should be fully accessible now. apocalypse.ocf.berkeley.edu won't turn on, so we will leave it off for now while we straighten out everything else.
In a rather unlucky streak of timing, here are the other known failures at the OCF (today was not the best day):
2 printers (1 critically)
3 infrastructure-related servers
We will update you as we get things fixed.
MySQL went down for a short time yesterday; we hosted the service from backups in read-only mode overnight.
UPDATE (15:41):
ETA is 8-10 hours before we get MySQL up and running again.
UPDATE (19:41):
MySQL and PostgreSQL are back in business. Props to jaws.ocf.berkeley.edu for performing admirably.
Hi all,
Yesterday's reboot left behind some nasty problems in our mail volume, so we are going to take some downtime to fix them. I expect system-wide downtime from 7pm to 11pm, and I apologize for the inconvenience. In hindsight we should have handled this better, but we learn from our mistakes.
Sorry for the trouble.
Update 4:56pm
We may start a bit early, at around 5:30-6:00pm, so don't be surprised if /var/mail disappears during that time.
Update 8:40pm
Most things should be working again. Thanks for your patience.
Hi all,
We rebooted our disk array, resulting in about 30 minutes of system-wide downtime. We are notifying everyone in case there are any stray stale NFS handles hanging around, or in case any scripts broke. Don't panic if you couldn't log into our machines during this time; it was nothing serious.
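If you want to check whether one of your own directories was left with a stale NFS handle, something like the sketch below may help. It is only an illustration, not an OCF tool: the function names is_stale_handle and find_stale are made up for this example, and it assumes a stale handle surfaces as an ESTALE error when the path is stat'ed, which is the usual behavior on Linux.

```python
import errno
import os


def is_stale_handle(path):
    """Return True if stat() on path fails with ESTALE (stale NFS file handle).

    Hypothetical helper for illustration; any other error (for example a
    missing path) is reported as "not stale".
    """
    try:
        os.stat(path)
    except OSError as exc:
        return exc.errno == errno.ESTALE
    return False


def find_stale(paths):
    """Return the subset of paths whose handles look stale."""
    return [p for p in paths if is_stale_handle(p)]


if __name__ == "__main__":
    # Check the home directory and the current directory as examples.
    candidates = [os.path.expanduser("~"), os.getcwd()]
    for p in find_stale(candidates):
        print("stale NFS handle:", p)
```

If a path does show up as stale, leaving the directory and entering it again, or restarting the affected script, is usually enough to pick up a fresh handle.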
We are moving the webserver onto better hardware; it should be a bit faster after the transition.
We've restored webmail functionality. Please let us know if there are any problems or hiccups.