April 2011 Archives

April 26, 2011

Gosset Fixed

Gosset is working again after a second visit from the HP technician

April 18, 2011

Gosset Down

Gosset did not come up properly after Friday's power problem; we are investigating.  It may be several days before we get it back up.http://www.math.mcmaster.ca/blogs/archives/computing_news/2011/04/serverpower-pro.html

April 15, 2011

Possible Service Interruptions

We are still trying to pinpoint the source of the partial power failure in the server room earlier this afternoon.  We know that something went wrong with a UPS unit which has served us faithfully for six years now, but we don't know precisely what.

Depending on what we find, we may need to shut down the storage array and compute servers with very little notice.   And a similar power failure might be possible, too.

There may be loss of access to home directories and interruptions to mail and web service with little or (should the power fail) no notice.  So: save early and save often.  I'll post an update once we know that things are stable again.


Enhanced by Zemanta

Server/Power Problem Friday Afternoon

We lost power to part of the Hamilton Hall server room on Friday afternoon just before 3:00 pm.  The main server wasn't affected, but the main storage array was, which means that mail, web and workstations were unavailable until the problem was corrected.  Web sites were back up by half past three, but other services were spotty until about four o'clock.

There was no damage to the files on the storage array, though some mail may have been returned to senders as undeliverable.

Most workstations will need to be rebooted (Alt-Ctrl-F1 then Alt-Ctrl-Del); some will need to be restarted (hold power button for ten seconds to turn off then turn back on).

Any jobs running on bayes, gosset or freesurface will have been lost as those servers were connected to the part of the power system which failed.

April 12, 2011

Firefox 4.0 Available

I've installed Firefox 4.0 on the ms workstations but I have not made it the default since this is a .0 release and so a little bit suspect; the Firefox icon will still bring up version 3.5.

But you can try Firefox 4 with the command firefox4 (at a command prompt or via Alt-F2).

April 8, 2011

Partial Shutdown on Saturday, April 30th

Facility Services has announced that air conditioning to Hamilton Hall will be turned off from 6:00 am to 4:30 pm on Saturday, April 30th.   In order to prevent damage from overheating, we will be shutting down most systems in the server room on Friday afternoon: this means bayes, gosset, freesurface, etc.



I will leave the main file/web/mail server up, but if the room starts getting too hot I will shutdown everything but web services (no email, no workstations, no changes to the web server).


Announcement from FS follows...

Continue reading Partial Shutdown on Saturday, April 30th.

"Webmail Upgrade" Phishing Spam

http://www.rhpcs.mcmaster.ca/Most of us are able to spot the "Webmail Upgrade" or "Webmail Account Warning" phishing spam these days, so I don't normally warn people about it.

But a spate of such spam has hit @math.mcmaster.ca accounts right on the heels of some server upgrades and mail problems, so I'm going to emphasize that RHPCS will never

  • ask you for your password
  • send a message without signing off with the name of a specific RHPCS staff member
  • commit more than two outrageous solecisms per message
More specifically, the messages which came today with the subject "Mcmaster.ca Webmail are Currently Upgrading" should simply be deleted.

Enhanced by Zemanta

April 7, 2011

Mail Problem on Wednesday

Email messages addressed directly to @math.mcmaster.ca addresses sent from off-campus sources (e.g. gmail.com, another university or from home without a VPN connection) were not deliverable between 5:00 pm Tuesday and 4:30 pm Wednesday.

Some of the undeliverable messages will have been queued off campus and delivered once @math.mcmaster.ca was accessible to external mail servers again.  Other messages will have bounced, in which case the sender will most likely have received a delivery-failure warning.

Messages sent from the following sources were not effected ...

Messages sent to @mcmaster.ca addresses which are redirected to @math.mcmaster.ca addresses (by UTS aliases or a univmail forwarding rule) were not effected, either.


April 6, 2011

Wikis Down

Wikis hosted at wiki.math.mcmaster.ca are down today and will be until Thursday morning.

External Mail Problem

Most mail from off campus addressed to @math.mcmaster.ca addresses is not reaching the server; the problem started when we moved the mail server from the ABB server room to the HH server room yesterday evening.

I'm working with UTS to resolve the problem and expect to have it sorted out this afternoon.

Note that ...

  • mail forwarded by @mcmaster.ca to @math.mcaster.ca is arriving;
  • mail forwarded by unvimail is arriving;
  • all mail originating from univmail, muss, other campus mails servers, or VPN-connected clients is arriving.

Continue reading External Mail Problem.

Post-Downtime Update

Our main server (ms.mcmaster.ca) is now back in HH after a few months in the ABB server room and is using a new, larger disk array, also in HH (we were borrowing space in ABB while ms was there).  Thanks for your patience as we completed another part of the migration to new server infrastructure.

A few notes regarding the downtime and recovery ...

  • contrary to my plan, the xguest login on the ms workstations did not work
  • the downtime extended to 8:45 pm instead of 7:00 pm
  • the web server was down from 4:55 pm to 5:20 pm
    • the main page (and other database-driven pages) were down for another hour
    • other sites (e.g. course and instructor pages) were OK
  • most workstations are working fine as of 9 o'clock Wednesday morning, but a few will need to be rebooted; if your workstation is frozen
    • hold down the power button for ten seconds to turn it off
    • wait five seconds
    • turn it back on
    • note that the boot may take five minutes or so while the disk is checked for errors

April 5, 2011

SSH Warnings

Because ms.mcmaster.ca has moved between buildings (from ABB to HH), it has been given a different IP number (i.e. network address).  You should remove the old entries for the server from your ssh host-key file in order to avoid dire warnings of "Offending keys".

ssh-keygen -R ms
ssh-keygen -R 130.113.105.93

Mail Hiccoughs

Some of you will be having trouble getting to your mail via imap clients or webmail until later on this evening: I neglected to redirect the mathmail.mcmaster.ca to the new network location of the server. My apologies.

Note that anyone using the addresses mail.math.mcmaster.ca or ms.mcmaster.ca won't see these problems - though mathmail.mcmaster.ca is the preferred address.

Downtime Extended but Web Sites Up

The scheduled downtime is not quite over.  Websites and printing are up but email and workstations are still down while I finish some work on the storage array.

Things should be working again at ca. 8:30 pm.  My apologies for the delay.

Note that all web sites hosted by ms.mcmaster.ca were down from 4:55 pm to 5:20 pm (contrary to my announced intention of limiting downtime to a few seconds).  The www.math.mcmaster.ca main page and other database-dependent pages were generating errors until 6:30 due to a network problem introduced during the server move.  Other parts of the site (i.e. most course and instructor pages) were fine.

Downtime This Evening - Changes

The scheduled downtime from 4:45 pm to 7:00 pm this evening will proceed as planned but with these differences ...

  1. the www.math.mcmaster.ca web site will not be down for more than a few seconds (though you won't be able to make changes during this period)
  2. limited-use guest accounts will be available on the most workstations
Once the upgrades begin at 4:45 today, you will not be able to get to your email, use your linux account on your workstation or print.  But if you logout and login with the username xguest, you should find that you are able to use a browser (if firefox doesn't work, use Chromium).

All services should be on-line again by 7:00 pm; some (e.g. printing) will come up sooner.

April 4, 2011

Dowtime Tuesday Afternoon

We will be moving our main server and storage array from their temporary berth in ABB back to our HH server room on Tuesday. Email, workstation and home-directory access will be down from 4:45 pm to 7:00 pm on Tuesday; web sites will be down from ca. 6:50 pm to 7:00 pm.

If all goes well, I should have the linux workstations set up so that you can login and run a browser without logging into the server; you won't be able to read your mail @math.mcmaster.ca or get to your files, though.

Enhanced by Zemanta

About this Archive

This page is an archive of entries from April 2011 listed from newest to oldest.

March 2011 is the previous archive.

May 2011 is the next archive.

Find recent content on the main index or look in the archives to find all content.