NSF01: Unscheduled downtime

The recent disk failure will be repaired tonight on the large temp storage. This requires a reboot. Sorry for the short notice.
Also, there will be an (overseen) Point of Use (POU) upgrade. When the system comes back up again,

/temporary/saswork

will be (hopefully) functional again as RAID0. The long-term solution involves

  • having an on-site spare disk provided by SGI
  • having an implementation plan for RAID5 available.

The next time a disk fails, RAID5 will be implemented, using the available on-site spare disk. This should reduce the duration of the next downtime, and prevent future downtimes.

Comments are closed.