Problem connecting to SITS and other services, 12 March

[12 March 2014]

At 09:30 this morning, one of the two servers that provide our directory services (LDAP) was taken down for routine maintenance, which included moving it to a more resilient rack location.

The second server was expected to take up the full load, as it usually does. However, it was unable to handle the demand placed on it and failed.

A further problem arose when the re-sited server was brought back into service: the authentication services did not start automatically as expected. It was this failure that left some services unable to complete logins.

We are investigating ways to prevent a recurrence and apologise for the inconvenience caused.