Forums incident (slowdown or error)
Forums incident (slowdown or error)
Last night GMT and today, you maybe experienced very slow AP forums or even no access at all.
The reason is simple: the whole RAID system (!!!) died for a still unknown cause (possible EMP -Electro Magnetic Pulse- despite the protections). The electronic was replaced but the hard disks remained dead. This affected Apache/PHP/FTP/filers systems. The SQL (database) system is separate and was not affected, so data is safe. Routers were affected to some degree.
The AP bots server is also a separate machine and was not affected at all.
The whole web system switched to an emergency cluster that provided slow access due to the fact it was at the same time verifying files and restoring some backups. Also routers were being fixed/replaced.
We reached a disastrous packet loss of 81% average.
The RAID system is being rebuilt on new machines. Operations should resume to normal conditions in a few.
The power supply system is being investigated; techs are searching for hacker hamsters....
Somewhere among these UPS and power transformers a hamsters commando is hiding....
The reason is simple: the whole RAID system (!!!) died for a still unknown cause (possible EMP -Electro Magnetic Pulse- despite the protections). The electronic was replaced but the hard disks remained dead. This affected Apache/PHP/FTP/filers systems. The SQL (database) system is separate and was not affected, so data is safe. Routers were affected to some degree.
The AP bots server is also a separate machine and was not affected at all.
The whole web system switched to an emergency cluster that provided slow access due to the fact it was at the same time verifying files and restoring some backups. Also routers were being fixed/replaced.
We reached a disastrous packet loss of 81% average.
The RAID system is being rebuilt on new machines. Operations should resume to normal conditions in a few.
The power supply system is being investigated; techs are searching for hacker hamsters....
Somewhere among these UPS and power transformers a hamsters commando is hiding....
- Alphacenta
- Leetissimo!
- Posts: 3200
- Joined: Thu Apr 20, 2006 8:05 pm
Lupusceleri L220/24 Agent.
Silversmith upcoming TL5 twink.
Wolfseye L110/12 Adventurer (towertwink).
Lysdexic L90/9 Agent (Mimic Enf towertwink).
Aesculapias L21/2 Doctor (ancient).
Aaaand various other alts.
Silversmith upcoming TL5 twink.
Wolfseye L110/12 Adventurer (towertwink).
Lysdexic L90/9 Agent (Mimic Enf towertwink).
Aesculapias L21/2 Doctor (ancient).
Aaaand various other alts.
Hyde wrote:Can I post again?
Ok, I can post again. I was starting to take it as a hint.
(when I logged on last night and tried to send messages I could read them, I could see other people posting, but it kept telling me I couldn't post ... though ... I'm not on that computer so I may not be able to post from home, will test that by quoting myself again tonight)
Hyde wrote:Hyde wrote:Can I post again?
Ok, I can post again. I was starting to take it as a hint.
(when I logged on last night and tried to send messages I could read them, I could see other people posting, but it kept telling me I couldn't post ... though ... I'm not on that computer so I may not be able to post from home, will test that by quoting myself again tonight)
Yes it's a hint: I need my favorite (except me) forum admin back
Seriously, nothing had change in your forums account/permissions and nothing will ever change.
I assume your issue came from some malfunction of the network at that time although I don't see what one. Please let us know if it works when you are back at home. If it doesn't, please provide info and I'll look into the problem.
We are back to normal conditions.
The average packet loss is less than 1% (Still a peak at 40+ % for some reasons being investigated)
The servers automatically restored our files (almost) without any human intervention.
The whole servers cluster is being redesigned in order to prevent such an exceptional accident, especially the way RAID is organized and the way auto backups are organized.
(If you are a Geek, you'll be happy to learn that a whole new storage system is in the making -and in fact has been in the making for 6 months- based on Solaris/Open Solaris and Sun ZFS systems. <---- Geek)
Ah... OK... you want the truth and not tech bullshit... well... the hamsters are not on strike anymore!
The average packet loss is less than 1% (Still a peak at 40+ % for some reasons being investigated)
The servers automatically restored our files (almost) without any human intervention.
The whole servers cluster is being redesigned in order to prevent such an exceptional accident, especially the way RAID is organized and the way auto backups are organized.
(If you are a Geek, you'll be happy to learn that a whole new storage system is in the making -and in fact has been in the making for 6 months- based on Solaris/Open Solaris and Sun ZFS systems. <---- Geek)
Ah... OK... you want the truth and not tech bullshit... well... the hamsters are not on strike anymore!