ok so this outage started a little before 9am this morning:
9:05 AM PDT We are currently experiencing elevated error rates with S3. We are investigating.
Translation: s3 is down
9:26 AM PDT We're investigating an issue affecting requests. We'll continue to post updates here.
translation: its still down
9:48 AM PDT Just wanted to provide an update that we are currently pursuing several paths of corrective action.
translation: its still down
10:12 AM PDT We are continuing to pursue corrective action.
translation: its still down
10:32 AM PDT A quick update that we believe this is an issue with the communication between several Amazon S3 internal components. We do not have an ETA at this time but will continue to keep you updated.
translation: its still down
11:01 AM PDT We're currently in the process of testing a potential solution.
translation: its still down
11:22 AM PDT Testing is still in progress. We're working very hard to restore service to our customers.
translation: its still down
11:45 AM PDT We are still in the process of testing a series of configuration changes aimed at bringing the service back online.
translation: its still down
12:05 PM PDT We have now restored communication between a small subset of hosts. We are working on restoring internal communication across the rest of the fleet. Once communication is fully restored, then we will work to restore request processing.
translation: its still down
12:25 PM PDT We have restored communication between additional hosts and are continuing this work across the rest of the fleet. Thank you for your continued patience.
translation: its still down
12:51 PM PDT The restored hosts are stable and we are moving forward in restoring communication between additional hosts.
translation: its still down
1:17 PM PDT We continue to make incremental progress and communication between additional hosts has been restored. We are continuing with the plan to restore communication across Amazon S3's large fleet of hosts.
translation: its still down
1:38 PM PDT At this point, we are accelerating progress on restoring internal communication as all signs continue to look good.
translation: its still down
2:03 PM PDT We have restored all internal communication between hosts in the EU and we are continuing to make progress in the US. Once all internal communication has been restored, we will start a multi-step process to begin accepting requests across Amazon S3 locations.
translation: its still down
2:19 PM PDT A quick update to let you know that we have now also restored all internal communication between hosts in our West Coast facilities in the US.
translation: its still down
5 and a half hours of outage... booooooooooooooooo!
Sunday, July 20, 2008
Thursday, July 17, 2008
Comments enabled
You can now comment on my blog posts. Please be nice, I have poor self esteem.
Posted by
Michael Economy
Wednesday, July 16, 2008
To all executive search consultants
Dear sir/madam,
You are not an executive search consultant, you are a recruiter, please do everyone a favor and drop the charade.
Thanks,
-Michael
executive source code production specialist,
Goodreads Inc
You are not an executive search consultant, you are a recruiter, please do everyone a favor and drop the charade.
Thanks,
-Michael
executive source code production specialist,
Goodreads Inc
Posted by
Michael Economy
Tuesday, July 15, 2008
Analytics
So I'm looking through the analytics for a certain social network for readers I work on and it looks like one of our popular google queries is "101 sex positions". Ok, thats a little bit funny, but how many impressions have we seen on that keyword in the last month? 69. HAHAHAHAHA.
Posted by
Michael Economy
What I learned today
kill all matches the full path in addition to the path displayed by the 'ps' comment
what i also learned:
plesk lets you restart sshd! :D
[root@blah current]# ps x
UID PID PPID C STIME TTY STAT TIME CMD
...
root 32553 1 0 13:06 ? Rs 0:00 sshd: root@pts/0
root 32560 32553 0 13:06 pts/0 Ss 0:00 -bash
root 24178 1 0 13:25 ? Ss 0:00 /usr/local/sbin/sshd -p 8080
root 16228 1 0 13:43 ? Ss 0:00 /usr/local/sbin/sshd
root 16350 16228 0 13:43 ? Ss 0:00 sshd: root
root 19985 32560 0 13:45 pts/0 R+ 0:00 ps -ef x
[root@blah current]# killall /usr/local/sbin/sshd
Connection to blah.com closed by remote host.
Connection to blah.com closed.
~ $ ssh -A root@blah.com
ssh: connect to host blah.com port 22: Connection refused
~ $ FUCK!!!!!
-bash: !: event not found
what i also learned:
plesk lets you restart sshd! :D
Posted by
Michael Economy
Sunday, July 13, 2008
Friday, July 11, 2008
Subscribe to:
Posts (Atom)
