Latest Amazon Outage and Designing for Failure

Folks,

I am getting really tired or people bashing the Cloud by saying things like “Amazon’s outage in third day: debate over cloud computing’s future begins”. People need to understand that the Cloud is not some magical place you put your applications in to and never have to worry ever again. There are always a bunch of servers in racks somewhere or some power lines going to the DC and there is always a driver that can run in to a power-line pole.

You can’t just put all responsibility in to Amazon’s hands to keep your application online. All they do is provide flexible computing resources on demand in different regions and availability zones and provide 99.95% SLA at the most for their services.

Its your responsibility to design applications and systems architectures for failure, building in redundancy mostly at the application layer.

Here are some things you can do design for failure on Amazon:
1. Use Multiples Availability Zones
2. Use Multiple Regions
3. Use database replication across Availability Zones or Regions (MySQL or NoSQL)
4. Have multiple servers on all tiers and spread them across Availability Zones
5. Perform onsite and offsite backups
6. Have a DR plan and an ability to launch on a different Cloud

These are just a few. The point is you need to design your applications and systems architectures for HA in the Cloud and not assume just because you are running in the Cloud you are protected.

Cheers

This entry was posted in AWS and tagged . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *

*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>