Deusm: Detect & Recover From VM & Cloud Failures

As you virtualize more of your datacenter infrastructure and develop more cloud-based apps, understanding where the failure points are and how to recover from failures becomes essential. Knowing which VMs depend on others, and how to restart particular services in the right order, takes careful planning. Netflix approaches this problem by continually testing its large Amazon Web Services infrastructure with a tool it calls “Chaos Monkey.”
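To make the restart-ordering point concrete, here is a minimal sketch in Python of how a dependency map could be turned into a safe restart sequence. The service names and the `DEPENDENCIES` map are hypothetical, invented for illustration, and this is not how Netflix or Chaos Monkey does it. It relies on the standard library's `graphlib.TopologicalSorter` (Python 3.9+).

```python
from graphlib import TopologicalSorter

# Hypothetical dependency map (an assumption for illustration):
# each service lists the services that must be running before it starts.
DEPENDENCIES = {
    "database": [],
    "cache": [],
    "auth": ["database"],
    "api": ["database", "cache", "auth"],
    "web": ["api"],
}


def restart_order(dependencies):
    """Return services ordered so every dependency precedes its dependents.

    Raises graphlib.CycleError if the dependencies contain a cycle.
    """
    return list(TopologicalSorter(dependencies).static_order())


order = restart_order(DEPENDENCIES)
print(" -> ".join(order))
```

Several orderings can be valid (for example, `database` and `cache` may come in either order), so a recovery script should depend only on each service appearing after the services it needs.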

You can read the full post here in Develop in the Cloud.
