Putting the 'D' Back into Disaster Recovery Plans
Over the past 25 years I must have reviewed over 200 Business Continuity and Disaster Recovery Plans for IT departments of all sizes and companies in all kinds of businesses. I would guess that about half of them were either fundamentally unworkable, unaligned with the economics of the business or largely unnecessary. These plans had several things in common, all of which contributed to that judgment:
I could go on. Fundamentally, however, these were IT asset protection plans - ways for IT to survive, even if the business doesn't. Which clearly makes no sense. And, in many cases, there was no way to actually test the plan in total - too complex, too expensive, too disruptive to everyone's day job. Yet most real disasters are much less well-structured than a test - so if you can't make the test work when you can plan for it in advance and stage everything just right, what chance will you have if the big one hits? One way to get a workable DR plan (there are several options) is to do some up-front scenario analysis after the BIA is done and build up a set of layered responses to incidents of increasing severity. For the least serious impacts you can engineer high availability solutions - essentially disaster avoidance strategies. For disasters you can't avoid, you can build routine operational processes (things like rolling cluster upgrades, managed application failover, deliberate load shifting) that let you practice for a real problem, so your people are familiar with most of the work they'll need to do in a disaster. That will also exercise most of the technologies you'll need and ensure they're working reliably - and that the disaster won't be their first use. One final thought: Few companies exist in isolation. They'll have connections to suppliers and service providers, maybe to customers. In a "my-DR" centric plan, you'll have ways to maintain these connections. Now, hold up the mirror: You're in everyone else's DR plans to (or you should be). So, if several of your partners are caught up in the same disaster, can your DR plan work with theirs? Can your DR site connect to their DR sites? How do you test that scenario? And let's not forget that, first and foremost, we have to protect our people. |
Comments (1)
I agree with John, the companies that have DR plans for the most part have never tested them and for the most part have never looked at the plan relative to what they are protecting from what level of disaster.
My favorite story is a company with thousands of employees and multiple sites all tied to a single data center that after a severe storm was down for a day and then installed an over-sized generator so it wouldn't happen again. The problem was that the storm flooded around the building to a level of 3 feet and the new generator was placed on a slab at ground level and would fail when the water reached one foot high which it does a few times each year.
Posted by Henry Robert | July 5, 2011 12:53 PM