The Hot Aisle
Fresh Thinking on IT Operations for 100,000 Industry Executives

The consequences of a data center failure can be catastrophic for your business, so unsurprisingly you will have very high expectations of availability and reliability. Unfortunately for many of us, those expectations will not be met by reality. Very often the system that is our data center facility is significantly less reliable than we would wish, and less resilient than we have been told to expect.

The first thing to look at is the general approach that engineers have taken to building highly resilient components in our data centers, and then how these components are linked together to create a resilient end-to-end system.

Resilience Approaches. Typically, high availability is achieved by providing additional redundant components or systems that can take over if a primary system fails. An example is a server with dual power supplies and dual power cords, where one supply can carry the load if the other supply or its power feed fails.

Sometimes resilience is provided on an N+1, N+2 or N+N basis. In the case of N+1, we supply a single redundant unit to back up a set of on-load units. For example, we might need three 1MW generator sets but equip the site with four, to deal with a situation where one won’t start or is in maintenance. N+2 adds two spare units, and N+N duplicates the entire set.
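The arithmetic behind these schemes can be sketched in a few lines of Python. This is only an illustration: the function name `units_required` and the way the scheme is passed as a string are assumptions made for the example, not part of any standard tool.

```python
import math


def units_required(load_kw: float, unit_kw: float, scheme: str) -> int:
    """Return how many units to install for a load under a redundancy scheme.

    scheme is a string such as "N+1", "N+2" or "N+N" (hypothetical convention).
    """
    # N is the number of on-load units needed to carry the load with no spares.
    n = math.ceil(load_kw / unit_kw)
    if scheme == "N+N":
        return 2 * n  # a complete duplicate set
    if scheme.startswith("N+"):
        return n + int(scheme[2:])  # N plus a fixed number of spares
    raise ValueError(f"unknown redundancy scheme: {scheme}")


# The generator example from the text: a 3MW load served by 1MW sets.
print(units_required(3000, 1000, "N+1"))  # 4
print(units_required(3000, 1000, "N+N"))  # 6
```

The N+1 result matches the four generator sets in the example above. The sketch ignores derating, start-up loads and other engineering margins that a real design would include.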

Application Resilience. With smart application code or underlying clustering software, simple applications can be made resilient and able to withstand an outage. Often DNS or smart network hardware can provide resilience for Web-based applications that do not need to maintain application state between requests.

Server Resilience. Server equipment is often fitted with multiple power supplies and multiple power cords (mostly two, but sometimes more on large systems). These protect against a single power stream failing if, and only if, each power cord is connected to a separate power stream. If the cords are connected to power streams that share a piece of infrastructure, that shared component is a single point of failure for the server.
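One way to check the "if and only if" condition is to write down the chain of equipment feeding each power cord and look for anything that appears in every chain. The sketch below assumes each power path is recorded as a simple list of component names; the names used (`UPS-A`, `PDU-B1` and so on) are made up for illustration.

```python
def shared_components(paths: list[list[str]]) -> set[str]:
    """Return the components that appear in every power path.

    Each path lists the equipment feeding one power cord. Anything returned
    here is a single point of failure for the server.
    """
    if not paths:
        return set()
    common = set(paths[0])
    for path in paths[1:]:
        common &= set(path)  # keep only components present in all paths so far
    return common


# Hypothetical server whose two cords both hang off the same UPS.
cord_a = ["UPS-A", "PDU-A1", "strip-A"]
cord_b = ["UPS-A", "PDU-B1", "strip-B"]
print(shared_components([cord_a, cord_b]))  # {'UPS-A'}
```

An empty result means that no single component in the recorded paths takes down both cords. It says nothing about equipment that was left out of the lists, so the check is only as good as the inventory behind it.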

Sometimes servers have additional resilience capabilities, including dual network connections with a form of IP multipathing that rides through a single network outage. Almost all server disk subsystems are equipped with disks connected in a redundant way to provide resilience in case of a disk failure.

Cabinet Resilience. Cabinets in data centers are almost always fitted with dual power strips so that dual-fed servers can easily be connected to two power feeds and keep operating through a single power stream failure. Note that it is important that neither server power supply is loaded above 50% of its capacity, as a single power failure would push the remaining supply above 100% and cause a cascade failure. Single-fed servers are superficially a cheap alternative, but they can neither withstand a power failure nor stay online while planned maintenance is carried out.

PDU Resilience. PDUs need to be configured so that at least two PDUs supply each cabinet. This means that properly connected servers in the cabinets can survive a single PDU failure. Equally, each PDU in a pair must never be loaded above 50% of its capacity, because if one unit fails the survivor has to support both loads, and an overload there causes a cascade failure.
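The 50% rule for a redundant pair, whether two server power supplies or two PDUs, comes down to one comparison: the survivor must be able to carry both loads. A minimal sketch, assuming both units have the same rated capacity and that the whole load transfers to the survivor (the function name is hypothetical):

```python
def headroom_after_failure(load_a_kw: float, load_b_kw: float,
                           capacity_kw: float) -> float:
    """Return the spare capacity on the surviving unit of a redundant pair.

    A negative result means the survivor would be overloaded, which is the
    cascade failure described in the text.
    """
    return capacity_kw - (load_a_kw + load_b_kw)


# Two hypothetical 10kW PDUs feeding the same cabinets.
print(headroom_after_failure(4.0, 4.5, 10.0))  # 1.5  -> survives a failure
print(headroom_after_failure(5.5, 5.0, 10.0))  # -0.5 -> cascade risk
```

Keeping each unit at or below 50% guarantees a non-negative result. The combined load is what actually matters, which is why an unbalanced pair can look healthy on one side and still fail the check.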

UPS Resilience. Uninterruptible Power Supplies are usually connected in an N+1 or N+N configuration. Care must be taken that no unit is loaded beyond the point where taking on the additional load caused by a UPS failure would overload it and trigger a cascade failure. For example, three 500kW units connected in N+1 could support a total load of 1MW: 333kW each in normal operation and 500kW each if one unit failed. An N+N design for the same load would require four UPS units, each carrying no more than 250kW.
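The UPS figures above can be reproduced with a short calculation. The sketch assumes identical units and that load redistributes evenly across the surviving units; `max_safe_load` and its parameter names are invented for this example.

```python
def max_safe_load(units: int, rating_kw: float,
                  failures_tolerated: int) -> tuple[float, float]:
    """Return (total load, normal per-unit load) in kW for a UPS group.

    The total is the most the group can carry while still surviving the
    stated number of unit failures without overloading the survivors.
    """
    survivors = units - failures_tolerated
    if survivors <= 0:
        raise ValueError("no units left to carry the load")
    total_kw = survivors * rating_kw       # survivors run at full rating
    return total_kw, total_kw / units      # shared evenly in normal operation


# N+1 with three 500kW units: tolerate one failure.
print(max_safe_load(3, 500, 1))  # total 1000kW, about 333kW per unit
# N+N with four 500kW units: tolerate two failures.
print(max_safe_load(4, 500, 2))  # total 1000kW, 250kW per unit
```

Both configurations support the same 1MW load. The N+N design buys tolerance of a second failure at the cost of a fourth unit and a lower normal loading on each one.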


Engine Resilience. Engine design needs to follow the same rules, with excess capacity to provide resilience through a failure as well as the capability to perform maintenance without taking the data center offline.

In the real world, with equipment being moved into and out of data center halls continuously, it is almost impossible to maintain the proper power balance across each of these components, and many sites drift into a situation where a cascade failure will occur when a single power component fails. Most sites recalculate only every few weeks or months and then, in a panic, reconfigure and rebalance their power feeds when an imbalance is detected.
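The recalculation itself is simple enough to run far more often than every few weeks. The sketch below shows the kind of check involved, under some loud assumptions: measured per-unit loads are available from somewhere, every unit in a group has the same rating, and the load of a failed unit spreads across the survivors. The `RedundantGroup` class and the sample figures are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RedundantGroup:
    name: str
    rating_kw: float        # rating of each unit in the group
    loads_kw: list[float]   # measured load on each unit


def at_risk(groups: list[RedundantGroup]) -> list[str]:
    """Return the names of groups that could not survive losing one unit."""
    flagged = []
    for group in groups:
        total_kw = sum(group.loads_kw)
        survivors = len(group.loads_kw) - 1
        # With one unit gone, the survivors must carry the whole load.
        if survivors < 1 or total_kw > survivors * group.rating_kw:
            flagged.append(group.name)
    return flagged


site = [
    RedundantGroup("UPS hall 1", 500, [300, 310, 320]),  # 930kW on 3 units
    RedundantGroup("PDU pair 7", 100, [45, 62]),         # 107kW on 2 units
]
print(at_risk(site))  # ['PDU pair 7']
```

In this made-up site the UPS group still has margin, but the PDU pair has drifted past the point where one unit could carry both loads. Run against live metering data, a check like this would flag the drift as equipment moves rather than weeks later.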

There Is 1 Response So Far. »

  1. [...] Data Center? Steve O’Donnell, former Global Head of Data Centres at British Telecom, has a new blog post on the subject of Data Center reliability where he explains “why your Data Center will fail eventually and you will be affected”. [...]
