Select your font size 
 
about us products & services consulting & support news & events contact us
Availability is easier to manage than uptime, because uptime has to do with one computer, which might fail, whereas availability has to do with the service, which might be spread across many computers, networks, and power grids.

High Availability vs. High Uptime - New Brunswick

print this article 
 

Achieving high uptime is a very noble goal. To that purpose, Transparen tends to purchase high-end server hardware that allows us to do things like insert new hot-plug ServeRaid cards and initialize new  SCSI disk enclosures without shutting down the servers or re-initializing the operating systems. In addition, it is why Transparen cares about ensuring a consistent Internet connection, and high-availability hydro-electric power for its servers. But despite all of these things, if uptime is the only factor managed, it is difficult to maintain more than 99% uptime under normal circumstances, and when failures occur, they can take a long time to resolve (sometimes days, not just hours or minutes).

Factors Resulting in High Uptime

Naturally, we believe in following industry best practices where it comes to maintaining high uptime, which include:

  • Using good hardware
  • Ensuring that electric power is highly available
  • Using Uninterruptible Power Supplies (UPS) to prevent short power interruptions from limiting uptime
  • Having redundant parts in the servers, including extra hard drives (RAID configurations), extra network cards, extra power supplies, etc.
  • Practising a conservative approach to software changes - making high-risk changes only when absolutely necessary, while taking regular actions to ensure that stability improvements are implemented promptly.

Despite Best Practices, Good Hardware, and Ideal Environment, Server Uptime is Limited By Single Points of Failure

Despite all these practices, a solitary machine, even with redundant parts, may still fail, because not all of its parts are redundant, and there are still things that can happen that will limit uptime. For instance:

  • More often than one might realize, a localized power outage may occur - one which may not affect a whole building, but which may affect the server. The most common example is a power breaker may flip, or a power cord may be unplugged.
  • A network connection may be severed. This could happen in many ways - the simplest is that the ethernet cable can become unreliable and wiggle slightly free, either on the router or the server. But there could be other ways, including router failures, fried ethernet cards, internet provider problems, etc.
  • The RAID array may collapse. Even though RAID provides hard drive redundancy, there are still parts of the RAID array that can fail and take the whole thing down. These include:
    • The RAID card (or SCSI/SATA/IDE card, if implementing a software RAID)
    • The backplane (i.e. what all the drives plug into)
    • The cabling between the RAID card and the backplane
    • Catastrophic hard drive failures (i.e. multiple hard drive failures beyond the redundancy provided by the RAID configuration)
  • Memory might be defective - Server memory is usually provisioned with error correction codes (ECC), but these may still fail under certain circumstances.
  • Power supplies might go out of commission and require replacement. If the power supply redundancy is not sufficient, then the machine may need to be shut down, although it may be possible to replace parts without necessitating shutdown.

In other words, there are too many points of failure, and therefore the odds are stacked against keeping a single server up for years and years.

Availability is Not Limited By Uptime

But even as individual servers may need to be taken down for maintenance from time to time, either voluntarily, or involuntarily - this does not mean that the 'system' cannot remain available. During such times, the goal is rather to allow the system to continue to operate, only perhaps not as powerfully as when all servers are up. In other words, ideally, if a server goes down, the system should operate a tiny bit slower than normal, but continue to operate. This way, services can be provided continuously, despite hardware problems that occasionally arise.

The benefit is that availability is compromized only when all nodes fail. If each node has a 1% chance of being down on a particular day, then the chance that the whole system will go down on that day is 1%^n + x, where n is the number of nodes, and x is the chance that the clustering solution is configured wrong or has some bug. With 3 nodes, the chance of having a catastrophic failure on one day is then 0.01 % + x, where, due to the nature of the software written for high availability, and the people who are interested in configuring it, x is a very very small number.

Many Single Points of Failure Eliminated

By employing redundant servers configured for high availability, we can eliminate several points of failure:

  • Multiple Internet providers can be used, so if one fails, the other may still work
  • Multiple locations - if power goes down in one place, a server in another placeis likely to still have power and an Internet connection, and be able to take over as a primary server.
  • Multiple servers - if a server (or node) becomes disfunctional, others stand ready to take its place
  • Multiple DNS servers on different IP addresses - if one goes down, the others take over. Raw DNS can be used to provide a kind of load balancing - each time a web browser looks up a web server address, it receives a list of IP addresses (in random order). The web browser tries the servers one by one until one works. In the event of a downed server, the user would experience a slowdown, but not a service disruption.

High Availability is Not An Excuse to Not Do Backups

Just because the system is engineered to never go down, does not mean that the system administrators can rest assured that it will never happen. Even if it is extremely unlikely, it is only a matter of when, not if, a catastrophic failure will occur.... And due to the complexity of the system, when the failure occurs, some pretty damned good backups will be needed to effect a timely restoration.

Most Recent Website and Regional Updates

 Transparen Toronto Office Locations
Addresses of Transparen Corporation offices in Toronto, Ontario.

 
 High Scalability - Large Systems Optimization
Transparen Corporation lends its expertise to clients experiencing rapid and sudden growth in traffic or server utilization, bottlenecks, systems instability, downtime during peak traffic, or which would like to plan to avoid such issues.

 
 Throughput (or Bandwidth) vs. Latency
This document uses the example of Bill Gates purchasing Google to explain the difference between bandwidth (or throughput) and latency.

 
 Emergency Management Services
The prototypical emergency involves a shutdown of essential services for a finite period of time. What will your organization do when a world-wide financial crisis strikes?

 
 Fast RAID Server Data Recovery Service
Transparen's Vancouver International Response Team provides the option in Canada and USA to get a raid server back running in hours - eliminating costly waiting associated with typical RAID recoveries.

 
 Data Recovery Service
Have you deleted a mission critical file? Accidentally dropped a computer, or formatted a hard drive? No recent backup? Mistakes can happen, but the data might still be there.

 
 About Transparen
Transparen is committed to serving its clients.

 
 Research Tools
Measure human resource allocation and collect data with the goal of determining patterns that will bring forward actionable insights which may lead to policy changes, saving money and improving quality of service.

 
 Process Evaluation Questions
Questions to help focus discussion about process improvement

 
 Operations Research
Operations Research (frequently called OR), is the methodical study of how to do things better. It is also called Optimization Theory.

 
 R. c. Corbin, 2008 NBCP 52 (CanLII)

 
 R. v. Webb, 2008 NBPC 51 (CanLII)

 
 R. v. Seeglitz, 2008 NBPC 50 (CanLII)

 
 R. v. Goulette, 2008 NBPC 48 (CanLII)

 
 A Death in the Family - Documentary
Today on the podcast, the story of Paul Johnson and Bill Mullins-Johnson, two brothers from Sault Saint Marie, Ontario whose lives were torn apart after the murder of Paul's four-year-old daughter ... a crime that turned the two men against each other even though neither of them had committed it.

 
 06/01/2009: The Threatening Sea
Today on the podcast, we continue our Watershed series with a trip to Vanuatu, a nation of 83 islands in the South Pacific that is slowly but surely sinking into the sea.

 
 05/01/2009: Australia Drought
Dispatches from The Big Dry. Current producer Kathleen Goldhar brings us a report from Australia's enduring drought and the economy it's spawned, where rainless communities unravel, only the adaptable prosper and water is the new gold standard.

 
 02/02/2009: Economy Panel - 2009 Forecast
With the annus horibilis of 2008 in the rear view mirror, and 2009 lying in the wait, The Current organized an economy panel to give us their forecast for the new year.

 
 31/12/2008: Looking at Israel
Israel is a country where history is never really past, and where politics leeches into all quarters of society. No historian is merely an academic or a chronicler of the times. What he or she writes, in some cases, becomes the starting point of painful and contentious self-examiniation. In this podcast you will hear from a controvesial Israel professor and an author and intellectual counterpart to our first guest.

 
 30/12/2008: Gaza Witnesses
For some perpective about how Israel's latest military campaign is affecting ordinary Gazan citizens Tom Harrington was joined by two guests to discuss what they have been witnessing. Their stories and more about can be heard in this podcast.

 
 29/12/2008: Year-End Political Panel
It's been a year to remember, even if a lot of people would rather forget it ... elections and rumours of elections ... a tenuous coalition and an empty house of commons ... far-off wars and fears of financial Armageddon at home. There's still a couple of days left in it, but 2008 is marching into the history books, and so it's time for a post-mortem on a year that kept political watchers busy. In today's podcsat, you'll be hearing the thoughts of those on our year-end political panel for 2008.

 
 24/12/2008: Helping the Homeless
For Stephen Hwang, the term "help the homeless" has taken on deep meaning. The son of Chinese immigrants, he was born in the U.S. and studied medicine at the country's finest schools. He faced a bright career in research there. But he turned his back on that, and chose instead to move to Canada, and dedicate his work to studying and helping the homeless. We hear his story in today's podcast.

 

Google
 
Web transparen.com

Contact Information

Related Information

Avoidance of Magic - Informal Survey Results
Joe the IT Director phones up high-traffic websites to ask them if they used magic.
High Scalability - Large Systems Optimization
Transparen Corporation lends its expertise to clients experiencing rapid and sudden growth in traffic or server utilization, bottlenecks, systems instability, downtime during peak traffic, or which would like to plan to avoid such issues.
Throughput (or Bandwidth) vs. Latency
This document uses the example of Bill Gates purchasing Google to explain the difference between bandwidth (or throughput) and latency.
Fast RAID Server Data Recovery Service
Transparen's Vancouver International Response Team provides the option in Canada and USA to get a raid server back running in hours - eliminating costly waiting associated with typical RAID recoveries.
Data Recovery Service
Have you deleted a mission critical file? Accidentally dropped a computer, or formatted a hard drive? No recent backup? Mistakes can happen, but the data might still be there.
   
 
E C M | © 2003-2007 Transparen Corp.      

Standardized Services: Data Recovery Service / Creative Services / Premium Web Hosting Services / System Administration Tech Support Services
Recent Projects: Full-Service Mortgage and Financing Company / System to manage flights from Vancouver to Tofino / Photo exchange verification service
Our Vancouver BC Server Proudly Hosts: automated parking and revenue control systems, leafside lane at southlands, cost effective alternative power sources, Higher Grade Learning Centres, pacific forage bag supply, sunburst medical, neosonic design, roger mahler photography - passionate, intriguing, desirable, the connection between east and west, affordable flights to victoria and tofino, low interest mortgage brokers in vancouver, richmond, surrey, toronto, Toronto Calgary and Vancouver IT staffing and talent search
* Alma * Aroostook * Atholville * Baker Brook * Balmoral * Bas Caraquet * Bath * Bathurst * Belledune * Beresford * Bertrand * Blacks Harbour * Blackville * Bouctouche * Bristol * Cambridge-Narrows * Campbellton * Canterbury * Cap Pélé * Caraquet * Centreville * Charlo * Chipman * Clair * Dalhousie * Dieppe * Doaktown * Dorchester * Drummond * Edmundston * Eel River Crossing * Florenceville * Fredericton * Fredericton Junction * Gagetown * Grand Bay-Westfield * Grand Falls * Grand Manan * Grande-Anse * Hampton * Hartland * Harvey * Hillsborough * Kedgwick * Lac Baker * Lameque * Le Goulet * Maisonnette * McAdam * Meductic * Memramcook * Millville * Minto * Miramichi * Moncton * Nackawic * Neguac * New Maryland * Nigadoo * Norton * Oromocto * Paquetville * Perth-Andover * Petitcodiac * Petit Rocher * Plaster Rock * Pointe-Verte * Port Elgin * Quispamsis * Rexton * Richibucto * Riverside-Albert * Riverview * Riviere-Verte * Rogersville * Rothesay * Sackville * Saint-Andre * Saint-Antoine * Saint-François-de-Madawaska * Saint-Hilaire * Saint-Isidore * Saint John * Saint-Leolin * Saint-Leonard * Saint-Louis-de-Kent * Saint-Quentin * Sainte-Anne-de-Madawaska * Sainte-Marie - Sainte-Raphael * Salisbury * Shediac * Shippagan * Stanley * St. Andrews * St. George * St. Martins * St. Stephen * Sussex * Sussex Corner * Tide Head * Tracadie-Sheila * Tracy * Woodstock * Aberdeen * Aboujagane * Acadie * Acadie Siding * Acadieville * Adams Gulch * Adamsville * Addington * Albert Mines * Albrights Corner * Alcida * Alderwood * Aldouane * Allainville * Allardville * Allison * Ammon * Anagance * Anderson Road * Anderson Settlement * Andersonville * Anfield * Anse-Bleue * Apohaqui * Arbeau Settlement * Armond * Arthurette * Ashland * Astle * Aulac * Avondale * Back Bay * Baie-Ste-Anne * Baie Verte * Barryville * Bartibog Bridge * Bates Settlement * Bay du Vin * Bayside * Beaubassin East * Beaverbrook * Beaver Dam * Bellefleur * Benjamin River * Berwick * Bettsburg * Big Hole * Big River * Black River * Black River Bridge * Blair Athol * Blissfield * Blissville * Bloomfield * Bloomfield Ridge * Boiestown * Bocabec * Brantville * Brockway * Browns Flat * Bull Lake * Burnsville * Burnt Church * Burton * Burtts Corner * Cains River * Campbell Settlement * Campobello Island * Canton des Basques * Caron Brook * Carrolls Crossing * Casillis * Caverhill * Chamcook * Chatham * Chatham Head * Chelmsford * Clarkville * Cloverdale * Cocagne * Coles Island * Collette * Connors * Cornhill * Coteau Road * Dalhousie Junction * Daulnay * Dawsonville * Debec * Deer Island * Derby * Devereaux * Douglas * Douglastown * Dugas * Duguayville * Dumfries * Dundee * Dunlop * Durham Bridge * Eel Ground * Elgin * Escuminac * Evandale * Évangéline * Fairisle * Fairvale * Five Fingers * Flatlands * Four Falls * Gauvreau * Geary * Glassville * Glencoe * Glen Levit * Glenwood * Gondola Point * Grafton * Grande-Digue * Gravel Hill * Gray Rapids * Hainesville * Hampstead * Hanwell * Hardwicke * Hartfield * Hatfield Point * Haut-Lamèque * Haut-Sheila * Havelock * Hawkshaw * Hazeldean * Head of Millstream * Health Steele * Hebron * Honeydale * Howard * Hoyt * Inkerman * Jacquet River * Janeville * Jemseg * Johnsville * Juniper * Keswick Ridge * Kingsclear * Kingston * Kouchibouguac * Lagacéville * Lake George * Lakeville * LaPlante * Lavillette * Lawrence Station * Limestone * Loggieville * Lorne * Losier Settlement * Lower Newcastle * Ludlow * Mactaquac * Madran * Magaguadavic Settlement * Magundy * Maltampec * Maple Ridge * Marysville * Maugerville * McGivney * McGraw Brook * McLeods * McNamee * Menneval * Millerton * Miramichi Bay * Miscou Island * Moulin-Morneault * Napadogan * Napan * Nash Creek * Nashwaak Bridge * Nashwaak Village * Nasonworth * Nauwigewauk * Nelson * Nelson Hollow * New Bandon * New Denmark * New Jersey * New Mills * Nicolas-Denys * Noonan * Nordin * North Head * North Tetagouche * Northampton * Notre-Dame * Notre-Dame-de-Lourdes * Notre-Dame-des-Érables * Oak Bay * Odell * Otis * Oxbow * Pabineau Falls * Parker Ridge * Pembroke * Penniac * Penobsquis * Petite-Lamèque * Petite-Rivière-de-l'Ile * Petite-Tracadie * Pigeon Hill * Pinder * Pocologan * Point La Nim * Pointe-à-Bouleau * Pointe-Alexandre * Pointe-Canot * Pointe-Sapin * Pokemouche * Pokeshaw * Pokesudie * Pokiok * Pont-Lafrance * Pont-Landry * Porten * Priceville * Prince William * Quarryville * Queensbury * Quisbis * Red Bank * Renforth * Renous * Richibouctou-Village * Riley Brook * Ripples * Riviere-du-Portage * Robertville * Robinsonville * Rosaireville * Rossville * Rough Waters * Saint-Arthur * Saint-Basile * Saint-Charles * Saint-Ignace * Saint-Irenée * Saint-Jacques * Saint-Jean-Baptiste-de-Restigouche * Saint-Joseph-de-Madawaska * Saint-Laurent * Saint-Martin-de-Restigouche * Saint-Maure * Saint-Sauveur * Saint-Simon * Sainte-Louise * Sainte-Marie-de-Kent * Sainte-Rose * Salmon Beach * Saumarez * Scotch Lake * Seal Cove * Sevogle * Shannonvale * Sheffield * Shemogue * Siegas * Sillikers * Sisson Ridge * Southampton * South Tetagouche * Springfield * Squaw Cap * St. Margarets * Stickney * Strathadam * Stonehaven * Sunny Corner * Tabusintac * Targettville * Taxis River * Taymouth * Temperance Vale * Tetagouche Falls * Tilley * Tracadie Beach * Tremblay * Upper Blackville * Upper Kent * Upper Queensbury * Upsalquitch * Val-Comeau * Val-d'Amour * Val-Doucet * Verret * Village-Blanchard * Village-Saint-Laurent * Waterville * Wayerton * Weaver Siding * Welsford * Westfield * White Rapids * Whites Cove * Wicklow * Williamstown * Wilsons Beach * Wirral * Zealand * Aberdeen * Acadieville * Addington * Allardville * Alma * Alnwick * Andover * Baker Brook * Balmoral * Bathurst * Beresford * Blackville * Blissfield * Blissville * Botsford * Bright * Brighton * Brunswick * Burton * Cambridge * Campobello * Canning * Canterbury * Caraquet * Cardwell * Carleton * Chatham * Chipman * Clair * Clarendon * Colborne * Coverdale * Dalhousie * Denmark * Derby * Dorchester * Douglas * Drummond * Dufferin * Dumbarton * Dumfries * Dundas * Durham * Eldon * Elgin * Gagetown * Gladstone * Glenelg * Gordon * Grand Falls * Grand Manan * Greenwich * Grimmer * Hammond * Hampstead * Hampton * Harcourt * Hardwicke * Harvey * Havelock * Hillsborough * Hopewell * Huskisson * Inkerman * Johnston * Kars * Kent * Kingsclear * Kingston * Lac-Baker * Lepreau * Lincoln * Lorne * Ludlow * Madawaska * Manners Sutton * Maugerville * McAdam * Moncton * Musquash * Nelson * New Bandon * New Maryland * Newcastle * North Lake * Northampton * Northesk * Northfield * Norton * Notre-Dame-de-Loudres * Paquetville * Peel * Pennfield * Perth * Petersville * Prince William * Queensbury * Richibucto * Richmond * Rivière-Verte * Rogersville * Rothesay * Sackville * Saint Andrews * Saint Croix * Saint David * Saint George * Saint James * Saint Martins * Saint Mary * Saint Marys * Saint Patrick * Saint Stephen * Saint-André * Saint-Basile * Saint-Charles * Saint-François * Saint-Hiliare * Saint-Isidore * Saint-Jacques * Saint-Joseph * Saint-Louis * Saint-Léonard * Saint-Paul * Saint-Quentin * Sainte-Anne * Salisbury * Saumarez * Shediac * Sheffield * Shippagan * Simonds, Carleton County * Simonds, Saint John County * Southampton * Southesk * Springfield * Stanley * Studholm * Sussex * Upham * Wakefield * Waterborough * Waterford * Weldford * Wellington * West Isles * Westfield * Westmorland * Wickham * Wicklow * Wilmot * Woodstock