Windows IT Pro is the authoritative and independent resource for windows nt, windows 2000, windows 2003, windows xp. Features a collection of resources and magazines for windows IT professionals.
  
  
  Advanced Search 


November 1999

Keeping Disasters Away


RSS
Subscribe to Windows IT Pro | See More Reliability Articles Here | Reprints | Or get the Monthly Online Pass—only $5.95 a month!

Microsoft's guide to high reliability

In early June, Microsoft released a 105-page white paper, "Microsoft Windows NT High Availability Operations Guide: Implementing Systems for Reliability and Availability" (http://www.microsoft.com/ ntserver/nts/deployment/planguide/highavail.asp). Microsoft based this document on the company's examination of nine customers who keep their IT enterprises running most of the time. The study's purpose was to determine which practices the companies use to make their systems highly reliable (i.e., to achieve n nines of reliability—99.9 percent, 99.99 percent, 99.999 percent, and so on) and disseminate that information so that others can implement similar measures and build more reliable networks.

Microsoft's white paper discusses how customers use operational procedures to achieve high reliability. Microsoft used the following problem-oriented classifications to categorize each customer's operational procedures: Planning and Design, Operations, Monitoring and Analysis, Help Desk, Recovery, and Root Cause Analysis.

The Planning and Design sections suggest detailed plans for reliability (how many nines do you want?), as well as service level agreements (SLAs) and standardized hardware and software. Interestingly, the document doesn't specify that you should use Microsoft software.

In the Operations sections, Microsoft covers basic principles. For example, you need to automate as many tasks as possible, and you need to pay attention to physical security—which is nonexistent at many sites.

The document's Monitoring and Analysis sections advise users to use a monitoring tool and to regularly review the data that the tool produces. This advice is sound because before-and-after data is important. Unfortunately, most clients who ask me to solve an NT problem didn't turn on Performance Monitor until after the problem appeared.

The Help Desk sections describe commonsense practices for running Help desk operations. For example, you need to train Help desk staff adequately, create well-defined escalation procedures, and ensure that the staff can access problem servers quickly.

In the Recovery sections, Microsoft offers more elementary but necessary advice. The company recommends that you maintain a supply of spare parts and replacement servers that are ready to install, use a cloning process to bring new systems online quickly, and establish and record recovery procedures. This last point is important because you need hard-copy access to your recovery procedures: I had a client who put the only copy of his disaster-recovery plan on his intranet in HTML, with no printed copies available. Although Microsoft doesn't suggest that you test your disaster-recovery plan, I recommend that you do. Untested recovery plans are little more than highly organized prayers.

The white paper's Root Cause Analysis sections suggest that your network's long-term health will be rosiest if you pay attention to root causes instead of merely treating symptoms. Although the document notes that all nine customers think root-cause analysis is important, only two of them perform this analysis. I find this information comforting—I'm pleased to know that the folks who run those annoyingly reliable networks are also human.

The Microsoft document doesn't offer many new ideas, but it's an interesting refresher and might be a decent learning tool for a novice network administrator. Oddly, although the paper's audience is NT administrators, four of the nine companies in Microsoft's case studies don't use NT for daily operations. The white paper explains that "because many basic best practices do not depend upon a specific operating system—for example, you can use similar backup procedures for both a UNIX-based server and a Windows NT-based server—Microsoft did not exclusively confine this study to environments that used only Windows NT-based systems." I'm disheartened that Microsoft didn't—or couldn't—find nine companies that know how to make NT highly reliable.

End of Article



Reader Comments

You must log on before posting a comment.

If you don't have a username & password, please register now.




Top Viewed ArticlesView all articles
Microsoft Kills OneCare, Will Launch Free Security Solution

Microsoft on Tuesday announced that it would retire its $50-a-year security subscription product, Windows Live OneCare, and replace it with a free solution codenamed "Morro." Unlike OneCare, however, Morro will focus only on core anti-malware features and ...

The website is down because someone removed the X-Box

What happens when a manager mistakes a server for a games console. ...

Command Prompt Tricks

One reader shares his tip for setting up the command prompt to reflect a remote path. ...


Windows OSs Whitepapers Why SaaS is the Right Solution for Log Management

Related Events Check out our list of Free Email Newsletters!

Windows OSs eBooks Understanding and Leveraging Code Signing Technologies

A Guide to Windows Certification and Public Keys

SQL Server Administration for Oracle DBAs

Related Windows OSs Resources Become a VIP member of the Windows IT Pro community!
Get it all with the VIP CD and VIP access. A $500+ value for only $279!

Subscribe to Windows IT Pro!
Solve your toughest technical problems with our experts and access 10,000 + articles online. 30% off

Monthly Online Pass - Only $5.95!
Get instant access to 10,000+ articles from Windows IT Pro Magazine!

TechNet Virtual Labs
Evaluate and test Microsoft's newest products.


Windows IT Pro Home Register FAQ for Windows WinInfo News
Europe Edition About Us Contact Us/Customer Service Media Kit Affiliates / Licensing  
SQL Server Magazine Office & SharePoint Pro Windows Dev Pro IT Job Hound ITTV
IT Library Technology Resource Directory Connected Home Windows Excavator Windows SuperSite 
 
 Windows IT Pro is a Division of Penton Media Inc.
 Copyright © 2008 Penton Media, Inc., All rights reserved. Terms and Use | Privacy Statement | Reprints and Licensing