Design for Reliability: Information and Computer-Based Systems

Design for Reliability: Information and Computer-Based Systems book cover

Design for Reliability: Information and Computer-Based Systems

Author(s): Eric Bauer (Author)

  • Publisher: Wiley-IEEE Press
  • Publication Date: 22 Oct. 2010
  • Edition: 1st
  • Language: English
  • Print length: 348 pages
  • ISBN-10: 0470604654
  • ISBN-13: 9780470604656

Book Description

System reliability, availability and robustness are often not well understood by system architects, engineers and developers. They often don’t understand what drives customer’s availability expectations, how to frame verifiable availability/robustness requirements, how to manage and budget availability/robustness, how to methodically architect and design systems that meet robustness requirements, and so on. The book takes a very pragmatic approach of framing reliability and robustness as a functional aspect of a system so that architects, designers, developers and testers can address it as a concrete, functional attribute of a system, rather than an abstract, non-functional notion.

Editorial Reviews

Review

“Thus, I highly recommend this book to undergraduate students and junior researchers entering the reliability studies field. Though experts may not find the book to be very interesting, they will likely find it useful as a basis for lecturing, and as a good source of insightful, fundamental ideas.” (Computing Reviews, 16 May 2011)

“The book takes a very pragmatic approach of framing reliability and robustness as a functional aspect of a system so that architects, designers, developers and testers can address it as a concrete, functional attribute of a system, rather than an abstract, non-functional notion.” (Forums Digital Media Net, 16 March 2011)

From the Inside Flap

Techniques for developing reliable, robust networked systems that meet customers’ expectations

Today’s customer expects valid service requests or transactions to be reliably executed with acceptable quality. Design for Reliability brings together the analysis, design, and system implementation principles necessary to build highly available, reliable systems. It fills the knowledge gap in this area, explaining techniques for framing verifiable availability/reliability requirements and methodically designing, analyzing, and testing systems to meet those requirements.

This book takes a very pragmatic approach of framing reliability and robustness as concrete, functional attributes of a system, rather than abstract, non-functional notions. It is divided into three sections:

  • Reliability Basics—frames the elements of a typical system; defines eight broad categories of errors that can produce critical system failures; and explains the failure recovery process

  • Reliability Concepts—covers concepts for failure containment and recovery; reviews techniques that complement failure containment and redundancy to improve system reliability; outlines error detection and failure recovery mechanisms; provides design basics for reliable procedures; and offers information to help enterprises deploy robust operational policies to maximize highly available system operation

  • Design for Reliability—reviews reliability requirements and analysis techniques; demonstrates downtime budgeting and modeling to assess the feasibility of meeting a system’s service availability requirement; covers strategy and planning of robustness and stability testing; shows how field outage events can be analyzed to drive reliability improvements; and explains how to construct a reliability road map to methodically drive a system to achieve the ultimate service availability on a desired schedule

A case study of design for reliability diligence of a networked system is then presented to illustrate appropriate considerations for developing a high-availability, high-reliability system. System architects, engineers, developers, testers, and project and product managers will rely on Design for Reliability to understand how all the key elements fit into the overall system design lifecycle in order to produce robust systems that achieve customers’ expectations for service reliability and service availability. Quality professionals for products with high-availability expectations will also find this book useful in understanding what it takes to design and deploy robust systems.

From the Back Cover

Techniques for developing reliable, robust networked systems that meet customers’ expectations

Today’s customer expects valid service requests or transactions to be reliably executed with acceptable quality. Design for Reliability brings together the analysis, design, and system implementation principles necessary to build highly available, reliable systems. It fills the knowledge gap in this area, explaining techniques for framing verifiable availability/reliability requirements and methodically designing, analyzing, and testing systems to meet those requirements.

This book takes a very pragmatic approach of framing reliability and robustness as concrete, functional attributes of a system, rather than abstract, non-functional notions. It is divided into three sections:

  • Reliability Basics―frames the elements of a typical system; defines eight broad categories of errors that can produce critical system failures; and explains the failure recovery process

  • Reliability Concepts―covers concepts for failure containment and recovery; reviews techniques that complement failure containment and redundancy to improve system reliability; outlines error detection and failure recovery mechanisms; provides design basics for reliable procedures; and offers information to help enterprises deploy robust operational policies to maximize highly available system operation

  • Design for Reliability―reviews reliability requirements and analysis techniques; demonstrates downtime budgeting and modeling to assess the feasibility of meeting a system’s service availability requirement; covers strategy and planning of robustness and stability testing; shows how field outage events can be analyzed to drive reliability improvements; and explains how to construct a reliability road map to methodically drive a system to achieve the ultimate service availability on a desired schedule

A case study of design for reliability diligence of a networked system is then presented to illustrate appropriate considerations for developing a high-availability, high-reliability system. System architects, engineers, developers, testers, and project and product managers will rely on Design for Reliability to understand how all the key elements fit into the overall system design lifecycle in order to produce robust systems that achieve customers’ expectations for service reliability and service availability. Quality professionals for products with high-availability expectations will also find this book useful in understanding what it takes to design and deploy robust systems.

About the Author

ERIC BAUER is Reliability Engineering Manager in the Wireline Division of Alcatel-Lucent. After two decades of software development experience, he joined the Lucent reliability team to lead a reliability group, and has since worked reliability engineering on a variety of wireless and wireline products and solutions. Mr. Bauer currently focuses on increasing the reliability of Alcatel-Lucent’s IP Multimedia Subsystem (IMS) solution and the network elements that comprise the IMS solution. He has been awarded twelve U.S. patents, coauthored Practical System Reliability (Wiley), and has published several papers in the Bell Labs Technical Journal.

View on Amazon

电子书代发PDF格式价格30我要求助
未经允许不得转载:Wow! eBook » Design for Reliability: Information and Computer-Based Systems