Trillium DFT/HA Reliability

  • Accelerate time to market
  • Reduce development costs
  • Reduce project risk of internally developed applications

Now more than ever, telecommunication platforms must be robust enough to provide "five nines" (99.999%) availability. A critical requirement in telecom operations, five nines means that a user is assured practically no loss of service due to hardware or software errors, and no downtime for software upgrades or hardware maintenance. Alongside the stringent requirements of this extremely high availability, network infrastructure must also be scalable enough to accommodate rapid customer growth. Furthermore, these standards of performance are expected in both circuit- and packet-based networks, raising the competitive bar for everyone. All of these forces place unprecedented burdens on providers to ensure that every network element needed to support a service is functioning whenever a user requests it. "Always on" has taken on a very literal meaning in the networking world.
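
To make the five-nines target concrete, the short calculation below converts an availability percentage into a yearly downtime budget. It is plain arithmetic rather than anything taken from a Trillium product; the function name is illustrative, and a 365-day year is assumed.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # assumption: a 365-day year


def downtime_minutes_per_year(availability_percent: float) -> float:
    """Return the maximum downtime, in minutes per year, for a given availability."""
    unavailable_fraction = 1.0 - availability_percent / 100.0
    return MINUTES_PER_YEAR * unavailable_fraction


if __name__ == "__main__":
    for target in (99.9, 99.99, 99.999):
        minutes = downtime_minutes_per_year(target)
        print(f"{target}% availability allows {minutes:.2f} minutes of downtime per year")
```

Under these assumptions, five nines leaves a little over five minutes of total downtime per year, which has to cover failures, upgrades, and maintenance combined.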

Telecom infrastructure solutions from Radisys relieve this pressure and make network design and deployment much more manageable. The company's Trillium business unit delivers innovative protocol software solutions for Fault-Tolerant / High Availability (FT/HA) and Distributed Fault-Tolerant / High Availability (DFT/HA). From Voice over IP (VoIP), to Wireless, to IP Multimedia Subsystem (IMS), Radisys offers a wide range of Trillium solutions ready for platform integration and deployment.

Trillium Fault-Tolerance at Work

Trillium protocol stack software from Radisys offers powerful options for FT/HA and patented DFT/HA architectures and implementations. Trillium FT/HA software solutions are based on active/standby configurations to achieve redundancy. Providing a flexible, platform-independent, cost-effective framework that maintains active connectivity during software and hardware failures, Radisys software architecture solutions are modular, allowing telecom equipment manufacturers to add components as their customers' needs evolve. Furthermore, the widely deployed Trillium solutions give system designers and engineers total freedom when choosing their hardware platform and operating system.

Besides high availability, telecom equipment manufacturers also need to meet the scalability required to support converging networks. Trillium's DFT/HA solutions meet these challenges in several ways. For example, the core software functionality allows the creation and management of Distributed Fault-Tolerant applications, Pure Distributed applications, and Pure Fault-Tolerant applications. Trillium products achieve this mix of performance, scalability, and availability by distributing the processing load across multiple processors. At the same time, DFT/HA software allows the coexistence of distributed and non-distributed protocol layers.

In simple terms, Trillium DFT/HA solutions enable system designers and engineers to replicate a node and turn the resulting set of nodes into a (D)FT/HA system. In particular, Trillium core software:

  • Distributes the protocol load onto available physical processors,
  • Dynamically re-distributes load upon processor failure and new processor introduction,
  • Retains active calls and recovers from processor failures, and
  • Allows maintenance operations to be performed without bringing the system down.
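
The distribution and re-distribution behaviors listed above can be illustrated with a small simulation. This is a minimal sketch and not the Trillium API: the `LoadDistributor` class, its methods, and the idea of numbered "load units" (for example, groups of calls) are hypothetical names introduced for illustration. It models only ownership of load units; the state replication that lets active calls survive a failure is not modeled here.

```python
class LoadDistributor:
    """Hypothetical model that spreads load units evenly over live processors."""

    def __init__(self, processors, load_units):
        self.processors = list(processors)
        self.assignment = {}  # load unit -> owning processor
        for i, unit in enumerate(load_units):
            self.assignment[unit] = self.processors[i % len(self.processors)]

    def units_per_processor(self):
        """Count the load units owned by each live processor."""
        counts = {p: 0 for p in self.processors}
        for proc in self.assignment.values():
            if proc in counts:  # units still parked on a failed processor are skipped
                counts[proc] += 1
        return counts

    def fail_processor(self, failed):
        """Move every unit owned by the failed processor to the least-loaded survivor."""
        self.processors.remove(failed)
        if not self.processors:
            raise RuntimeError("no processors left to carry the load")
        orphans = [u for u, p in self.assignment.items() if p == failed]
        for unit in orphans:
            counts = self.units_per_processor()
            target = min(self.processors, key=lambda p: counts[p])
            self.assignment[unit] = target

    def add_processor(self, new):
        """Shift units from the busiest processors to the new one until balanced."""
        self.processors.append(new)
        while True:
            counts = self.units_per_processor()
            busiest = max(self.processors, key=lambda p: counts[p])
            if counts[busiest] - counts[new] <= 1:
                break
            unit = next(u for u, p in self.assignment.items() if p == busiest)
            self.assignment[unit] = new


if __name__ == "__main__":
    dist = LoadDistributor(["P1", "P2", "P3"], range(12))
    print(dist.units_per_processor())
    dist.fail_processor("P2")
    print(dist.units_per_processor())
    dist.add_processor("P4")
    print(dist.units_per_processor())
```

In this toy model, every load unit keeps an owner across both the failure and the addition, which is the property the list describes as re-distribution without bringing the system down.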

Sample Solution: FT/HA and DFT/HA for an HLR

An FT/HA open service platform can be used for a Home Location Register (HLR), which accesses database information for wireless user profiles and features. Trillium's FT/HA Transaction Capabilities Application Part (TCAP) stack within the HLR enables the standby subsystem to maintain state information through state updates from the active subsystem. These updates prevent the loss of state information, enabling an orderly automatic or manual switchover procedure from the failed subsystem to the standby subsystem.
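
The active-to-standby update flow described above might be sketched as follows. This is an illustrative model only: `Subsystem`, `RedundantPair`, and the transaction dictionary are hypothetical and do not reflect the actual Trillium TCAP or PSF interfaces. The active side mirrors each state change to the standby, so a switchover finds the standby already holding the open transactions.

```python
class Subsystem:
    """Hypothetical protocol subsystem holding per-transaction state."""

    def __init__(self, name):
        self.name = name
        self.transactions = {}  # transaction id -> state label


class RedundantPair:
    """Hypothetical active/standby pair kept in step by state updates."""

    def __init__(self, active, standby):
        self.active = active
        self.standby = standby

    def update(self, txn_id, state):
        """Apply a state change on the active side and mirror it to the standby."""
        self.active.transactions[txn_id] = state
        self.standby.transactions[txn_id] = state  # stands in for a state update message

    def close(self, txn_id):
        """Remove a finished transaction from both sides."""
        self.active.transactions.pop(txn_id, None)
        self.standby.transactions.pop(txn_id, None)

    def switchover(self):
        """Swap roles; the same step serves automatic and manual switchover here."""
        self.active, self.standby = self.standby, self.active
        return self.active


if __name__ == "__main__":
    pair = RedundantPair(Subsystem("A"), Subsystem("B"))
    pair.update(1, "open")
    pair.update(2, "awaiting-response")
    pair.close(1)
    new_active = pair.switchover()
    print(new_active.name, new_active.transactions)
```

A real implementation would send updates asynchronously and would have to decide which states are stable enough to replicate; the sketch copies every change synchronously for simplicity.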

Likewise, network element designers can use the Trillium DFT/HA TCAP stack to create an active/standby, dual-node architecture as well as a scalable, multi-node architecture. In the multi-node case, the number of active and standby nodes can be configured, and the active nodes share the system load. In this way, different processors can be active and standby for each other, providing an N+M configuration in addition to 2N and N+1. For other system types, such as a Signaling Gateway (SG), the multi-node approach using the Trillium DFT/HA software solution can provide the high level of availability required for carrier-grade deployments.
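
The N+M idea can be made concrete with a small model. The sketch below is an illustration under stated assumptions, not Trillium code: `RedundancyGroup` and its method are hypothetical, and standbys are treated as a shared pool. N+1 corresponds to a pool of one standby. A 2N system has as many standbys as actives, but pairs each standby with one specific active, which this pool model does not capture.

```python
class RedundancyGroup:
    """Hypothetical N+M group: N active nodes share a pool of M standby nodes."""

    def __init__(self, actives, standbys):
        self.actives = list(actives)
        self.standbys = list(standbys)

    def fail(self, node):
        """Handle an active-node failure; return the promoted standby, or None."""
        self.actives.remove(node)
        if not self.standbys:
            return None  # no spare left, so active capacity shrinks
        promoted = self.standbys.pop(0)
        self.actives.append(promoted)
        return promoted


if __name__ == "__main__":
    group = RedundancyGroup(["A1", "A2", "A3"], ["S1", "S2"])  # a 3+2 configuration
    for failed in ("A1", "A2", "A3"):
        promoted = group.fail(failed)
        print(f"{failed} failed, promoted {promoted}, actives now {group.actives}")
```

In this model a 3+2 group absorbs two active-node failures at full capacity, and only the third failure reduces the number of active nodes.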

Your Product...Your Application...Your Reputation

Radisys' portable software approach widens the choices for hardware platforms and operating systems and supports easy integration of applications and protocols. Ultimately, that means that telecom equipment manufacturers can focus their limited resources on adding proprietary value to the application layer to differentiate product offerings and stay competitive, while Trillium software provides the FT/HA communication layers that give service to a system user and maintain active calls during software and hardware failures.

The patented Trillium DFT/HA software architecture builds on the strengths of the FT/HA solution while enabling next-generation network performance and scalability by distributing a protocol layer across multiple processors while maintaining fault tolerance. Designed to enable a smooth migration path, it is a high-quality, carrier-class solution that fulfills the requirements of converged networks. Advantages of licensing Trillium FT/HA and DFT/HA solutions include:

  • High-performance software technology, which is targeted at all types of carrier-grade network infrastructure equipment for VoIP, wireless and IMS applications
  • Trillium Advanced Portability Architecture (TAPA) and source code solutions, which expand hardware/software platform options to reduce risk and time-to-market
  • Small footprint and dynamic memory for optimized solutions, which reduce costs
  • A broad technology roadmap, which provides flexibility, scalability, and long-term return on investment

Trillium DFT/HA Network Stack Diagram

Diagram FT/HA-DFT/HA Conventional Layer


Diagram FT/HA Architecture

Diagram DFT/HA Architecture


The Distributed Fault-Tolerant/High-Availability (DFT/HA) core software functionality allows the creation and management of:

  • Distributed Fault-Tolerant applications
  • Pure Distributed applications
  • Pure Fault-Tolerant applications

For Distributed Fault-Tolerant applications, illustrated in the figure below, the DFT/HA Core software distributes the protocol load onto available physical processors; dynamically re-distributes the load on processor failure and new processor introduction; retains active calls and recovers from processor failure; and allows maintenance operations to be performed without bringing the system down.

Distributed Application


For Pure Distributed applications, the DFT/HA Core software distributes the protocol load on available physical processors and dynamically re-distributes the load on processor failure and new processor introduction.

For Pure Fault-Tolerant applications, illustrated in the next figure, the DFT/HA Core software can be used to build an active/standby redundant system. The core software manages system operations, retains active calls on processor failure, and allows maintenance operations to be performed without bringing the system down.

Pure Fault-Tolerant Application, SS7 stack



DFT/HA Core software enables telecom equipment manufacturers of soft switches, signaling gateways, signal transfer points, service control points, media gateway controllers or other products to:

  • Accelerate time to market
  • Reduce development costs
  • Reduce project risk of internally developed DFT/HA applications

A series of technology-specific products, each called a Fault-Tolerant/High-Availability Protocol Specific Function (FT/HA PSF), is used to update the state of the Standby side from the Active side. In case of a failure, the Standby can resume operation with minimal service disruption.

Product deliverables consist of C source software, documentation, training, a warranty and technical support.

DFT/HA

  • Multiple system configuration support. The DFT architecture supports all standard DFT configurations in which a DFT protocol layer may reside. For applications that do not fall into one of the pre-defined system configurations, custom configurations may be created.
  • Automatic system initialization. The core software is capable of starting active and standby copies of protocol layers on physical processors based on the selected system configuration. The protocol layer load is equally distributed on all available processors. If a node fails and is removed from the system or a new node is introduced into the system, the protocol load is redistributed on available processors. This load re-distribution is performed without interrupting ongoing protocol operations and is completely transparent to the system user. A protocol layer's load is re-distributed to maintain its preset system configuration. The same operation may be initiated when a node is lost due to system failure.
  • Protocol layer specific configurations. The layout of one protocol layer in the system is completely de-coupled from the layout of other protocol layers in the system. It is possible to set up one protocol layer in a pure fault-tolerant configuration while having another protocol layer of the stack in an asymmetric distributed fault-tolerant configuration.
  • Multiple choices of load distribution criteria. Each protocol layer may define one or more criteria upon which the load distribution is performed.
  • Complete recovery during failure. This feature is available in both pure fault-tolerant and distributed fault-tolerant systems. When a failure occurs, failed protocol layers are able to completely recover stable state information. All protocol resources present in a stable state during the failure are maintained on the standby.
  • Application restart on processor loss. This feature is applicable to pure distributed systems. If a processor in a pure distributed system fails, applications on the failed processor may be restarted on available processors to provide service for subsequent user traffic.
  • Survive up to n-1 faults. DFT protocol layers may survive up to n-1 faults without loss of service, where n is the number of processors over which the protocol layer was distributed. With the lost application restart feature enabled, a distributed protocol layer may continue to provide full service until the last processor in the system fails.
  • User defined system operations. Advanced distributed system operations such as dynamic load balancing may be implemented using basic services provided by the core software.
  • Graceful node shutdown. The system manager provides an operation to gracefully shut down a node and an option to redistribute the protocol load onto remaining processors in the system. The load redistribution is completely transparent to the system users.
  • Maintenance operations. The system manager provides an operation to swap the states of an active and standby node. This functionality may be used to perform maintenance operations on the system without shutting it down. These operations are completely transparent to the system users and will not interrupt service provided by the system.
  • Supports software upgrades using the Rolling Upgrade feature. This feature allows upgrading to a new version of the software without taking the system offline or interrupting service.
  • Alarms for failure detection. Each core component, protocol layer and protocol layer component generates a set of alarms when the component encounters an unexpected condition. These alarms may be used by the stack manager to passively detect faults in interacting system components.
  • Statistical information. Each core component, protocol layer and protocol layer component maintains statistical information about ongoing operations. This information may be collected by the stack manager and used to determine system performance and processor loading in real time. The stack manager may also use it to perform dynamic load balancing.
  • Conforms to Trillium Advanced Portability Architecture (TAPA)
  • Benefits of licensing Trillium software from Radisys
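
The feature list above mentions that collected statistics may drive dynamic load balancing as a user-defined operation. One way a stack manager could use such statistics is sketched below. Everything here is an assumption made for illustration: the function name, the load figures expressed as fractions between 0.0 and 1.0, and the 10% threshold are hypothetical and are not part of the Trillium core software.

```python
def plan_rebalance(load_by_processor, threshold=0.10):
    """Suggest one load move from the busiest to the idlest processor.

    load_by_processor maps a processor name to its measured load (0.0 to 1.0).
    Returns (source, target, amount), or None if the spread is within threshold.
    """
    busiest = max(load_by_processor, key=load_by_processor.get)
    idlest = min(load_by_processor, key=load_by_processor.get)
    spread = load_by_processor[busiest] - load_by_processor[idlest]
    if spread <= threshold:
        return None  # already balanced closely enough
    return busiest, idlest, spread / 2  # moving half the spread equalizes the pair


if __name__ == "__main__":
    print(plan_rebalance({"P1": 0.9, "P2": 0.5, "P3": 0.3}))
    print(plan_rebalance({"P1": 0.50, "P2": 0.45}))
```

The threshold keeps the manager from shuffling load in response to small fluctuations; a production policy would likely also rate-limit moves and weigh the cost of transferring state.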

FT/HA

  • Supports protection of single-layer as well as multi-layer applications.
  • Maintains active connections during software or hardware failures.
  • Supports Controlled Switchover for operator-initiated maintenance procedures without loss of information.
  • Supports Forced Switchover when the Active side fails.
  • Supports Warm Start of the Standby and causes no delay in processing data traffic during the Warm Start operation.
  • Supports integration with the customer's fault-tolerant layers via a well-defined API.
  • Supports software upgrades using the Rolling Upgrade feature. This feature allows upgrading to a new version of the software without taking the system offline or interrupting service.
  • Conforms to Trillium Advanced Portability Architecture (TAPA)
  • Benefits of licensing Trillium software from Radisys

Trillium DFT/HA Load Distribution Functions (LDF)

  • DFT/HA - Distributed Fault-Tolerant / High Availability Core
  • GMM/SM LDF
  • ISUP LDF
  • M3UA LDF
  • MTP3 LDF
  • RANAP LDF
  • SCCP LDF
  • TCAP LDF

Trillium FT/HA Protocol-Specific Functions (PSF)

  • FT/HA - Fault-Tolerant / High Availability Core
  • Diameter PSF
  • eGTP-c PSF
  • GMM/SM PSF
  • GTP PSF
  • H.323 PSF
  • ISUP PSF
  • MTP3 PSF
  • M3UA PSF
  • MAP 3G PSF
  • MGCP & MEGACO H.248 PSF
  • S1AP PSF
  • SIP PSF
  • RANAP PSF
  • SCCP PSF
  • TCAP PSF

 

PRODUCT INTERWORKING

Trillium DFT/HA Core software works directly with:

  • Fault-Tolerant/High-Availability (FT/HA) Protocol Specific Function (PSF) to provide fault-tolerant, High Availability functionality for specific protocol layers.
  • Fault-Tolerant/High-Availability (FT/HA) Protocol Specific Function (PSF) and Load Distribution Function (LDF) to provide distributed and/or fault-tolerant, High Availability functionality for specific protocol layers.

DFT/HA Core software directly supports Trillium software products that must be distributed (pure distribution) and/or distributed and protected from failure (distributed fault-tolerant).

FT/HA Core software directly supports Trillium software products that must be protected from failure.

CONFORMANCE

Trillium DFT/HA and FT/HA Core software conforms to the following standard:

  • Trillium Advanced Portability Architecture (TAPA)

©2012 Radisys Corporation