Table of Contents

Table of Contents

Design Implementation Guide

Router Performance

 

Introduction

With the proliferation of Cisco router products in the past few years and the ever increasing features and functionalities available, understanding of some of the intricate interworkings of these devices is necessary to design optimal performance networks. In the early days of routers, raw packet-per-second performance was a valid concern, but with the increased improvements in processor power and memory management, the increasing performance numbers are reaching a point where perception and reality are becoming blurred. This paper focuses primarily on the realities of real network performance considerations and how the varying router platforms that Cisco provides can meet the appropriate criteria for any given network.

The network performance considerations will not address issues regarding latency of routers since in a network system as a whole, latency per switch or router has been found to be negligible when comparing to normal workstation or PC disk access speeds and lower-speed media bandwidth considerations.

It is important to note that raw performance numbers in packets per second should never be the sole criterion for choosing any product since criteria based on support responsiveness, company financials, feature enhancements, software reliability, troubleshooting capability, and a variety of other criteria factor heavily into a final product decision. Performance of the varying platforms should be understood to determine what meets the user's requirement, allowing for future growth and expansion of a user's network.

A description of how to determine what a user's performance requirements may be follows, including some sample calculations that can then be used as a guideline and extrapolated to fit into specific network designs. The traffic patterns and network applications may not be well understood in new network designs, but it is important for some investigation to be done to determine approximate traffic patterns and worst-case scenarios (not to be misconstrued as theoretical worst-case scenarios). Next, the switching paths of varying Cisco router platforms are listed and platform aggregate numbers specified to help determine the most optimal platform in a given network design scenario. The last section of the paper lists some common features that may affect switching paths and gives general guidelines for optimum network designs.

 

Realities of Network Performance Criteria

The bottom line of any given network is that it becomes a medium whereby users (the people relying on the network to do their work) accomplish their jobs without incurring any noticeable delays. Performance criteria are met when every user is satisfied in terms of network responsiveness. To ensure user satisfaction, every aspect of the network must be examined, from the media to the applications to the individual devices creating the network as a whole. This task is complex.

 

Differentiating Performance Tests versus Real Network Performance

Three areas require classification: what comprises a performance test, how does one interpret results, and then how does one compare the results to any realistic performance requirements? The more common performance tests include blasting traffic from an input port to an output port of a device. For a given device, injecting traffic through multiple input ports to multiple output ports on the same device gives aggregate performance numbers. Usually, these tests are performed on Ethernet since Ethernet-based testers were the first available. Aggregate performance numbers are media-independent, but the type of media used plays an important role in defining what the theoretical packet-per-second limitation is. Table 1 shows characteristics of some of the more common media in use today.
Table 1. Media Characteristics  
  Inter-Frame Gap  Minimum Valid Frame  Maximum Valid Frame  Bandwidth 
Ethernet  96 bits  64 Bytes  1518 Bytes  10 Mbps 
Token Ring  4 bits  32 Bytes  16K Bytes  16 Mbps 
FDDI  34 Bytes  4500 Bytes  100 Mbps 
ATM  30 Bytes (AAL5)  16K Bytes (AAL5)  155 Mbps 
BRI  24 Bytes (PPP)  1500 Bytes (PPP)  128 Kbps 
PRI  24 Bytes (PPP)  1500 Bytes (PPP)  1.472 Mbps 
T1  14 Bytes (HDLC)  None (Theoretical) 
4500 (Real) 
1.5 Mbps 
Fast Ethernet  96 bits  64 Bytes  1518 Bytes  100 Mbps 
Calculating the theoretical maximum packets per second involves all the variables listed in Table 1: interframe gap, bandwidth, and frame size. The formula to compute this number is:

Bandwidth / Packet Size = Theoretical Maximum Packets per Second (where packet size may incorporate interframe gap in bits)

Table 2 lists the theoretical packet-per-second limitations for three common media: 10-Mb Ethernet, 16-Mb Token Ring, and FDDI, each for eight different Ethernet frame sizes. These eight frame sizes, widely used in the industry, are derived from the performance testing methodology as outlined in the Internet standard for device benchmarking in RFC1242. The numbers are derived by using the formula.
Table 2. Packet-per-Second Limitation  
Ethernet Size (bytes)  10-Mb Ethernet (pps)  16-Mb Token Ring (pps)  FDDI (pps) 
64  14880  24691  152439 
128  8445  13793  85616 
256  4528  7326  45620 
512  2349  3780  23585 
768  1586  2547  15903 
1024  1197  1921  11996 
1280  961  1542  9630 
1518  812  1302  8138 
More specific detail in how the numbers in Table 2 were derived for the three media (10-Mb Ethernet, 16-Mb Token Ring, and FDDI) follow.

 

10-Mb Ethernet

The frame size needs to incorporate the data and header bytes as well as the bits used for the preamble and interframe gap, as shown in Figure 1.

Figure 1. : 10-Mb Ethernet Frames

fig_1.gif

16-Mb Token Ring

Neither token nor idles between packets are accounted for because the theoretical minima are hard to pin down, but by using only the frame format itself the maximum theoretical packets per second can be estimated, as shown in Figure 2. Note that, since we are basing our initial frame on an Ethernet frame, we need to subtract the Ethernet header bits for the correct calculation of the data portion. So, for a 64-byte Ethernet frame, we get 64 -- 18 = 46 bytes of data for the DATA portion of the Token Ring frame shown in Figure 2.

Figure 2. : 16-Mb Token Ring Frames

fig_2.gif

 

FDDI

Neither token nor idles between packets are accounted for because the theoretical minima are hard to pin down, but by using only the frame format itself the maximum theoretical packets per second can be estimated, as shown in Figure 3. Note that, since we are basing our initial frame on an Ethernet frame, we need to subtract the Ethernet header bits for the correct calculation of the data portion. So, for a 64-byte Ethernet frame, we get 64 -- 18 = 46 bytes of data for the DATA portion of the FDDI frame shown in Figure 3.

Figure 3. : FDDI Frames

fig_3.gif

The packet size is a major factor in determining the maximum packets per second, and in the theoretical test world, one packet size at a time is tested. Eight standard packet sizes are tested: 64-, 128-, 256-, 512-, 768-, 1024-, 1280-, and 1518-byte packets. Figure 4 shows a graph of the theoretical maximum packets per second for 10-Mbps Ethernet.

It is important to note that as the frame size increases the maximum theoretical packets per second decrease.

Figure 4. : 10-Mb Ethernet Theoretical Performance

fig_4.gif

Having seen how maximum theoretical performance is determined, we now see how that data fits in with the performance requirements of real user networks. Each medium has a specific fixed-size bandwidth pipe associated with it, and each one may or may not define a minimum and maximum valid frame size. The minimum and maximum frame sizes are important because most good applications written for workstations or PCs make efficient use of bandwidth available and use maximum-sized frames. The smaller the frame size, the higher the percentage of overhead relative to user data; in other words, smaller frame sizes mean less effective bandwidth utilization. (See Figure 5.)

Figure 5. : Bandwidth Efficiency for Small versus Large Frames

fig_5.gif

An understanding of real traffic patterns is important when designing networks. At least some typical applications should be known so that the average packet sizes on the network can be determined. Sniffer traces to look at typical packet sizes for varying applications are helpful; some of the more common ones include:

For optimal network designs, an understanding of the kinds of applications that will be used is necessary to determine the typical packet sizes that will be traversing your network. The following example, taken from a real network, shows how to optimize your network design.

 

Example

We consider a very simple network, depicted in Figure 6

Figure 6. : Sample Network

fig_6.gif

The network consists of six Ethernets that are interconnected via an FDDI backbone. Router A interconnects the Ethernet networks to the FDDI backbone. For simplicity, we assume that all the Ethernets have traffic characteristics similar to those shown in Figure 7.

Figure 7. : Graph of Typical Ethernet Network

fig_7.gif

Most of the traffic falls between 256-byte and 1280-byte packets, with numerous 64-byte packets that are typically acknowledgment packets. Our calculations assume that the Ethernet network is fairly busy with average utilization at 40 percent; in other words, 4 Mbps of Ethernet bandwidth is utilized. For average traffic rates, 40-percent utilization of Ethernet bandwidth is a rather heavily utilized network since collisions are very probable and most of the traffic on the network is retransmission traffic. However, the example is intended to show a worst-case real-world performance scenario.

For simplicity, we assume that the following traffic is on the Ethernet:

To calculate the total packets per second that would be on the Ethernet, we need to apply the following formula for each of the different packet sizes: (BW x Percent Media Used) / (Packet Size x bits/byte) = Packets per Second.

Using this formula yields:

The total, 2798 pps, is NOT the pps rate that goes through the router...if it is, the network design is not optimal and should be changed. Rather, the 80/20 rule applies to most nonswitched networks, where 80 percent of the traffic stays on the local network and 20 percent goes to a different destination. Then we have 2798 x 20% = 560 pps that the router must deal with from that single Ethernet network. If we take six Ethernets with similar characteristics, we get an aggregate of 3360 pps that the router must support.

Now consider a scenario with central servers and assume that the 80/20 rule does not apply; only 10 percent of the traffic stays local and 90 percent goes through the router to the servers that are off the backbone. In this scenario, the router must support 6 x (2798 x 90%) = 15,110 pps for our example of six Ethernets. The appropriate router platform must be chosen that will meet the traffic requirements.

This example shows how the packets-per-second requirement for varying networks is computed. As will be shown in subsequent sections, all Cisco router platforms meet and greatly exceed the pure packets-per-second requirements of real networks.

 

Router Platform Switching Paths

This section will list the switching paths that the various router platforms support.

 

Low-End/Midrange Routers

This category of routers includes the Cisco 2500, 4000, 4500, and 4700 series routers. The switching paths supported for these routers are process switching and fast switching. Fast switching is on by default for all protocols.

The aggregate performance numbers in packets per second are listed in Table 3.
Table 3. Aggregate Maximum Performance for Low-End/Midrange Routers (in Packetsper Second)  
Switching Paths  2500 Series  4000  4500  4700 
Process Switching  1000  1800  10,000  11,000 
Fast Switching  6000  14,000  45,000  50,000 
 

Features Affecting Performance

Understanding how a given feature will affect the router's switching paths is critical when designing networks. Many new features are initially incorporated into the process switching path and, in subsequent releases, incorporated into faster switching paths. The most current enhancements are listed in Cisco Connection Online (CCO) under Technical Assistance/Tech Tips: Hot Tips/IOS Information. There you will find new features for Cisco Internetwork Operating System (Cisco IOSTM) releases and any performance enhancements to previously implemented features.

 

Low-End and Mid-Range Router Memory Considerations

Most performance concerns arise from the need for sufficient memory to run in certain environments and the necessity to prevent overstrain on the CPU. The memory considerations are primarily issues for the low-end and midrange platforms. Product Bulletins #284 and #290 address these issues for the Cisco 4000 and 2500 series routers, respectively. They can be accessed via the Web as follows:

 

Other Considerations

Additional features that most affect CPU utilizations are link-state routing protocols such as Open Shortest Path First (OSPF) and NetWare Link Services Protocol (NLSP), tunneling, access lists, accounting, layer 2 forwarding (L2F), multichassis MP, queuing, compression, and encryption.

No boilerplate mechanism to give hard-and-fast platform limitations exists. What needs to be considered is that, for any given platform, the number of interfaces you can support depends greatly on encapsulations and features used. The aggregate maximum packets per second is a useful number for approximating the maximum number of interfaces to put into a given platform as long as some real-world analysis of the traffic flow is done. If designs follow a more theoretical maximum- packets-per-second approach, the Cisco routers will be greatly underutilized.

Some common rules to follow:

Network Design Guidelines

Some common network designs are suboptimal in terms of performance; most of these are based on media mismatch. (See Figure 8.)

Figure 8. : Media Mismatch

fig_8.gif

In this scenario, two separate cases of common media mismatch problems are shown. The first problem is between router B and router C where multiple clients are trying to access a centralized server farm. What may not be obvious at first glance is that the 56-Kb line is the primary connection between the clients and the servers, and it will quickly become oversubscribed with traffic. At the very least, enough bandwidth to support the maximum expected peak traffic between the clients and servers should be in place. Or, if certain Ethernet segments make extensive use of a particular server, distributing servers to local Ethernet segments will greatly improve network performance.

The second performance problem is through router A, where the server farm gets backed up to a network of backup servers. The media mismatch from 100 Mbps FDDI to 10 Mbps Ethernet is the bottleneck. To gain optimal performance for high-speed backups going through the router, the media speeds should be maximized.

 

Conclusion

Choosing the appropriate router interface media and router platform are important to designing optimized performance networks. Choose the appropriate media by understanding what average and peak traffic flows are for different points of the network. At the very least, an approximate calculation can be performed for worst case traffic scenarios. Next, the appropriate protocol feature set needs to be determined to ensure sufficient memory and CPU requirements are met. Armed with the knowledge of media interfaces, memory and CPU requirements, the appropriate router platform for a given scenario should be clear.

Click here to return to module 2, section 1.


Copyright Cisco Systems, Inc. -- Version 2.0 1/97