Network Performance Archives

Black Friday, Cyber Monday.


Boy, what a difference a year makes.

“Black Friday” usually refers to the day after Thanksgiving, when retailers, both online and offline, started getting rushes of orders in order to fulfill Christmas demand. But unless you’re a Wall Street firm, for whom Christmas has come early, you’re probably cutting back on expenditures this holiday season.

Now, it’s probably referring to any Friday that you re-read your quarterly 401(k) or 529 statement.

Still, whether or not people –spend- more online this holiday season, they’ll probably be making a similar number of transactions – that is, Hershey bars instead of Godiva chocolate, Playstation 2s instead of Wiis, Go-Bots instead of Transformers…

And with every dollar counting, the one thing that retailers and suppliers can’t afford on Black Friday and Cyber Monday this year are performance slowdowns like the ones that hit Costco, Victoria’s Secret, Lowe's, and Macy's last year.

Additionally, even if you aren’t a retailer, Cyber Monday typically sends some Web traffic spikes over company networks as employees use the high speed connections work provides in order to make their purchases.

In either case, you can analyze network traffic flows to identify what traffic is mission critical, what is mission irrelevant, and what is mission impossible. After quantifying the impact of certain types of traffic on network performance, you can then implement quality of service policies to ensure that business critical apps have priority access to network resources.

We know that on Friday and next Monday, there will be a higher than normal volume of Internet traffic.  The trick is finding out how much of an impact it will have and preventing it from impacting application performance.


Network Performance Archives

Who you gonna call? – Networking Myth Busting


The Ghostbusters are awesome.

They’re awesome because not only did they have to believe in ghosts, they also had to study ghosts and figure out the physics behind ghosts in order to develop the tools in order to bust ghosts. How did Egon Spengler know that an unlicensed nuclear accelerator would even affect incorporeal ectoplasmic entities? You’d think those particle beams would provide no more of an impediment to a ghost than your average wall.

Similarly, just as the Ghostbusters need to know much more about busting ghosts than the ghost hunting layperson, so too does the network engineer need to know more about busting network myths than the network layperson. And this is very important because networks often behave in unintuitive ways.

Network World has put out an article, “12 myths about how the Internet works,” which explain many of the ideas which are counter-intuitive enough to be headscratchers, but which network engineers deal with on a daily basis.

For example, just because A can reach B does not mean B can reach A, nor is it a given that A can reach C if A can reach B, and B can reach C. Every metaphor the layperson uses for the network – pipes, roads, series of tubes – kind of fails us.

But serious networking performance problems can be inadvertently created when application developers tend to believe some of these myths. For example:


4. The time it takes to initiate communications between two systems is what you'll see throughout the communication.

Thaler says many applications assume that the end-to-end delay of the first packet sent to a destination is typical of what will be experienced afterwards. For example, many applications ping servers and select the one that responds first. However, the first packet may have additional latency because of the look-ups it does. So applications may choose longer paths and have slower response times using this assumption.


This is one of the major reasons that companies might consider putting application developers and network teams into a single “application delivery” team. And to be told, choosing the server with the slowest initial ping time isn’t a “stupid” thing to do – it’s just something based on an assumption that a developer would make that a network engineer would know wouldn’t hold true.

Conversely, network engineers may design systems not realizing the needs of the application. For example:


9. If one stream between you and me can get through, so can another one.

Some applications open multiple connections – one for data and another for control – between two systems for communications. The problem is that middleboxes such as NATs and firewalls block certain ports and may not allow more than one connection. That's why applications such as File Transfer Protocol (FTP) and the Real-time Transfer Protocol (RTP) don't always work, Thaler says.


At any rate, the movie wasn’t called “Ghostbuster.” Work as a team.


Network Performance Archives

How do you quantify MPLS? – Why Networks Often Fail (To Perform)


Part 7 in a series adapted from Joel Trammell’s Keynote Speech at NetQoS Symposium 2008

Think back three years ago. Back then, how many of you had an MPLS environment?

The carriers have been busy. And in that MPLS environment you lose visibility.

So how do you quantify how the performance has changed and how that carrier network is performing? After all, carriers claimed that by going to MPLS you would get better performance. But since they don't really monitor their own networks, how would they know? With MPLS, you don’t have a lot of data to validate whether it has improved performance. The ability to quantify how that carrier network is performing is critically important.

Additionally, how do you gain traffic visibility into the traffic flows, such as voice, that can now, instead of being in a traditional hub and spoke design, go from any location to any other location? The ability to understand the traffic flows without having to put devices out at each of your local sites to get visibility back into that traffic is crucial.

Finally, anomaly detection is important because often we see with MPLS routing changes that “just occur” that often affect performance. Suddenly the route changes on a major protocol, unknown to you, and suddenly performance is dramatically affected. So, you'll detect that in response time but you'll also see it in changes in traffic flows that will show up in anomaly detection.


Network Performance Archives

Data Consolidation and Performance – Why Networks Fail (to Perform)


Part 4 in a series adapted from Joel Trammell’s Keynote Speech at NetQoS Symposium 2008

Is anyone out there contemplating a data center consolidation project or happen to be in the middle of one? How are you going to ensure that performance is consistent, when what you've effectively done-- and most executives in IT don't think about this--is move the users further away? From a networking sense that's what you're doing in a data center consolidation. So the ability to deliver consistent and acceptable application performance is going to be very important.

So, do you know what applications are affected? Do you know what applications are even in each of your data centers? Do you have a current baseline of performance? Do you know the traffic volumes that those applications take up so that you can properly size the infrastructure? If you have fifty Exchange servers and you're consolidating down to three locations, can you figure out what traffic is going to go to those three locations in order to properly size those links and size those servers? Do you understand the interdependencies of all the applications?

With multi tier applications it's not uncommon to think you're moving the entire application when in reality you find out some of the servers just got left behind thousands of miles away. We've have numerous examples of analyzing multi tier applications for people where one of the tiers just didn't get moved or didn't get located in the same physical data center and therefore is now a thousand miles away and so then they wonder why response time and performance suddenly slowed to a crawl.

So again, response time is the key network performance driver. Traffic flows are very important for understanding capacity issues and understanding how much traffic is generated by these consolidations. Device statistics now become important as well within the data center. Am I overloading routers that have been updated as part of the data center consolidation efforts? Finally, packet analysis should be employed to understand the affects consolidation has on multi tier applications and where the servers are physically located.


Network Performance Archives

Just a Flag in the Database – Why Networks Fail (to Perform)


Part 3 of a series adapted from Joel Trammell’s Keynote Speech at NetQoS Symposium 2008

My favorite story for a changed application: A network team was fighting with a thorny application performance issue for a long time and they saw the application had changed its bandwidth usage by an order of magnitude overnight. They went back to the application developers and asked, "Guys, why didn't you give us a heads up that you had changed this application so dramatically?" An application developer said, "What do you mean changed dramatically? We just flipped a flag in the database.”

"What do you mean you flipped a flag in the database?"

"Well, we made the graphics field, you know, available to the user.” So, it went from a text based application to one showing JPEG images that could be on the order of a MB in size, one for every page on that application.
To the application developer, this was not a change of any significance in the application.

But to the networking team, this was a major issue. Often, the application development team is not going to know what's going to be a little change to them and a big change to the network engineer. That’s why it’s important to quantify the normal behavior and performance for your key existing applications so that you can detect this change.

Do you have some sort of alarm in place of variations from what that normal performance is? Can you detect an unusual traffic flow that might indicate a change in usage or a change in architecture? Perhaps the server team has repositioned servers, it's going to have a dramatic effect on your network, though they haven't “changed” the network at all, right? Can you reconstruct the transaction timing to understand why things have changed?

So, the key driver in this case to detecting a changed application is going to be response time. You know what a normal baseline response time is. Hopefully, you're going to see an alarm off that normal baseline. Then you're going to want to be able to understand if traffic flows have changed, so anomaly detection may help you in this case and of course packet analysis again may help you understand how the application has specifically changed.


Network Performance Archives

Performance Edge Journal – Volume 3


The third edition of Performance Edge Journal has just been posted for download (some registration required). The publication is edited by Network Performance Daily’s Brian Boyko and is devoted to the diverse networking and application delivery responsibilities that today's network professionals must tackle.

In this edition, industry expert Dr. Jim Metzler contributes recent research, presenting some very interesting expectations and realities of today's NOC and how it's not just about monitoring network availability anymore. Find out what non-NOC IT pros think the NOC staff is really doing.

Performance Edge Journal also explores the risk-reward balance when considering incumbent technology. When is it prudent to stick with what you have? When does it make sense to invest in more modern technology?

Unified Communications adoption and ensuring VoIP quality continue to be issues of concern, so we included one article on some of the things to watch for when managing UC and another one that goes details on how to diagnose and deal with that annoying echo we sometimes get on phone calls. (Did you know that phone echo can never be caused by the digital stream?)

Other topics covered in this issue of the Performance Edge Journal include how technology has impacted the U.S. general election; anomaly detection software that provides early warnings of threats to optimal network performance; a case study that looks at how OSF HealthCare improved network visibility to address existing performance issues and prepare for critical initiatives such as VoIP, MPLS, and new network intensive imaging applications; a white paper discussing latency issues specific to financial trading environments; and finally a 2009 recreational network traffic calendar for planning of high traffic network times.


Network Performance Archives

Interview with ‘Bullied’ Network Engineer on Australian Gov’t Net Filters


Australia’s federal government has planned to require Australian ISPs to use filtering software to remove “illegal” content from Australia’s Internet. They’re spending around $77M (USD) to implement the program which the government had lead people to believe would be optional. Instead, it will be mandatory.

Mark Newton, a network engineer with Internode in Australia (but not working on behalf or speaking for Internode), did an analysis of the data gathered from Australian government trials of filtering software. He concluded that, among other things, more accurate filters degrade Internet speeds over 70%, and less accurate filters can have up to a 15% false positive rate.

In retaliation, Belinda Dennett, a policy advisor to Australia’s communication minister, Senator Stephen Conroy (Labor), wrote an e-mail to Newton’s employer, asking them to reign in the network engineer’s dissent.

We called Sen. Conroy’s office but we were not able to get a response before press time.

We have an audio interview in podcast form with Mark Newton below, with a transcript below the cut.

[Ed. Note: Due to problems with rendering in Internet Explorer 7, we've temporarily disabled the flash player version of the podcast. You can download the podcast as an MP3 file here.]

Continue reading "Interview with ‘Bullied’ Network Engineer on Australian Gov’t Net Filters" »


Network Performance Archives

This-specific-end-to-that-specific-end network performance management.


EMA analyst Dennis Drogseth had a column in Network World yesterday talking about end-to-end application management. In it, he had this to say:


You might believe, and with some real justification, that the term “end to end” is only used by vendors who custom-fit the definition to the scope of their particular product.

Does “end-to-end” application management, for instance, include the mainframe? You bet it does if you’re a vendor that manages the mainframe environment! Does it include capturing the end user experience at the end station, desktop, or mobile device? Once again, the answer is a definitive “yes” if you’re a vendor that has strong QoE (Quality of Experience) roots. Or how about insights into the code and design of the application itself? If you’re one of the few vendors that does this, you’re proud of it and wouldn’t have it any other way!


And this concerned me because, if you do a google search for: [site:networkperformancedaily.com “end-to-end”], you get 122 results. The phrase, “end-to-end” appears in a little more than 1 in 5 posts we’ve made to this blog.

So, what do we mean by “end-to-end?”  We’re usually using the phrase in connection with network response times and the end-user experience at the end station; NetQoS is a “vendor that has strong QoE roots.”

Now, we do have some insight into the code and design of the application.  But that isn’t the focus of our tools; the focus is to tell you whether the problem is in the network, server, or application, and if it’s in the application, give you a good idea of where to start your investigation.  (For example, an application that is slow due to unnecessary round-trip transactions behaves differently from an application that is slow due to a memory leak on the server where it is being run.) 

Drogseth is right when he says that no one vendor is optimized to do it all.  In the future, there could be, but then you run into the quality vs. quantity problem.  Is it better to do it all adequately or to do a few things extremely well?

EMA defined five major technology spheres, and last June, they polled more than 400 respondents to find out which of them they believed “most critical to end-to-end application management in 2008.”  The answer was “Network Application Management,” focusing on application flows and end-to-end (as we define it) transaction capabilities. 

For more information on this, I recommend you read the original article up at Network World.  Additionally, Drogseth promises to follow-up in his next two columns. 


Network Performance Archives

What do YOU think of the NetQoS Performance Center?


What do YOU think of the NetQoS Performance Center?

Peter Sevcik and Rebecca Wetzel, analysts with NetForecast, would like to know (and of course we at NetQoS are always looking for feedback). They have published an article about NetQoS for their “App Performance View” blog on NetworkWorld.com. This is the result of a series of blog posts they are writing about “tools that monitor application performance in real-world environments.” They ask for customers to comment about their experience with the NetQoS Performance Center at the end, and we would like to encourage all of you who have experience with any of NetQoS products to respond: What do you think of the NetQoS Performance Center?

You may respond anonymously if you wish, and the comment can be as short or as long as you want to make it. Feedback is encouraged on this blog as well.

And if you are just thinking about deploying any of our products, check out the customer comments already posted at the end of the Network World post!


Network Performance Archives

Three Things You Can Do Today To Improve Network Performance Without Spending a Dime


For months, we’ve been waiting to see what the fallout would be from the sub-prime mortgage crisis.

Apparently, the results are not unlike a hefty bag filled with chili con carne, dropped from the top of a skyscraper. Only instead of a hefty bag, it’s the U.S. economy.

So, as Wall Street explodes like an explosive so explosive it could explode and create a massive explosion, technology turnaround times will probably extend a couple more years as CIOs try to figure out how to use existing tools to solve network management problems and improve performance. How do you do that?

Luckily, there are ways to do that – Cisco routers and switches already have “application-aware” technologies and don’t require any additional purchases – including IP Service Level Agreement (IP SLA), Class Based Quality of Service (CBQoS), and Network Based Application Recognition (NBAR).

Managing Application Response Times with Cisco IP SLA

Now, measuring real application transactions is the most accurate method for measuring response times. But, failing that, you can use Cisco IP SLA to create synthetic transactions. This is not only useful when on an IT budget crunch but can also provide useful data when assessing whether or not to roll out a new application, or measuring a service provider’s SLA edge-to-edge.

IP SLA operates by sending synthetic transactions between two network devices or between a network device and a server. It can be configured to send different types of synthetic transactions based on port, packet size, type of service, and even more advanced characteristics, as is the case with Voice over Internet Protocol (VoIP) tests. When it gets a response, the sender then calculates the response-time metrics appropriate for the test type, and then repeats multiple times.

Some SNMP polling products can collect data automatically, store it in a database, display the results in a GUI, and provide analytical function beyond data collection, such as calculating baselines, displaying trends, and triggering threshold alerts based on collected IP SLA data. There’s also the possibility of simply getting the information from the CLI, but extracting the IP SLA response-time metrics and copying them to a spreadsheet can be difficult and tedious. However, for the extremely budget-conscious, it can be done.

Deploying Quality of Service with Cisco CBQoS

QoS is a blanket term for network policies and practices that help to manage different types of data traffic that share network links. Effectively, QoS determines how different types of traffic, with different priorities, are handled whenever tradeoffs that are likely to impede performance must be made.

Now, within any enterprise, the end-user experience with certain applications will always be more critical than it is with others. Strategies to avoid (or at least manage) congestion could include dropping traffic, adjusting application responses, and building packet queues. CBQoS is one way to do this – and comes with the CBQoS Management Information Base (MIB) to collect statistics about the traffic traversing the router and reports how the QoS configuration is being applied.

Here, an SNMP polling product with application-aware capabilities can get information on input and output QoS class map utilization, drop percentage, and packet counts. It can also get information on pre-versus-post QoS traffic volume, rate, and packet count. It can also point out traffic marked in conformance, in excess, and in violation of defined policies.

Without CBQoS, network managers don’t have a whole lot of evidence to verify that their QoS settings are actually improving network performance – in fact, they may even be inadvertently harming performance. CBQoS prevents network managers from flying blind with QoS deployments. And, like IP SLA, it’s built into Cisco IOS.

Gaining a New Level of Visibility with Cisco NBAR

From within the network device operating system, Cisco NBAR can inspect packets traversing the device and identify the corresponding application – for example, TCP traffic running on port 80 could be labeled as Google, SAP, SharePoint, SalesForce, etc. NBAR can also provide utilization, volume, and rate metrics on a per-application basis relative to the network circuit carrying the traffic.

It’s similar to NetFlow, but NetFlow identifies protocol traffic mixes – not application-layer visibility. NBAR identifies by application – which is important in setting proper QoS policies. And because NBAR is part of Cisco’s IOS, and the data can be collected with an application-aware SNMP poller (which many of you already have), it can be a more cost-effective solution than application discovery hardware.



1 2 3 4 5 6 7 8 9 10 11