下载中心 | 网站地图 | 站内搜索 | 加入收藏

安恒公司 / 技术文章 / 网络管理与网络测试 / 网络测试 / 技术文章:浪费带宽的隐患以及系统的故障诊断清单

2004-06-10   阅:    下页:
技术文章:浪费带宽的隐患以及系统的故障诊断清单

来自于“Cabling Network Systems”和“Globe and Mail”杂志的对于故障诊断的两种见解:浪费带宽的隐患以及系统的故障诊断清单

《Cabling Network Systems》网络故障诊断:关键在于了解问题的根源。福禄克网络公司加拿大分公司的产品专家 Ron Groulx 认为成功的故障诊断的要基于良好的观察力、正式的培训以及实践经验。但如果具有*个基础的故障诊断清单会相对缩短学习的过程。

《Globe & Mail》福禄克网络公司加拿大分公司产品经理 Brad Masterson 描述了*个用户如何试图通过增加带宽来解决问题,并*终发觉问题的根源,只需在网络的另*部分安装*个简单、便宜的方案。

https://anheng.com.cn/news/html/network_troubleshooting/306.html 

https://anheng.com.cn/news/html/network_troubleshooting/306.html 

《Cabling Network Systems》网络故障诊断:关键在于了解问题的根源。


CNS Magazine, March/April 2004

Network Troubleshooting

While it can be a complex chore, the key lies with understanding the root of a problem.

By Ron Groulx

The key to successful troubleshooting is knowing how the network functions under normal conditions, since it enables a technician to quickly recognize abnormal operation.

Any other approach is little better than a shot in the dark.

While the foundation of good troubleshooting is based on insight, formal training and practical experience, the following information can help shorten the learning curve on isolating and solving network problems.

Before the onsite visit: Technicians can save considerable time and resources by determining ahead of time whether an on-site visit is required. Even with continuous improvement being made in operating system software reliability, "reboot your PC" is still the first step.

Information can also be gathered over the phone with the help of the user. Most users can open a command prompt and report back to the technician the result of an IPCONFIG command.

This tells the technician whether the PC has an appropriate address for the subnet to which it is physically connected.

Have the user attempt to use the network following receipt of a fresh IP address. If the IPCONFIG command reports that the DHCP operation cannot be performed, then the user is probably using a static IP configuration.

If the user has reported a valid IP address, try pinging that address from your desk. If the user's PC responds, then have the user attempt some other network activity, such as opening a Web page or pinging the local router to verify basic connectivity.

Verifying the problem on site: If a site visit is necessary, it is important to question the user about any action or activity that may have affected network performance, including any recent changes (i.e. moving office furniture or installing a new screen saver).

The next step is to repeat the tests the user performed previously over the telephone.

A successful ping to a network server or off-net device immediately confirms that the workstation has Layer 3 connectivity to the network, which means all lower-layer tests are instantly deemed "not needed". If Layer 3 connectivity cannot be validated, then troubleshooting must start at the Physical layer--Layer 1.

Extended troubleshooting: Once the inability to log into the network has been verified, the next step is to determine whether the issue relates to the network or the user's PC. To verify this, the technician must determine whether the cable connecting the client to the network is in place and/or functioning properly.

Solving network problems in a timely, cost-effective manner at this point requires a tool that can quickly verify the status of critical network functionality. Handheld devices exist such as Fluke's Network Multimeter that can be used to find basic connection problems and confirm critical network operational parameters, in order to eliminate the presence of physical-layer issues before escalating the trouble-ticket to a more senior technician.

In a shared Ethernet environment, when too many stations attempt to transmit simultaneously, performance may suffer dramatically due to collisions.

While the existence of collisions is a normal part of half duplex Ethernet operation, when the number of collisions begins to rise due to increasing traffic, the traffic volume will begin to rise at an increasing level because of the re-transmissions required.

The network will display a performance curve that suddenly "falls off a cliff" as the number of frames sent, collisions, and re-transmitted packets spirals upward at a rapidly-increasing rate.

Be reminded, however, that if connected to a single switch port (not shared media) the only traffic seen may be broadcast frames, which can be very intermittent on low traffic networks.

Multiple collisions

A switch may operate in full duplex mode, essentially eliminating the shared Ethernet performance drops caused by multiple collisions.

If a link can be established and utilization is reasonable, the user may then press the button corresponding to the ping test to obtain an IP address from the network's DHCP server.

The failure of either a client's or the troubleshooting tool's automatic DHCP configuration could point to a problem with the DHCP relay system.

The process of obtaining a DHCP address demonstrates the viability of the local cable, the local hub or switch port, and the network infrastructure all the way back to the DHCP server. In one simple operation, therefore, most of the nearby network infrastructure has been validated up through Layer 3.

The simple success of a ping indicates that end-to-end Layer 3 connectivity exists between the two devices. The total roundtrip travel time for the request is easily compared to known values to provide a helpful diagnostic for more detailed analysis, if deeper analysis is required.

It is useful to send a series of pings to give the destination multiple opportunities to respond.

Servers outside the enterprise network may also be used as the target for pinging to verify WAN interconnectivity from the client station and local site to a remote site.

If servers within the firewall respond to ping, but those outside the firewall do not, then the source of the problem may be with routers or other aspects of the network boundary infrastructure.

If pings are successful to both external and internal servers, but the client is not receiving those services, it indicates that the problem lies at a level beyond the physical transport.

Next steps

If these instant tests are unsuccessful or inconclusive, then it is time to look at the network cabling. If the cable tests are successful but the problem continues, then the call should be escalated to a senior level network technician for resolution.

The next step is to trace the cable into the wiring closet and the local hub or switch. This can be simplified by using a tone probe feature for audible tracing, as well as a flash function for locating port links.

If the hub or switch port test is good, then the workstation might be the source of the problem. This can be verified by testing for the presence of link and the speed and duplex settings offered by the NIC.

Remediation procedures at this point can include rebooting and retesting of the link, network and protocol reconfiguration, and address verification.

If all components are in place and properly configured, and the workstation still does not show proper network and application connectivity, it is time to escalate the problem beyond the field technician level.

While troubleshooting can be a complex chore, understanding the root of a problem before escalating it to a more senior level can be instrumental in reducing workload and saving costs.

If a technician can quickly isolate the problem, he or she can then determine next steps and make the decision as to whether it can be resolved at the department or group level. All it really takes is solid groundwork.

Ron Groulx is a product specialist with Fluke Networks Canada. A member of the IEEE, he has been involved in the field of networking since 1997.

 


《Globe & Mail》福禄克网络公司加拿大分公司产品经理 Brad Masterson 描述了*个用户如何试图通过增加带宽来解决问题,并*终发觉问题的根源,只需在网络的另*部分安装*个简单、便宜的方案。
Urban legends: Bandwidth gone wild

By Brad Masterson
Globe and Mail Update

Front Lines is a guest viewpoint section offering perspectives on current issues and events from people working on the front lines of Canada's technology industry. The author is the Canadian product manager for Fluke Networks (www.flukenetworks.ca). He has been involved in the field of networking and network testing since 1995, is a Certified Engineering Technologist registered with OACETT, and is a member of BiCSi. He can be reached at brad.masterson@fluke.com.

https://anheng.com.cn/news/html/network_troubleshooting/306.html 

The world of enterprise networking is full of fiscal horror stories.

For example, a school board spent more than $50,000 on services and $100,000 on new equipment to fix a problem on its network to no avail. In desperation, they called in a network troubleshooter, who applied his network analyzer to find the root of the problem. After all the trials and tribulations, testing indicated it was a software glitch (the update, it turned out, was free).

Another school board spent more than $40,000 on network upgrades without seeing any performance improvements. The problem? A configuration issue that left the organization with only 10 per cent of the bandwidth it was supposed to have.

We'd like to think that networking stories like this are exceptional, but that is far from being the case. It is an all-too-common habit in today's environments to spend too much time and budget on fixes before determining the root of the problem. And the most common "fix" of all is thought to be adding more bandwidth.

The general train of thought is that network slowdowns mean you need more bandwidth. In actual fact however, according to Tony Fortunato of The Technology Firm (www.thetechfirm.com) - seasoned network testing expert - bandwidth is a problem in less than 10 per cent of the cases he has encountered. In the meantime he says, there has been a considerable amount of time and money spent on unnecessary fixes.

"I've been called into situations where companies have spent tens of thousands of dollars on upgrades to systems only to find things are worse," he says. "Typically companies have already spent at least $40,000 in fixes before I'm called in. A day or two of troubleshooting with network test tools before you throw in the dollars will often find it's something quite simple and inexpensive."

Fortunato says the cause of slowdowns can range from the keyboard to the Internet Service Provider, and anything in between. The variations he has dealt with are numerous. He said one insurance company spent $1-million in upgrades only to find file retrieval was slower than on the old system. The problem? Software coding and some minor infrastructure problems. After the fix, they found they were actually able to cut their bandwidth requirements in half and save $200,000 a year.

A financial institution was staring at a possible $500,000 in bandwidth upgrades, only to discover the problem was that the network drive mapping for its 7,000 PCs was improperly configured. It was a simple matter of disconnecting the computer in the lab which had been left on in error.

One government agency was actually running too much bandwidth, which slowed the applications down. It simply had to reduce its bandwidth to optimize the application's performance.

"It was the Lucy and Ethel Syndrome," says Fortunato (his pet term that refers to the famous chocolate factory scene where the assembly line keeps accelerating with disastrous consequences). "The application could only process information at a certain rate."

In another case, an oil and gas company discovered that the slowdowns were at the ISP (Internet Service Provider) site, which meant they were being billed for more bandwidth than they were getting — not to mention the fact they had already spent $50,000 on upgrades to resolve the "problem" before finding the real culprit.

These few of many examples indicate that the amount spent on equipment and/or bandwidth upgrades without discovering the root of a network problem can be staggering. Adding bandwidth is a particularly expensive undertaking. A single 3 Meg link can run you $1,500 a month. Adding a 1 Meg link for a large organization with multiple operations can easily take up to $600,000 a year out of an IT budget. Replacing switches, routers or applications: all of these escalate costs into the tens of thousands of dollars or more. In many of those cases, it is money wasted simply for lack of proper diagnostic tools.

Finding the root of the problem is unquestionably a challenge for operations. Specialists in their respective fields (e.g. switches, routers and application development) tend to focus on their area of expertise. Call in a switch or router expert, and they will diagnose your hardware, suggest upgrades and walk away. If the problem is not the switch, then the cabling professional comes in to do their bit. And so on.

Yet a performance slowdown can be virtually anywhere, from the desktop PC with a hard drive that's too full, to cabling, to patches and connections, to hubs and routers, to an application itself. In some cases, it may be caused by something outside the walls of the enterprise. In others, a single fix may not be enough to cure the problem. Using network analyzers for front-line testing and protocol analyzers for more in-depth analysis will quickly pinpoint the source(s) of the problem and provide the groundwork for taking remedial action.

When the right problem is discovered and action taken, the buck should not stop there. Few go to the trouble of verifying results, but they should. It is extremely important to retest your network thoroughly to make sure that the problem is fixed and there are no other hidden issues.

Even those who believe in troubleshooting networks before calling in the experts tend to overlook another cardinal rule of good network health: testing your network before a problem rears its head. In many cases once the slowdown occurs, the business impact is already being felt in the reduced ability to process transactions, lower productivity and lost revenues.

While some believe that networks are as robust and consistent as telephone systems, this is not the case. The complexities of a networking infrastructure require regular monitoring to ensure peak performance. Not only does this pinpoint potential problems, it is also a good "policing" technique for monitoring your bandwidth usage and quality of service delivery from providers.

Network analyzers can be used to perform a baseline diagnostic (an especially important step to take when implementing a new network). Tests should then be done routinely to detect any change in performance. Usually a quick tweaking of an application or configuration can bring the network back to top speed.

However, it's important to understand the skills of the person performing the test, and that the right tool is being used. A front line cable technician, for example, can perform tests with a simple network analyzer, but in many cases does not have the expertise to work with a protocol analyzer, which is required for much more in-depth analysis and troubleshooting. In many cases, bringing in outside services can be a very cost-effective alternative.

In an ideal world, enterprises would approach their networking investment as we do our cars — or our health. We wouldn't replace an engine if our car performed poorly, or used too much fuel. We would take it to an expert to perform a diagnostic before paying for repairs. Nor would we ask for surgery before running the proper tests.

Why enterprises don't practice the same logic with their networks is a mystery, especially in today's world of fiscal restraint. It is in everyone's best interests to perform proper testing before the big payout. So before contracting for more bandwidth, make sure that's what is really needed. A few simple tests by the right person with the right tools can mean tens or even hundreds of thousands of dollars to bottom line results.
 

下页:   

相关文章
光纤OTDR故障诊断维护测试的新体验 - 13-05-29 - 阅读: 209047
WLAN网络中易被忽视的光纤故障 - 12-12-24 - 阅读: 313516
WLAN网络测试之布线测试 - 12-11-22 - 阅读: 342288
综合布线中*不容易发现的故障——线序问题 - 12-11-22 - 阅读: 304279
安恒网络服务测试中心为某公司提供网络故障诊断测试 - 12-11-19 - 阅读: 363068
*次RAID5故障的恢复和经验教训 - 12-03-25 - 阅读: 280108
艾尔麦WiFi分析仪解决WLAN网络设备漫游故障 - 12-02-14 - 阅读: 256687
用html5离线存储解决故障转移问题 - 12-01-14 - 阅读: 173005
IPv6时代选用什么样的网络测试仪 - 11-02-10 - 阅读: 203737
安恒公司网络测试事业部经销商培训会火热进行 - 10-08-13 - 阅读: 268940
网络健康检测服务介绍,安恒网络测试服务中心 - 10-06-01 - 阅读: 391019
安恒公司为某大学宿舍进行无线网络测试 - 10-04-21 - 阅读: 375638
OTDR光纤测试的实际故障分析案例 - 10-04-09 - 阅读: 303772
网络综合性能评估及故障排查解决方案 - 10-02-18 - 阅读: 258449
OptiView应用程序故障排除专家选件OPVS3-ATE - 10-02-07 - 阅读: 207877
福禄克TAP分路器解决方案,网络测试的常用接入方法 - 10-02-03 - 阅读: 206423
安恒公司网络测试事业部2010年团队建设活动 - 10-01-31 - 阅读: 230866
艾尔麦无线网络测试仪中“AirWISE”是什么? - 10-01-28 - 阅读: 201350
网络诊断日记(十五):使用MicroScanner 2定位电缆故障的物理位置 - 10-01-18 - 阅读: 183008
网络测试仪接入网络分析数据的方式比较,TAP与HUB - 10-01-05 - 阅读: 193817
相关产品
TroubleEvaluator故障评估系统,电信运维产品 - 07-04-10 - 阅读: 622398
布线故障演示箱-分析线缆链路故障、演示故障成因、定位故障 - 08-12-14 - 阅读: 1330138
NetTool II 二代在线型网络测试仪(NetTool Series II Inline Network Tester) - 06-11-01 - 阅读: 1392003
福禄克Fluke 2042电缆探测仪音频探测故障定位 - 05-11-10 - 阅读: 876677
手持式无线网测试仪ES-WLAN网络通无线网络测试仪 - 05-10-25 - 阅读: 1061796
怎样选择福禄克手持式网络测试仪-Fluke选购指南 - 03-12-01 - 阅读: 633515
VisiFault光缆可视故障定位仪,Visual Fault Locator - 07-06-02 - 阅读: 1511298
LinkRunner链路通|Fluke掌上型网络测试仪LinkRunner Kit - 04-12-16 - 阅读: 1652952
ES网络通EtherScope千兆网络分析仪|Fluke便携式网络测试仪ES-LAN - 01-10-23 - 阅读: 2101903
掌上型OTDR - FRFL光缆/光纤故障定位仪 - 08-01-10 - 阅读: 1691974
NetTool网络万用表|掌上型网络测试仪NT-PRO|VoIP选件 - 05-10-21 - 阅读: 1726728
OneTouch 网络故障*点通1TS2PRO-I - 04-06-28 - 阅读: 1953056
手持式网络测试仪方案, Fluke, 福禄克网络测试仪 - 03-12-31 - 阅读: 689450
《网络维护与故障诊断指南》 - 01-11-25 - 阅读: 821642

Email给朋友 打印本文
版权所有·安恒公司 Copyright © 2004   nagios.anheng.com.cn   All Rights Reserved    
北京市海淀区*体南路9号 主语国际商务中心4号楼8层 (邮编100048) 电话:010-88018877