How do you find the bottleneck of a network?

wop@infosec.pub · 2 years ago

wop@infosec.pub · 2 years ago

Added Update 2. There are still some things to do, but we know a little more now. Feedback and questions are still welcome.

phase_change · 2 years ago

Nice job. Packet loss will definitely cause these issues. Now, you just need to find the source of the packet loss.

In your situation, I’d first try to figure out if it is ISP/Internet before looking inside either network. I wouldn’t expect it to be internal at these speeds. Though, did you get CPU/RAM readings on the network equipment during these tests? Maxing out either can result in packet loss.

I’d start with two pairs of packet captures taken while the issue is happening: endpoint to endpoint and edge router to edge router. Figure out whether the packet loss is happening in only one direction. That is, are all the UK packets reaching DE but not all the DE packets making it back? You should be able to narrow in on a TCP conversation with dropped packets. Dropped packets aren’t ones that a system never sent, they’re ones that a system never received. Find some of those and start figuring out where the drop happened.
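To make the capture comparison concrete, here is a rough Python sketch of the idea: a segment counts as dropped in a given direction if it shows up in the sending side's capture but not in the receiving side's. Everything in it is an assumption for illustration: the `Segment` record, the `dropped` helper, the "UK"/"DE" labels and the sample values are all made up, and it presumes you have already exported the relevant fields from both pcaps and that addresses are not rewritten (no NAT) between the two capture points.

```python
from collections import Counter, namedtuple

# Hypothetical record: one row per TCP segment exported from a capture.
# src/dst are host labels here; real data would use IP:port pairs.
Segment = namedtuple("Segment", ["src", "dst", "seq", "length"])


def dropped(sender_capture, receiver_capture, src):
    """Segments from `src` seen at the sending side but missing at the receiving side.

    Counters (multisets) are used so that a retransmission shows up:
    a segment sent twice but received once leaves a count of 1.
    """
    sent = Counter(s for s in sender_capture if s.src == src)
    arrived = Counter(s for s in receiver_capture if s.src == src)
    return sent - arrived  # keeps only positive counts


# Made-up sample data standing in for the two endpoint captures.
uk_capture = [
    Segment("UK", "DE", 1000, 1460),
    Segment("UK", "DE", 2460, 1460),
    Segment("UK", "DE", 2460, 1460),  # retransmission seen on the UK side
    Segment("DE", "UK", 500, 1460),
]
de_capture = [
    Segment("UK", "DE", 1000, 1460),
    Segment("UK", "DE", 2460, 1460),  # only one copy arrived
    Segment("DE", "UK", 500, 1460),
    Segment("DE", "UK", 1960, 1460),  # sent by DE, never seen in the UK capture
]

uk_to_de = dropped(uk_capture, de_capture, "UK")
de_to_uk = dropped(de_capture, uk_capture, "DE")

for direction, losses in (("UK -> DE", uk_to_de), ("DE -> UK", de_to_uk)):
    print(direction, "missing segments:", sum(losses.values()))
    for seg, count in losses.items():
        print("   seq", seg.seq, "len", seg.length, "x", count)
```

Running the same comparison on the edge-router pair of captures would then show whether a segment went missing inside one of the networks or somewhere in between.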

wop@infosec.pub · 2 years ago

The ISPs are slow to answer if there is no active outage. Will take some time anyway.

Packets are dropped in both directions. I am currently looking through the pcaps and will do another stress test later - got another window. MTU/MSS is the priority today.