Transparent Cake Box

Hi all, I have a box with two ethernet cards in a simple LEDE bridge configuration. Dnsmasq and the firewall are both disabled. The box sits between my gateway router and my 600 customers: eth0 is connected to the gateway side and eth1 to the customer side. I have cake set up on eth1 with my upload and download values reversed. The limits work and bufferbloat testing shows good results. I am using "piece of cake" in SQM, so cake is using the default triple-isolate.

My question is: does this setup see my internal IPs and do per-IP sharing? Or is running cake on an interface that is part of a bridge the wrong thing to do?

Thanks in advance.

If the box cake runs on does not actually perform NAT for your network (and your network actually requires NAT; for all I know, you might have enough IPv4 addresses to supply all 600 customers), then I predict that you only get per-flow fairness, that is, cake will try to treat each flow equally.

However, in your setup all internal addresses are fully visible to cake, and the NAT performed downstream at the gateway would only rewrite those internal addresses after cake has already seen them, so cake should have the necessary information to perform per-internal-IP fairness.

You should be able to test this by running a few flows on one of your internal hosts and many flows on another (obviously the flows need to cross the cake box and should terminate in the wider internet to exercise the NAT path). If both machines get roughly equal aggregate bandwidth, that indicates per-IP fairness; if the bandwidth is distributed according to the few-to-many ratio, you have simple per-flow fairness. Please note that per-flow fairness is not too bad, and that you might want to use the dual-srchost and dual-dsthost options, as triple-isolate does not guarantee strict per-internal-IP fairness. With the dual options you need dual-srchost on the interface that handles the internet upload traffic and dual-dsthost on the interface that handles the internet download traffic.
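A sketch of such a test, using iperf3 from two internal hosts against an external server (the server address is a placeholder; this needs live hosts on both sides of the cake box and is not runnable as-is):

```shell
# Host A: a single flow towards an external iperf3 server
# (iperf.example.net is a hypothetical placeholder).
iperf3 -c iperf.example.net -t 60

# Host B, started at the same time: eight parallel flows.
iperf3 -c iperf.example.net -t 60 -P 8

# Per-internal-IP fairness: both hosts report roughly the same
# total throughput. Per-flow fairness: host B gets roughly 8x
# the throughput of host A.
```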

A more interesting question is whether cake's built-in number of hash bins (see #define CAKE_QUEUES (1024) in sch_cake.c) will be sufficient for 600 connected machines. You might want to compile/bake your own cake module with more CAKE_QUEUES. (I would recommend you bring your questions to the cake mailing list (https://lists.bufferbloat.net/listinfo/cake), where Jonathan Morton, cake's principal author, has a chance to chime in.)
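Bumping that define before the rebuild could be sketched like this (the exact path of sch_cake.c inside your LEDE build tree is an assumption, as is the replacement value):

```shell
# Raise CAKE_QUEUES from its default 1024 before rebuilding the
# module; point this at sch_cake.c in your own build tree.
bump_cake_queues() {
  sed -i 's/#define CAKE_QUEUES (1024)/#define CAKE_QUEUES (4096)/' "$1"
}
```

After running it, `grep CAKE_QUEUES sch_cake.c` should show the new value before you rebuild.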

Best Regards

Can you please share more details about your configuration? What device do you use as the "transparent cake box", how fast is your internet connection, and if dnsmasq and the firewall are disabled, what device is handling them? Could you post your configuration files here to show how you managed to configure the "cake box", or PM me if you'd prefer that.

I'm asking because I'm thinking about doing something similar, but I'm not sure exactly how to do it so as to fully use all the features that cake provides.

I am personally looking forward to seeing the results of this test, since I'm thinking about doing a similar or identical setup.

Yes, but doesn't the cake box see all the internal IPs, since all the traffic is coming through it?

The box is a dual-core Pentium with 4 GB RAM running an x86_64 build of LEDE. It has two Intel gigabit NICs bridged together and one Realtek onboard NIC in DHCP client mode for management purposes. The config was done purely through the LuCI interface, and I just disabled dnsmasq and the firewall on startup. The internet connection is 300/100 dedicated fiber. During peak, all 300 Mbit/s are being used and CPU usage is around 1% from what I can tell; it's very, very low. The 600 customers are actually 600 standard home routers. The customers go through the cake box to reach the NAT gateway, and they share one external IP address. Today I will recompile LEDE with a bigger number of hash bins for cake. Does anyone know what string I should be using in the advanced settings for the dual-srchost and dual-dsthost options?

Thanks in advance
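For reference, the dual options are plain strings passed through to the cake qdisc; in SQM's advanced option fields (iqdisc_opts/eqdisc_opts in /etc/config/sqm) a sketch would be:

```shell
# Fragment of /etc/config/sqm -- a sketch only; which option goes on
# which interface depends on its traffic direction, as discussed in
# this thread.
	option iqdisc_opts 'dual-dsthost'
	option eqdisc_opts 'dual-srchost'
```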

Interesting, with that setup you could forgo the ifb-based ingress shaping and shape on the egress of both Intel NICs instead, which might save around 5% overhead (a very rough number). I would try to instantiate the shapers on the physical interfaces instead of on the bridge interface, and then test whether this actually works as intended.
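Such an egress-only setup might be sketched as follows (interface names and rates taken from this thread; untested, and the exact cake options you want would still need to be decided):

```shell
# Download direction: traffic leaves towards the customers via eth1,
# so shape the internet download rate there.
tc qdisc replace dev eth1 root cake bandwidth 300Mbit dual-dsthost

# Upload direction: traffic leaves towards the gateway via eth0,
# so shape the internet upload rate there.
tc qdisc replace dev eth0 root cake bandwidth 100Mbit dual-srchost
```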

I actually have it applied to the physical interfaces; applying it to the bridge interface didn't work at all. Also, applying it to the egress of each interface works, I just wasn't sure about the IP visibility, but in theory cake should see the internal IPs from either interface.

I just recompiled LEDE with #define CAKE_QUEUES (65535). Anything higher than 65535 caused LEDE to crash on boot. I will also begin testing a duplicate box with a few machines to confirm that sharing is working.

Ah, great that it works. As far as I can tell, the larger CAKE_QUEUES gets, the less cache-friendly cake behaves, so maybe you should test a few sizes to figure out how many parallel flows you actually ever encounter and set only the closest power of two. Cake uses, IIRC, a set-associative hash list, so it will avoid two flows sharing a queue unless unavoidable; cake therefore probably gets away with fewer CAKE_QUEUES than fq_codel would.
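Picking "the closest power of two" is trivial to script; a small helper (the flow count you feed it would come from your own measurements):

```shell
# Round a measured peak concurrent-flow count up to the next power
# of two, a reasonable candidate for a custom CAKE_QUEUES value.
next_pow2() {
  n=$1; p=1
  while [ "$p" -lt "$n" ]; do p=$((p * 2)); done
  echo "$p"
}

next_pow2 600    # prints 1024, which is cake's default
```

So unless each customer router carries many concurrent flows, the default 1024 may already be in the right ballpark, and 65535 is likely far more than needed.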

See also the "Lede as a dedicated QoS/Bufferbloat appliance" topic, which seems to be running along the same lines.

My problem with installing a cake box between my router and clients is that the router is a Peplink that aggregates links. There are four WANs of 10 Mbit/s each attached to the Peplink. As a result, putting a cake box where you do puts it on the "fat pipe", which in practice is about 37-38 Mbit/s download and about 2.5 Mbit/s upload. The bufferbloat occurs on each of the WANs, so a cake box in that position doesn't work: it really needs to be at the other end, between the transparent bridged modems and the Peplink. So a cake box appears to be needed for each of the WANs on the Peplink. So far I have had difficulties getting reliable DNS when I put cake boxes between the bridged modems and the Peplink.

Hi all,
I'm in a similar situation: I'd like to put a transparent instance at the office between the LAN interface of our gateway router and the main switch. The router does NAT and is the only router in our network.
I'd like to know if it's possible to have per-host fairness and how to do it.

I've already prepared a test box configured as follows:
wlan as the control interface
two unmanaged interfaces, eth0.1 and eth0.2, bridged together
This is an old WAP with no separate physical interfaces for WAN and LAN, so I'm using VLANs; could this be a problem?

Thanks in advance
Lorenzo

At that point incoming packets already use the internal IP addresses as destination and outgoing packets use the internal IPs as source, so per-internal-IP fairness should work. So pick the VLAN that connects to the router, create an sqm instance for it, and then add the following to your /etc/config/sqm (and then restart sqm):
option iqdisc_opts 'dual-dsthost'
option eqdisc_opts 'dual-srchost'

That should give you internal-host fairness for IPv4 (IPv6 will also work correctly). Now, I do not know how this will interact with a bridge, and it might be a good idea to set the ingress bandwidth on the WAN VLAN to zero and instantiate another sqm instance on the LAN VLAN where you only configure egress (since LAN egress equals internet ingress, that will work fine). Then just add:
option eqdisc_opts 'dual-srchost'
to the WAN VLAN and:
option eqdisc_opts 'dual-dsthost'
to the LAN VLAN (please note that here it is dual-dsthost, not dual-srchost). Make sure to properly test all of this; I am certain that I am overlooking things and stuff might break/misbehave...
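Put together, the two-instance variant might look like this in /etc/config/sqm (a sketch only; the section names, interface assignments, rates and script choice are placeholder assumptions you would adapt):

```shell
# /etc/config/sqm -- sketch of the two-instance variant
config queue 'wan_vlan'
	option interface 'eth0.2'        # VLAN towards the router (assumption)
	option enabled '1'
	option upload '20000'            # shape internet upload here (kbit/s)
	option download '0'              # ingress shaping disabled
	option qdisc 'cake'
	option script 'piece_of_cake.qos'
	option eqdisc_opts 'dual-srchost'

config queue 'lan_vlan'
	option interface 'eth0.1'        # VLAN towards the LAN (assumption)
	option enabled '1'
	option upload '20000'            # LAN egress == internet download
	option download '0'
	option qdisc 'cake'
	option script 'piece_of_cake.qos'
	option eqdisc_opts 'dual-dsthost'
```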

Best Regards


I'll try and report the result, thank you very much!

Lorenzo

It works, indeed it works very well!!
I tried the simple setup with only one cake instance on the WAN interface, and I'm surprised by what a cheap box can do thanks to the work of some clever developers :wink:

no sqm:

[chart: all-no-sqm]

cake enabled:

[chart: all-piece-of-cake]

[chart: all-layer-cake]

Lorenzo

The change looks nice, but I believe you still need to test whether the per-internal-IP fairness actually works... Try running flent from multiple internal hosts against an external one, and use rrul_cs8 to get a test that uses 8 flows per direction; in the end the total bandwidth of the rrul host and the rrul_cs8 host should be roughly equal...

But nice that it works. BTW, could you post the result of "tc -s qdisc" from your shaper appliance, please?

Best Regards

Hi moeller0,
my colleagues mostly use Windows PCs, so I'm not able to run multiple flent tests at the same time. Maybe I can deploy some virtual machines, but it will take some time to organize the test because I can only play in the late afternoon when everyone else is gone.

I'm not sure the charts are showing the right numbers; we have a 20/20 fiber link and I usually see ~20 ms ping, so I need to re-check these results as well.

Now I'm home and can't connect to the appliance (our VPN doesn't route wifi signals :wink: ) but I will post it tomorrow from the office.

Thank you very much for your help!!!

Lorenzo

Here it is:

tc -s qdisc

qdisc noqueue 0: dev lo root refcnt 2
Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
qdisc fq_codel 0: dev eth0 root refcnt 2 limit 10240p flows 1024 quantum 1514 target 5.0ms interval 100.0ms ecn
Sent 405922122 bytes 768103 pkt (dropped 0, overlimits 0 requeues 3)
backlog 0b 0p requeues 3
maxpacket 1514 drop_overlimit 0 new_flow_count 11 ecn_mark 0
new_flows_len 0 old_flows_len 0
qdisc noqueue 0: dev br-sqm root refcnt 2
Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
qdisc noqueue 0: dev eth0.1 root refcnt 2
Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
qdisc cake 8005: dev eth0.2 root refcnt 2 bandwidth 18944Kbit diffserv3 dual-dsthost rtt 100.0ms raw
Sent 132948147 bytes 373123 pkt (dropped 462, overlimits 145714 requeues 0)
backlog 0b 0p requeues 0
memory used: 374976b of 4Mb
capacity estimate: 18944Kbit
Bulk Best Effort Voice
thresh 1184Kbit 18944Kbit 4736Kbit
target 15.3ms 5.0ms 5.0ms
interval 110.3ms 100.0ms 10.0ms
pk_delay 0us 808us 285us
av_delay 0us 75us 27us
sp_delay 0us 10us 8us
pkts 0 372759 826
bytes 0 133586964 52980
way_inds 0 13902 0
way_miss 0 22667 24
way_cols 0 0 0
drops 0 462 0
marks 0 2 0
sp_flows 0 1 0
bk_flows 0 1 0
un_flows 0 0 0
max_len 0 1514 167

qdisc ingress ffff: dev eth0.2 parent ffff:fff1 ----------------
Sent 270862410 bytes 397341 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
qdisc mq 0: dev wlan0 root
Sent 1297878 bytes 3281 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
qdisc fq_codel 0: dev wlan0 parent :1 limit 10240p flows 1024 quantum 1514 target 5.0ms interval 100.0ms ecn
Sent 2098 bytes 21 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
maxpacket 0 drop_overlimit 0 new_flow_count 0 ecn_mark 0
new_flows_len 0 old_flows_len 0
qdisc fq_codel 0: dev wlan0 parent :2 limit 10240p flows 1024 quantum 1514 target 5.0ms interval 100.0ms ecn
Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
maxpacket 0 drop_overlimit 0 new_flow_count 0 ecn_mark 0
new_flows_len 0 old_flows_len 0
qdisc fq_codel 0: dev wlan0 parent :3 limit 10240p flows 1024 quantum 1514 target 5.0ms interval 100.0ms ecn
Sent 1295780 bytes 3260 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
maxpacket 0 drop_overlimit 0 new_flow_count 0 ecn_mark 0
new_flows_len 0 old_flows_len 0
qdisc fq_codel 0: dev wlan0 parent :4 limit 10240p flows 1024 quantum 1514 target 5.0ms interval 100.0ms ecn
Sent 0 bytes 0 pkt (dropped 0, overlimits 0 requeues 0)
backlog 0b 0p requeues 0
maxpacket 0 drop_overlimit 0 new_flow_count 0 ecn_mark 0
new_flows_len 0 old_flows_len 0
qdisc cake 8006: dev ifb4eth0.2 root refcnt 2 bandwidth 18944Kbit besteffort dual-srchost wash rtt 100.0ms raw
Sent 272971644 bytes 394965 pkt (dropped 2376, overlimits 372236 requeues 0)
backlog 0b 0p requeues 0
memory used: 681408b of 4Mb
capacity estimate: 18944Kbit
Tin 0
thresh 18944Kbit
target 5.0ms
interval 100.0ms
pk_delay 699us
av_delay 102us
sp_delay 11us
pkts 397341
bytes 276425184
way_inds 7790
way_miss 21643
way_cols 0
drops 2376
marks 1
sp_flows 1
bk_flows 1
un_flows 0
max_len 1514

That looks good. I guess the only obvious improvement would be to specify the correct per-packet overhead, which for fiber links is unfortunately somewhat hard to get right... (that, and testing whether internal-host isolation works*)

*) You do not actually need to run flent on multiple internal hosts; as long as you have one flent-capable host, simply use the dslreports speedtest (configured for multiple streams) on the Windows hosts, and make sure to extend the test duration to 30 seconds or more. Then expect the flent-measured throughput to scale back to the 1/(concurrently active hosts) fraction whenever the Windows hosts have their speedtests running. BTW, https://www.dslreports.com/forum/speedtestbinary offers command-line clients for the dslreports speedtest, making testing a bit simpler.
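On the flent side, the invocation might look like this (a sketch; the netperf server address is a placeholder, and you need netserver running on the remote end):

```shell
# 60-second rrul test from the flent-capable host, run while the
# Windows hosts have their dslreports speedtests going.
# netperf.example.net is a hypothetical placeholder.
flent rrul -H netperf.example.net -l 60 -t "cake-fairness-check"
```

The resulting .flent.gz data file can then be plotted to compare the flent host's share against what the speedtesting hosts achieved.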

Good luck & Best Regards

Could you explain to me how to read those values? I'd like to be able to evaluate them myself.

I will try it this way; it's much simpler than setting up multiple virtual machines to run flent.

Many thanks
Lorenzo

Ciao Lorenzo,

Well, I can try to comment on the few values I look at:

    The "raw" at the end of the qdisc line: this is the most important one, as it sort of summarizes how you ended up configuring cake. Since it says "raw" instead of "overhead NN", I believe that you did not configure the per-packet overhead explicitly, hence I mentioned it.
    The drops counters: the fact that there were drops tells me the shaper is doing its job...
    pk_delay: AFAIK, this shows the peak delay cake induced; it looks quite nice.
    drops vs. marks: this tells me that only very few of your data flows use ECN, as otherwise there would be fewer drops and more marks.

And max_len shows that you do not suffer from large meta-packets (from GRO or GSO), as otherwise max_len would be larger; 1514 is typical for MTU-1500 packets, as the Linux kernel's skb structure adds the size of the ethertype and the two ethernet MAC addresses (the kernel fills these fields, and hence 1514 is true from the kernel's perspective, even though it is not really the on-wire size a shaper should account for).
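If you ever did see max_len much larger than 1514, the offloads could be inspected and switched off with ethtool (a sketch; eth0 as the underlying physical interface is an assumption):

```shell
# Check whether GRO/GSO are enabled and could produce over-sized
# meta-packets on the shaped interface.
ethtool -k eth0 | grep -E 'generic-(receive|segmentation)-offload'

# Disable them if the shaper is seeing super-sized packets.
ethtool -K eth0 gro off gso off
```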

I hope that helps a bit.

Best Regards