We are using a CCR2116-12G-4S+ as a core gateway and the CCR2004-1G-12S+2XS at a tower site. Using MPLS/VPLS or EoIP tunnels we have discovered that passing traffic from CCR2116-12G-4S+ to CCR2004-1G-12S+2XS gives far better performance than going from CCR2004-1G-12S+2XS to CCR2116-12G-4S+. What I have noticed is that when sending traffic to the CCR2004-1G-12S+2XS , the CCR2116-12G-4S+ uses all of it's CPUs to provide good throughput and the CCR2004-1G-12S+2XS maxes out 2 of it's 4 cpus at 100% but when the traffic comes from the CCR2004-1G-12S+2XS to the CCR2116-12G-4S+ the CCR2004-1G-12S+2XS spreads out the load across all 4 of it's cpus but the CCR2116-12G-4S+ maxes out a single cpu and the throughput is only 1/4 of what it is in the other direction. If I do the same tests between 2 CCR2004-1G-12S+2XS s I get equal bandwidth in both directions and 2 CPUs are maxed out on the receiving end.
So it seems like the CCR2116-12G-4S+ is not multithreaded but the CCR2004-1G-12S+2XS is.
Is this expected or is there maybe a bug in the CCR2116-12G-4S+?