I use peerapp and cachemara transparent caching solutions. One is basically a squid and the other is more like an IDS.
I have 4 sites running theses caches, all with CCR 1016. Recently I upgraded a RB1100ahx2 from one site (working fine) to CCR and got this too.
I setup firewall-mangle rules to route mark all tcp 80 and >1024 to a default route to the cache server.
The problem is, all running interfaces flap at the same time (go to down state). After a few seconds they come up again. Some take a little bit more time. It happens on irregular intervals, the more traffic the more often. Usually once an hour with 300-500 megabits. No other errors appear on the log and there are no drops or errors on the interfaces. The CPU is low, about 2-5%.
I am being unable to use my cache servers due to this bug. I will try to upgrade one site to a 1036 and see if it still happens.
Today, I tried disabling autonegotiation on my cache server interface (mikrotik side), after some time the interface got down and traffic stopped going to the cache and went to the gateway (check ping on the cache route), although no redirection was being made the CCR still flapped. If I disable the mangle rules the CCR stops flapping.
I was using 6.1 and am using 6.2 ROS, same problem. Happens on both cache servers (peerapp and cachemara).
Cachemara Site M:
4 chain=prerouting action=mark-routing new-routing-mark=cache passthrough=yes
protocol=tcp src-address-list=cache in-interface=!ether12
dst-port=80,2710,3310,5869,6500,6969
5 chain=prerouting action=mark-routing new-routing-mark=cache passthrough=yes
protocol=tcp dst-address-list=cache in-interface=!ether12
src-port=80,2710,3310,5869,6500,6969
Peerapp Site T:
4 X ;;; peer out
chain=prerouting action=mark-routing new-routing-mark=cache
passthrough=yes protocol=tcp src-address-list=cache in-interface=!ether18
dst-port=80,1024-65535
5 X ;;; peer in
chain=prerouting action=mark-routing new-routing-mark=cache
passthrough=yes protocol=tcp dst-address-list=cache in-interface=!ether18
src-port=80,1024-65535
Tried to underclock the CCR. Tried to use address lists and fixed src/dst. Changed interfaces, changed caching servers. Nothing works.
Please help, I dont know what to do anymore.