Hello,
This current setup I try to deploy relay on 2x CRS354-48G-4S+2Q switches and CCR1009 router, with few servers at the end.
Router is connected with SFP to each switch, both switches are connected with qsfpplus1-1 links between them, server is connected via sfp1 to switch#1 with sfp2 with switch#2 plus two extra ethernet connection per switch (4 total)
I read mt guide and any other guides that I could find about MLAG with same end results, as soon as I enable and get connection between switches one of them get kickout - traffic is getting blocked? From test-to-test sometime entire network is down (both switches).
I'm using vlan in current network (vlans on switch/bridge), but from documentation that is not a problem as I enabled mstp (MT documentation said mstp is ok with mlag, one single presentation from few years ago say mstp is not compatible with mstp).
The configuration I did was something along this steps:
1. on server side I created bond and added both sfp and 4 eth port as LACP, with l2+3.
2. on both switches I remove ports that will be used for bond from bridge
3. on both switches I created single bond with sfp+2eth with 802.3ad l2+3
4. on both switches I added created bond to bridge
5. vlans were enabled already on bridge.
6. up until now I can ping server from both switches (routeros)
7. before I even enable I can disable link to server and I see that switch with disable link will ping server thru router, but not always (i have to disable/enable bonds on switches)
8. I add on both switches qsfpplus port to bridge with pvid 99 (as in most documentation and there is no collision)
9. **and this is where "funny things happens"**
10. as soon I add qsfpplus1-1 on both port into bridge MLAG peer port and see "connected" one switch get kickout (If I'm unlucky both get kick out killing network).
11. as soon as I'm able to CLI any switch and disable peer port network is up and running.
I tried adding qsfpplus1-1 to all vlans as tagged, not difference. It's been few days now. And I'm stuck here trying to deploy redundancy. Both switches pass vlans without issue the way it was intended, LACP also works without issue with same server connected just to one switch.
I'm out of ideas
I tried to recreating this in GNS3 but looks like CHR version of RouterOS don't support MLAG at all, which I notices after setting everything up.
On forum I found few success stories but many failed on too, and now I dont know if the issue is with my setup, lack of hardware, lack of configuration or just bugs in os.