I have setup a bgp monitoring tool which sends me alerts by email.
I constantly receive alerts which I cant make any sense of.
The BGP we have in place was configured by our former net admin.
We have a /20 allotted to us by Arin however we have opted to use 8 /24's at each of our edge routers(2 edge routers)
We have what we call the north network and south network.
we'll say our /20 is 1.1.1.1/20
We have 2 upstream providers both are ATT but setup in a way to give us redundancy should one of them fail.
At each of of our edge routers we are peered with upstream on wan interface
On inside interfaces we have a /29 private range setup with both our edge routers using this range to communicate ibgp.
the edge router for north end of network has 1.1.1.1/24, 2.2.2.2/24, 3.3.3.3/24, 4.4.4.4/24, 5.5.5.5/24, 6.6.6.6/24, 7.7.7.7/24, 8.8.8.8/24 6.6.6.6/27 on the inside interface. I know the /27 is redundant but this is how he set it up to give a business customer a /27 for themselves.
We also have 172.16.16.3/29 on inside interface which is remote peer address for the south
the edge router for the south has 9.9.9.9/24, 10.10.10.10/24, 11.11.11.11/24, 12.12.12.12/24, 13.13.13.13/24, 14.14.14.14/24, 15.15.15.15/24, 16.16.16.16/24
We have 172.16.16.2/29 on inside interface which is remote peer address for mikrotik on north network.
Our configuration doesn't seem to cause customer problems, however last week when our upstream provider on the south network went down instead of the south traffic going over our ibgp link to the north network upstream provider
it did nothing.
I have copied and pasted our configurations into notepad but will paste those in a reply post to avoid this post becoming a page long.
We also have some private ip ranges on both edge routers inside interfaces for our management subnets.
I read about something called route flapping and I have a feeling this may be going on or issues with our routing filters, which only have one entry apiece.
Please calling all BGP experts to advise me how to find my former net admin's configuration mistakes.