It's crazy to use 5.x this early as corerouter. We just upgraded one of ourHi @ All
we use a RB1200 as BGP Corerouter and noticed, that the board reboot randomly about once a day.
We also contacted MT Support and they pleased us to wait for version 5.5 (but that didn't help so far)
Yesterday evening I replaced the board with a new RB1200 and today it started to reboot.
Can this be an issue with BGPv6 ???
Best Regards,
Martin
Very true, i would expand that to production networks even it makes Normis jump.It's crazy to use 5.x this early as corerouter. We just upgraded one of our
corerouters to 4.17.
tell me the support ticket number and I will investigate why the upgrade didn't fix your problemHi @ All
we use a RB1200 as BGP Corerouter and noticed, that the board reboot randomly about once a day.
We also contacted MT Support and they pleased us to wait for version 5.5 (but that didn't help so far)
Yesterday evening I replaced the board with a new RB1200 and today it started to reboot.
Can this be an issue with BGPv6 ???
Best Regards,
Martin
i guess you cann't, see this Mr. Janisk posted.can I downgrade the board to 4,17 until you have fixed this problem ?
I can confirm that without filter rules it still rebootsThen we can rule out more things:
We have no dhcp-relay or vlans, but we also have filter rules (may be we can try to disable it for a period and see if reboots disappears, we will report ...).
So, seemingly the problem is not mangle rules, dhcp-relays, vlans, ... and another big thing in play is the queue trees for QoS ...
ZDIV wrote 23.November - latest firmware and still rebootHi All,
Is this problem Fixed. need to know as about to deploy 6 RB1200s in critical Locations. Be well unhappy if just blown a grand on duff gear.
it's available as always, what's the problem?Does not sound good.
And also noticed 5.8 is no longer available on the download site. MK what is happening?
sorry, a small side-effect of web server upgrade. it will be back in a short whileI only see 5.7 as latest listed on the site ??
We have the same problem reported 21.Sep Ticket#2011092166000416After five months on the shelf RB1200 finally detected the problem of Random reboots. I will try now to see resolved.
Just to note I am using 5.8 with 2.37 firmware, which did seem at first to make these things more reliable, after I had problems from 5.2 - 5.5 with lockups.THIS ISSUE COST ME DEARLY TODAY. Affecting almost 200 small business to government office customers. I also do not think the issue is with ports 9-10. I think it is memory related. Unless the memory issue is also related to those ports.
I had a vital router (RB1200) fail today no amount of reboots could get it back up. I had a recent export and started rebuilding the config in a spare 1100AH but several functions were not working. Finally we replaced the memory in the RB1200(hynix 5123MB 1Rx16 PC2-6400S-666-12) with some memory (Kingston ??) from another spare 1100AH and the 1200 fired up instantly.
I have 10 of these in the field. Each is critical to its prospective customer base. I really need to rely on them. So is 5.10 really the answer? Dont use ports 9,10? Should I replace all the memory? This is a real black eye for me, my company and my trust in MT.
Not using ports ether9, ether10 was temporary solution before MT come up with the fix. This fix was implemented in pre-release version 5.10.So is 5.10 really the answer? Dont use ports 9,10? Should I replace all the memory? This is a real black eye for me, my company and my trust in MT.
Hmm, are you sure it's that simple? I'm using port 10 _only_ (connected to a switch with many VLANs) and my RB1200 has been running fine, OSPF and PPPoE (~170 customers, up to about 60% CPU load under about 40Mb/s of traffic), for a few days so far.Watchdog doesn't seem to help after all. We found that the issue is to not use ports 9 and 10 and then the problem will stop. We will fix the issue in v5.10
So - UNPLUG CABLES FROM PORTS 9 AND 10 and see if it helps!
Speaking of network interface, what is happening with the network interface RB435G, stop responding. In version 5.7/2.36 and 2.37. Upgrade to 5.9 now I am testing.After v5.10 is installed, you can use all ports again. Suggestion for not using those ports is only in affect until you install v5.10. Then - everything returns to normal and you can use all ports as before.
please make a new topic about a different issue.Speaking of network interface, what is happening with the network interface RB435G, stop responding. In version 5.7/2.36 and 2.37. Upgrade to 5.9 now I am testing.After v5.10 is installed, you can use all ports again. Suggestion for not using those ports is only in affect until you install v5.10. Then - everything returns to normal and you can use all ports as before.
Looks like something goes fishy with IPSEC/GRE?RB1200
last sysfs file: /sys/devices/plb.0/opb.3/gpio-leds.6/leds/user-led/max_brightness
NIP: 801c74c8 LR: a96491bc CTR: 801c749c
REGS: 9fff3df0 TRAP: 0700 Not tainted (2.6.35-440)
MSR: 00029000 <EE,ME,CE> CR: 42000028 XER: 00000000
TASK = 80301300[0] 'swapper' THREAD: 8031a000
GPR00: 000005c4 9fff3ea0 80301300 00000000 00000004 00000000 2f000000 9d532510
GPR08: 00000001 9d55b620 ac1ffe02 00000001 24000084 10019670 03712904 00002000
GPR16: 00000000 d0014b51 00000001 000000ff fffff9f4 00000003 8029228c 00100100
GPR24: 9d55b6e0 9d433400 00000800 9d8d9898 00000000 9d8d9884 9d55b620 9f3abf00
NIP [801c74c8] skb_pull+0x2c/0x40
LR [a96491bc] 0xa96491bc [ipgre@0xa9649000]
Call Trace:
[9fff3ea0] [00000003] 0x3 (unreliable)
[9fff3ec0] [a963e0e4] register_gre_proto+0xe4/0x148 [gre@0xa963e000]
[9fff3ed0] [801f8468] ip_local_deliver_finish+0x130/0x25c
[9fff3ef0] [801f7df8] ip_rcv_finish+0x138/0x3b8
[9fff3f10] [801d10fc] __netif_receive_skb+0x424/0x474
[9fff3f50] [801d11e0] process_backlog+0x94/0x144
[9fff3f80] [801d1548] net_rx_action+0xa4/0x144
[9fff3fb0] [80038ad0] __do_softirq+0xb8/0x134
[9fff3ff0] [8000de38] call_do_softirq+0x14/0x24
[8031be80] [80003b7c] do_softirq+0x74/0x80
[8031bea0] [80038c14] irq_exit+0x60/0x70
[8031beb0] [8000aac0] timer_interrupt+0xf8/0x118
[8031bed0] [8000eab4] ret_from_except+0x0/0x18
--- Exception: 901 at ppc44x_idle+0x10/0x20
LR = cpu_idle+0x84/0xd4
[8031bf90] [800075ac] cpu_idle+0xd0/0xd4 (unreliable)
[8031bfb0] [800019d8] rest_init+0x64/0x74
[8031bfc0] [802d972c] start_kernel+0x2a4/0x2b8
[8031bff0] [80000060] _start+0x60/0x9c
Instruction dump:
4e800020 80030050 7c691b78 38600000 7f840040 4d9d0020 81690054 7c040050
90090050 7d6b0010 7d6b5910 7d6b00d0 <0f0b0000> 806900b8 7c632214 906900b8
Kernel panic - not syncing: Fatal exception in interrupt
panicSaver: dumping panic to flash
flash: erase 10
flash: prg 10
flash: prg err 0
Rebooting in 1 seconds..
RouterBOOT booter 3.02
RouterBoard 1200
CPU frequency: 666 MHz
Memory size: 512 MiB
NAND size: 64 MiB
[****@RSW] > Oops: Exception in kernel mode, sig: 5 [#1]
RB1200
last sysfs file: /sys/devices/plb.0/opb.3/gpio-leds.6/leds/user-led/max_brightness
NIP: 801c74c8 LR: a964a1bc CTR: 801c749c
REGS: 9fff3df0 TRAP: 0700 Not tainted (2.6.35-440)
MSR: 00029000 <EE,ME,CE> CR: 42000028 XER: 00000000
TASK = 80301300[0] 'swapper' THREAD: 8031a000
GPR00: 000005c4 9fff3ea0 80301300 00000000 00000004 00000000 2f000000 9dbe45b0
GPR08: 00000001 9dabf620 ac1ffe02 00000001 24000084 10019670 03712904 00002000
GPR16: 00000000 d0014b51 00000001 000000ff fffff9f4 00000003 8029228c 00100100
GPR24: 9dabf6e0 9d429800 00000800 9c5fe098 00000000 9c5fe084 9dabf620 9f2fff00
NIP [801c74c8] skb_pull+0x2c/0x40
LR [a964a1bc] 0xa964a1bc [ipgre@0xa964a000]
Call Trace:
[9fff3ea0] [00000003] 0x3 (unreliable)
[9fff3ec0] [a963f0e4] register_gre_proto+0xe4/0x148 [gre@0xa963f000]
[9fff3ed0] [801f8468] ip_local_deliver_finish+0x130/0x25c
[9fff3ef0] [801f7df8] ip_rcv_finish+0x138/0x3b8
[9fff3f10] [801d10fc] __netif_receive_skb+0x424/0x474
[9fff3f50] [801d11e0] process_backlog+0x94/0x144
[9fff3f80] [801d1548] net_rx_action+0xa4/0x144
[9fff3fb0] [80038ad0] __do_softirq+0xb8/0x134
[9fff3ff0] [8000de38] call_do_softirq+0x14/0x24
[8031be80] [80003b7c] do_softirq+0x74/0x80
[8031bea0] [80038c14] irq_exit+0x60/0x70
[8031beb0] [8000aac0] timer_interrupt+0xf8/0x118
[8031bed0] [8000eab4] ret_from_except+0x0/0x18
--- Exception: 901 at ppc44x_idle+0x10/0x20
LR = cpu_idle+0x84/0xd4
[8031bf90] [800075ac] cpu_idle+0xd0/0xd4 (unreliable)
[8031bfb0] [800019d8] rest_init+0x64/0x74
[8031bfc0] [802d972c] start_kernel+0x2a4/0x2b8
[8031bff0] [80000060] _start+0x60/0x9c
Instruction dump:
4e800020 80030050 7c691b78 38600000 7f840040 4d9d0020 81690054 7c040050
90090050 7d6b0010 7d6b5910 7d6b00d0 <0f0b0000> 806900b8 7c632214 906900b8
Kernel panic - not syncing: Fatal exception in interrupt
panicSaver: dumping panic to flash
flash: erase 10
flash: prg 10
flash: prg err 0
Rebooting in 1 seconds..
RouterBOOT booter 3.02
RouterBoard 1200
CPU frequency: 666 MHz
Memory size: 512 MiB
NAND size: 64 MiB
Press any key within 2 seconds to enter setup..
loading kernel from nand... OK
setting up elf image... OK
jumping to kernel code