Sun Fire x4170 M1 Server Errors During Reboot "Kernel BUG at drivers/net/bonding/bonding.h:135" (Doc ID 1409966.1)

Last updated on MAY 27, 2021

Applies to:

Linux OS - Version Oracle Linux 5.6 to Oracle Linux 5.6 [Release OL5U6]
Oracle Cloud Infrastructure - Version N/A and later
Linux x86-64
Linux x86


When the Sun Fire x4170 M1 Server is rebooted, the reboot fails or takes too long, and the following error is displayed:
Kernel BUG at drivers/net/bonding/bonding.h:135

Analysis of the generated vmcore shows that the process running when the problem occurred was

     COMMAND: "ifdown-eth"

The dmesg log extracted from the vmcore shows:


bonding: bond0: Removing slave eth4
----------- [cut here ] --------- [please bite here ] ---------
Kernel BUG at drivers/net/bonding/bonding.h:135
invalid opcode: 0000 [1] SMP
last sysfs file: /class/net/bond0/bonding/slaves
Modules linked in: mptctl ipmi_devintf ipmi_si ipmi_msghandler autofs4 dmpjbod(PU) dmpap(PU) dmpaa(PU) vxspec(PFU) vxio(PFU) vxdmp(PU) lockd sunrpc cpufreq_ondemand acpi_cpufreq freq_table mperf bonding be2iscsi ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp bnx2i cnic ipv6 xfrm_nalgo crypto_api uio cxgb3i cxgb3 libiscsi_tcp libiscsi2 scsi_transport_iscsi2 scsi_transport_iscsi vxportal(PFU) fdd(PFU) vxfs(PU) dm_mirror video backlight sbs power_meter hwmon i2c_ec dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport joydev tpm_tis tpm igb sg e1000e 8021q i7core_edac edac_mc tpm_bios i2c_i801 lpfc dca scsi_transport_fc i2c_core pcspkr dm_raid45 dm_message dm_region_hash dm_log dm_mod dm_mem_cache megaraid_sas cciss aacraid mptsas mptscsih mptbase ata_piix ahci libata shpchp mpt2sas scsi_transport_sas sd_mod scsi_mod raid1 ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 11381, comm: ifdown-eth Tainted: PF     2.6.18- #1
RIP: 0010:[<ffffffff888eec0b>]  [<ffffffff888eec0b>] :bonding:bond_release+0x62/0x4f1
RSP: 0018:ffff8106730a3e28  EFLAGS: 00010286
RAX: 00000000ffffffff RBX: 00000000000005dc RCX: 0000000000000282
RDX: 00000000ffffffff RSI: ffff81067e498000 RDI: ffff810673700530
RBP: ffff810673700500 R08: 0000000000000000 R09: 0000000000000030
R10: 0000000000000000 R11: ffffffff888f78f8 R12: ffff810673700000
R13: 0000000000000006 R14: ffff81067e498000 R15: ffff8106615836c0
FS:  00002ad1f647af50(0000) GS:ffff810116c9a240(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00000037b7a69370 CR3: 000000066bd4f000 CR4: 00000000000006e0
Process ifdown-eth (pid: 11381, threadinfo ffff8106730a2000, task ffff8106556ec860)
Stack:  00000000000080d0 ffffffff8006456b ffff8106556ec860 00000000000005dc
 ffff81067e498000 ffff810673700500 0000000000000006 0000000000000006
 ffff8106615836c0 ffffffff888f7b54 000000346874652d 0000000000000000
Call Trace:
 [<ffffffff8006456b>] __down_write_nested+0x12/0x92
 [<ffffffff888f7b54>] :bonding:bonding_store_slaves+0x25c/0x2f7
 [<ffffffff8010ff6d>] sysfs_write_file+0xb9/0xe8
 [<ffffffff80016b3b>] vfs_write+0xce/0x174
 [<ffffffff80017388>] sys_write+0x45/0x6e
 [<ffffffff8005d28d>] tracesys+0xd5/0xe0

Code: 0f 0b 68 aa 94 8f 88 c2 87 00 4c 8b 6d 08 31 c0 eb 0c 4d 39
RIP  [<ffffffff888eec0b>] :bonding:bond_release+0x62/0x4f1
 RSP <ffff8106730a3e28>

The call stack from the same vmcore:

PID: 11381  TASK: ffff8106556ec860  CPU: 2   COMMAND: "ifdown-eth"
 #0 [ffff8106730a3b80] crash_kexec at ffffffff800af84d
 #1 [ffff8106730a3c08] bond_release at ffffffff888eec0b [bonding]
 #2 [ffff8106730a3c80] die at ffffffff8006c73a
 #3 [ffff8106730a3cb0] do_invalid_op at ffffffff8006ccfa
 #4 [ffff8106730a3cc8] bond_release at ffffffff888eec0b [bonding]
 #5 [ffff8106730a3d20] find_get_page at ffffffff80007691
 #6 [ffff8106730a3d70] error_exit at ffffffff8005dde9
    [exception RIP: bond_release+98]
    RIP: ffffffff888eec0b  RSP: ffff8106730a3e28  RFLAGS: 00010286
    RAX: 00000000ffffffff  RBX: 00000000000005dc  RCX: 0000000000000282
    RDX: 00000000ffffffff  RSI: ffff81067e498000  RDI: ffff810673700530
    RBP: ffff810673700500   R8: 0000000000000000   R9: 0000000000000030
    R10: 0000000000000000  R11: ffffffff888f78f8  R12: ffff810673700000
    R13: 0000000000000006  R14: ffff81067e498000  R15: ffff8106615836c0
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
 #7 [ffff8106730a3e70] bonding_store_slaves at ffffffff888f7b54 [bonding]
 #8 [ffff8106730a3ed0] sysfs_write_file at ffffffff8010ff6d
 #9 [ffff8106730a3f10] vfs_write at ffffffff80016b3b
#10 [ffff8106730a3f40] sys_write at ffffffff80017388
#11 [ffff8106730a3f80] tracesys at ffffffff8005d28d (via system_call)
    RIP: 00000037b7ac6420  RSP: 00007fff790da9b8  RFLAGS: 00000246
    RAX: ffffffffffffffda  RBX: ffffffff8005d28d  RCX: ffffffffffffffff
    RDX: 0000000000000006  RSI: 00002ad1f9a64000  RDI: 0000000000000001
    RBP: 0000000000000006   R8: 00000000ffffffff   R9: 00002ad1f647af50
    R10: 0000000000000022  R11: 0000000000000246  R12: 00000037b7d52780
    R13: 00002ad1f9a64000  R14: 0000000000000006  R15: 0000000000000000
    ORIG_RAX: 0000000000000001  CS: 0033  SS: 002b
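
When comparing several vmcores against this signature, it can help to pull the key fields out of the text mechanically. The following is a minimal sketch, not an Oracle-supplied tool: the helper names `parse_bug` and `parse_frames` and the regular expressions are assumptions written for this illustration, and they are only known to match the excerpt embedded below, which is copied from the output above. The sketch extracts the BUG location, the process name and the faulting symbol, then reads the backtrace frames in reverse to show the path from the write system call on the bonding sysfs file down to bond_release. Note that the oops prints the offset in hexadecimal (+0x62) while the backtrace prints it in decimal (+98).

```python
import re

# Excerpt copied from the dmesg and backtrace output above.
VMCORE_TEXT = """\
Kernel BUG at drivers/net/bonding/bonding.h:135
Pid: 11381, comm: ifdown-eth Tainted: PF     2.6.18- #1
RIP: 0010:[<ffffffff888eec0b>]  [<ffffffff888eec0b>] :bonding:bond_release+0x62/0x4f1
 #7 [ffff8106730a3e70] bonding_store_slaves at ffffffff888f7b54 [bonding]
 #8 [ffff8106730a3ed0] sysfs_write_file at ffffffff8010ff6d
 #9 [ffff8106730a3f10] vfs_write at ffffffff80016b3b
#10 [ffff8106730a3f40] sys_write at ffffffff80017388
#11 [ffff8106730a3f80] tracesys at ffffffff8005d28d (via system_call)
"""

# Hypothetical patterns, tuned only to the line formats shown in this note.
BUG_RE = re.compile(r"Kernel BUG at (\S+):(\d+)")
COMM_RE = re.compile(r"comm: (\S+)")
RIP_RE = re.compile(
    r"RIP: \S+\s+\[<[0-9a-f]+>\]\s+(?::(\w+):)?(\w+)\+0x([0-9a-f]+)/0x[0-9a-f]+"
)
FRAME_RE = re.compile(
    r"^[ \t]*#(\d+)\s+\[[0-9a-f]+\]\s+(\S+) at ([0-9a-f]+)(?:\s+\[(\w+)\])?",
    re.MULTILINE,
)


def parse_bug(text):
    """Return the BUG location, process name and faulting symbol, or None."""
    bug, comm, rip = BUG_RE.search(text), COMM_RE.search(text), RIP_RE.search(text)
    if not (bug and comm and rip):
        return None
    return {
        "file": bug.group(1),
        "line": int(bug.group(2)),
        "comm": comm.group(1),
        "module": rip.group(1),           # None when the symbol is not in a module
        "function": rip.group(2),
        "offset": int(rip.group(3), 16),  # hexadecimal in the oops text
    }


def parse_frames(text):
    """Return backtrace frames as (index, function, module-or-None) tuples."""
    return [
        (int(idx), func, mod or None)
        for idx, func, _addr, mod in FRAME_RE.findall(text)
    ]


if __name__ == "__main__":
    bug = parse_bug(VMCORE_TEXT)
    # Frames are listed innermost first, so reverse them to read the path
    # from the system call down to the faulting function.
    path = [func for _idx, func, _mod in reversed(parse_frames(VMCORE_TEXT))]
    path.append(bug["function"])
    print(bug)
    print(" -> ".join(path))
```

A match on the same file, line, process name and faulting function in another vmcore suggests, but does not prove, that it is the same problem; the full backtrace should still be compared by hand.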


