[ PCA / OVM ] A virtual Machine is rebooting for an unknown reason with stack containing eAC_IsFamilyPid and/or SEOS_kbl_trigger

(Doc ID 2344642.1)

Last updated on JANUARY 04, 2018

Applies to:

Private Cloud Appliance - Version 2.0.1 and later
Oracle VM - Version 3.0.1 and later
Linux x86-64

Symptoms

 On Oracle Private Cloud Appliance, some virtual machines appear to be crashing randomly.

 When inspecting the core file generated by the virtual machine, one can see stuck threads with stacks that contains SEOS_ or eAC_ at the beginning of the function's name such as the following stack :

BUG: soft lockup - CPU#5 stuck for 22s! [java:28849]
Modules linked in: bridge stp llc tcp_diag inet_diag ovmapi seos(P)(U) nfs lockd fscache auth_rpcgss nfs_acl sunrpc rdma_ucm ib_sdp rdma_cm iw_cm ib_addr ib_ipoib ib_cm ipv6 ib_uverbs ib_umad mlx4_vnic mlx4_
vnic_helper mlx4_ib ib_sa ib_mad ib_core mlx4_core ppdev microcode parport_pc parport xen_netfront pcspkr i2c_piix4 i2c_core ext4 mbcache jbd2 xen_blkfront floppy pata_acpi ata_generic ata_piix dm_mirror dm_
region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
CPU 5
Modules linked in: bridge stp llc tcp_diag inet_diag ovmapi seos(P)(U) nfs lockd fscache auth_rpcgss nfs_acl sunrpc rdma_ucm ib_sdp rdma_cm iw_cm ib_addr ib_ipoib ib_cm ipv6 ib_uverbs ib_umad mlx4_vnic mlx4_
vnic_helper mlx4_ib ib_sa ib_mad ib_core mlx4_core ppdev microcode parport_pc parport xen_netfront pcspkr i2c_piix4 i2c_core ext4 mbcache jbd2 xen_blkfront floppy pata_acpi ata_generic ata_piix dm_mirror dm_
region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]

Pid: 28849, comm: java Tainted: P 2.6.39-400.297.12.el6uek.x86_64 #1 Xen HVM domU
RIP: 0010:[<ffffffffa048c85b>] [<ffffffffa048c85b>] is_descendant+0x4b/0xa0 [seos]
RSP: 0018:ffff881287d69d78 EFLAGS: 00000206
RAX: ffff883479506040 RBX: 0000000000000098 RCX: 0000000000000034
RDX: 0000000000006ff9 RSI: ffffffff817ae140 RDI: 00000000000070b1
RBP: ffff881287d69d88 R08: 2000000000000000 R09: 8f88000000000000
R10: 7c40000000000000 R11: ffff881287d69c58 R12: ffffffff8151908e
R13: ffff881287d69d48 R14: 0000000000000000 R15: ffff881287d69d28
FS: 00007f942eded700(0000) GS:ffff88380fca0000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007f945407ba98 CR3: 0000000eb8c27000 CR4: 00000000001406e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
Process java (pid: 28849, threadinfo ffff881287d68000, task ffff88365aa8c500)
Stack:
00000000000070b1 0000000000000035 ffff881287d69da8 ffffffffa04a8808
ffff882b67668000 00000000000070b1 ffff881287d69db8 ffffffffa04a8885
ffff881287d69df8 ffffffffa04884bd 0000fc0600005aa4 ffff88365a913a80
Call Trace:
[<ffffffffa04a8808>] eAC_IsFamilyPid+0x28/0x90 [seos]
[<ffffffffa04a8885>] eAC_Family_i_am+0x15/0x20 [seos]
[<ffffffffa04884bd>] SEOS_kbl_trigger+0x2d/0xd0 [seos]
[<ffffffffa048857b>] SEOS_kbl_should_change_path+0x1b/0x100 [seos]
[<ffffffffa048b618>] SEOS_execve+0xa8/0x1b0 [seos]
[<ffffffffa04979e4>] SEOS_check_exec+0x1a4/0x250 [seos]
[<ffffffffa0495cbb>] ? ac_pn_alloc+0x1b/0x40 [seos]
[<ffffffffa0497f18>] my_execve+0x1d8/0x240 [seos]
[<ffffffff8151813c>] stub_execve+0x6c/0xc0
Code: 26 00 e8 e9 30 08 e1 44 89 e7 e8 b1 fb ff ff 48 85 c0 74 23 8b 90 94 04 00 00 83 fa 01 7e 18 39 d3 74 4d 3b 90 98 04 00 00 74 35
8> 8b 80 d8 04 00 00 48 85 c0 75 dd 31 d2 48 8b 05 78 93 26 00
Call Trace:
[<ffffffffa04a8808>] eAC_IsFamilyPid+0x28/0x90 [seos]
[<ffffffffa04a8885>] eAC_Family_i_am+0x15/0x20 [seos]
[<ffffffffa04884bd>] SEOS_kbl_trigger+0x2d/0xd0 [seos]
[<ffffffffa048857b>] SEOS_kbl_should_change_path+0x1b/0x100 [seos]
[<ffffffffa048b618>] SEOS_execve+0xa8/0x1b0 [seos]
[<ffffffffa04979e4>] SEOS_check_exec+0x1a4/0x250 [seos]
[<ffffffffa0495cbb>] ? ac_pn_alloc+0x1b/0x40 [seos]
[<ffffffffa0497f18>] my_execve+0x1d8/0x240 [seos]
[<ffffffff8151813c>] stub_execve+0x6c/0xc0
ffff8836b77ac280 ffff8831c5919280
00007f95421b9000 ffff8831c59195d0 ffff8803412aedc8 ffffea00ab883be0
ffff8836bc6117a0 ffffffff8104d766 ffff8836bc6117a0 ffff8831c5919280

 

 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms