My Oracle Support Banner

Fujitsu M10-1 / M10-4 / M10-4S Server: Domains May Panic After 1101 Days of Uptime (Doc ID 2274620.1)

Last updated on SEPTEMBER 06, 2017

Applies to:

Fujitsu M10-4 - Version All Versions and later
Fujitsu M10-4S - Version All Versions and later
Fujitsu M10-1 - Version All Versions and later
Information in this document applies to any platform.

Symptoms

A bug exists in the Hypervisor included in XCP versions less than XCP2320, which may cause a domain to panic after 1101 days of uptime.

Various types of panic might be evident in the console log, including, "panic: send_mondo_set: timeout".

The "showlogs event" may have sufficient history to check the hypervisor uptime.
Look for "Start Hypervisor".

Example showlogs event output:
.....
May 30 06:20:44 EST 2014 Current PPARs' phase (PPARID 0 POST phase: Start Hypervisor)
<snip>
Jun 04 11:20:29 EST 2017 PPARID 0 GID 00000001 state change (Solaris panicking)
....

The Solaris GNU date command can be used to easily calculate a date 1101 days earlier from the panic date.
Example:
% /usr/gnu/bin/date -d "Jun 04 2017 - 1101 days"
Friday, May 30, 2014 12:00:00 AM UTC

Example Panic stack:
Jun 04 11:20:29 EDT 2017 send mondo timeout [retries: 0x3b9ab8] cpuids: 0x1^M^M
Jun 04 11:20:29 EDT 2017 panic: failed to stop cpu1^M^M
Jun 04 11:20:29 EDT 2017 ^M^M
Jun 04 11:20:29 EDT 2017 ^Mpanic[cpu14]/thread=30122c817a0: send_mondo_set: timeout^M^M
Jun 04 11:20:29 EDT 2017 ^M^M
Jun 04 11:20:29 EDT 2017 000002a10971cda0 unix:send_mondo_set+5a8 (1, 21cbb243851f7f, 1, 3b9ab8, ffff, 1)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0021cbb243851f73 00000300134ec808 0000000000000001 000000000000000e^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 00000000010ca400 000000000346d802 0000000000000000 00000000010ca750^M^M
Jun 04 11:20:29 EDT 2017 000002a10971cea0 unix:xt_some+1d0 (2a10971d280, 2a10971d050, bfff, fffffffffffffff8, 2a10971cf58, 0)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000000000193cb18 000002a10971d050 0000000000004000 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 000000000000bfff 000002a10971d0d0 000002a10971d050 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d150 unix:sfmmu_tlb_range_demap+1f4 (2a10971d728, 2000, 300062ae040, 412ab62, 0, 1)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0000000000000000 000000000000000d 000000000412ab63 000000000198e6c8^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 000002a10a99004d 000002a10a990000 0000000000000001 000002a10971d280^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d300 unix:hat_unload_callback+7c4 (0, 2a10971d518, 0, 2a10971d618, 0, 300062ae040)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000002a10a992000 0000000000000001 0000000000000001 000002a10a9907ff^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000000000000000 00000301238a0460 00000301238a0430 000007003e9c71a0^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d760 genunix:segkp_release_internal+98 (3005fc113f0, 19cb800, 2000, 2a10a990000, 10c2c00, 1914d00)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000000000000000d 0000000000000002 0000000000000000 0000000001914790^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: ffffffffffffffff 0000000000000001 0000000000000000 00000000019cb8c0^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d820 genunix:schedctl_freepage+18 (30196f8d070, 2a10a990000, 30184ecb910, ff1e2000, 0, 1863000)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 00000000019cb808 0000000000000004 0000000000000001 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 00000000ff1e2000 0000000000002000 0000000000002000 00000301112703d0^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d8d0 genunix:schedctl_proc_cleanup+48 (300edd098d0, 0, 38, 19e8c00, 10c2f70, 300002a6000)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000000004a004002 000000004a004002 00000000db5fffff 00000000db5ffc00^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000030088f30b38 0000030123894820 0000000000000000 00000000019e8f30^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d980 genunix:proc_exit+15c (1, 0, 0, 30088f30b38, 301b2e587d0, a200)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0000000000000000 00000000ff3523fc 00000300edd098d0 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000030122c817a0 0000030123894820 0000000000000000 0000000001865a98^M^M
Jun 04 11:20:29 EDT 2017 000002a10971da30 genunix:exit+8 (1, 0, 0, ffbffd04, 2, 221c8)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0000000000000000 0000000000000000 00000000f3e50000 000000000000f3e5^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000000000000001 0000000000000000 0000000000000000 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 ^M^M
Jun 04 11:20:29 EDT 2017 syncing file systems... done^M^M

If the panic occurs at or near 1101 days then bug 23193383 has likely been manifested.

Changes

 

Cause

To view full details, sign in with your My Oracle Support account.

Don't have a My Oracle Support account? Click to get started!


My Oracle Support provides customers with access to over a million knowledge articles and a vibrant support community of peers and Oracle experts.