Fujitsu M10-1 / M10-4 / M10-4S Server: Domains May Panic After 1101 Days of Uptime

(Doc ID 2274620.1)

Last updated on SEPTEMBER 06, 2017

Applies to:

Fujitsu M10-4 - Version All Versions and later
Fujitsu M10-4S - Version All Versions and later
Fujitsu M10-1 - Version All Versions and later
Information in this document applies to any platform.

Symptoms

A bug exists in the Hypervisor included in XCP versions less than XCP2320, which may cause a domain to panic after 1101 days of uptime.

Various types of panic might be evident in the console log, including, "panic: send_mondo_set: timeout".

The "showlogs event" may have sufficient history to check the hypervisor uptime.
Look for "Start Hypervisor".

Example showlogs event output:
.....
May 30 06:20:44 EST 2014 Current PPARs' phase (PPARID 0 POST phase: Start Hypervisor)
<snip>
Jun 04 11:20:29 EST 2017 PPARID 0 GID 00000001 state change (Solaris panicking)
....

The Solaris GNU date command can be used to easily calculate a date 1101 days earlier from the panic date.
Example:
% /usr/gnu/bin/date -d "Jun 04 2017 - 1101 days"
Friday, May 30, 2014 12:00:00 AM UTC

Example Panic stack:
Jun 04 11:20:29 EDT 2017 send mondo timeout [retries: 0x3b9ab8] cpuids: 0x1^M^M
Jun 04 11:20:29 EDT 2017 panic: failed to stop cpu1^M^M
Jun 04 11:20:29 EDT 2017 ^M^M
Jun 04 11:20:29 EDT 2017 ^Mpanic[cpu14]/thread=30122c817a0: send_mondo_set: timeout^M^M
Jun 04 11:20:29 EDT 2017 ^M^M
Jun 04 11:20:29 EDT 2017 000002a10971cda0 unix:send_mondo_set+5a8 (1, 21cbb243851f7f, 1, 3b9ab8, ffff, 1)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0021cbb243851f73 00000300134ec808 0000000000000001 000000000000000e^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 00000000010ca400 000000000346d802 0000000000000000 00000000010ca750^M^M
Jun 04 11:20:29 EDT 2017 000002a10971cea0 unix:xt_some+1d0 (2a10971d280, 2a10971d050, bfff, fffffffffffffff8, 2a10971cf58, 0)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000000000193cb18 000002a10971d050 0000000000004000 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 000000000000bfff 000002a10971d0d0 000002a10971d050 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d150 unix:sfmmu_tlb_range_demap+1f4 (2a10971d728, 2000, 300062ae040, 412ab62, 0, 1)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0000000000000000 000000000000000d 000000000412ab63 000000000198e6c8^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 000002a10a99004d 000002a10a990000 0000000000000001 000002a10971d280^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d300 unix:hat_unload_callback+7c4 (0, 2a10971d518, 0, 2a10971d618, 0, 300062ae040)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000002a10a992000 0000000000000001 0000000000000001 000002a10a9907ff^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000000000000000 00000301238a0460 00000301238a0430 000007003e9c71a0^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d760 genunix:segkp_release_internal+98 (3005fc113f0, 19cb800, 2000, 2a10a990000, 10c2c00, 1914d00)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000000000000000d 0000000000000002 0000000000000000 0000000001914790^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: ffffffffffffffff 0000000000000001 0000000000000000 00000000019cb8c0^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d820 genunix:schedctl_freepage+18 (30196f8d070, 2a10a990000, 30184ecb910, ff1e2000, 0, 1863000)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 00000000019cb808 0000000000000004 0000000000000001 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 00000000ff1e2000 0000000000002000 0000000000002000 00000301112703d0^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d8d0 genunix:schedctl_proc_cleanup+48 (300edd098d0, 0, 38, 19e8c00, 10c2f70, 300002a6000)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 000000004a004002 000000004a004002 00000000db5fffff 00000000db5ffc00^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000030088f30b38 0000030123894820 0000000000000000 00000000019e8f30^M^M
Jun 04 11:20:29 EDT 2017 000002a10971d980 genunix:proc_exit+15c (1, 0, 0, 30088f30b38, 301b2e587d0, a200)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0000000000000000 00000000ff3523fc 00000300edd098d0 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000030122c817a0 0000030123894820 0000000000000000 0000000001865a98^M^M
Jun 04 11:20:29 EDT 2017 000002a10971da30 genunix:exit+8 (1, 0, 0, ffbffd04, 2, 221c8)^M^M
Jun 04 11:20:29 EDT 2017 %l0-3: 0000000000000000 0000000000000000 00000000f3e50000 000000000000f3e5^M^M
Jun 04 11:20:29 EDT 2017 %l4-7: 0000000000000001 0000000000000000 0000000000000000 0000000000000000^M^M
Jun 04 11:20:29 EDT 2017 ^M^M
Jun 04 11:20:29 EDT 2017 syncing file systems... done^M^M

If the panic occurs at or near 1101 days then bug 23193383 has likely been manifested.

Changes

 

Cause

Sign In with your My Oracle Support account

Don't have a My Oracle Support account? Click to get started

My Oracle Support provides customers with access to over a
Million Knowledge Articles and hundreds of Community platforms