Database and ASM hang with error "WARNING:io_getevents timed out 600 sec" due to device saturation (iostat with util=100%)
(Doc ID 2056450.1)
Last updated on APRIL 14, 2022
Applies to:
Oracle Database - Enterprise Edition - Version 11.2.0.4 and laterOracle Database Cloud Schema Service - Version N/A and later
Oracle Database Exadata Cloud Machine - Version N/A and later
Oracle Cloud Infrastructure - Database Service - Version N/A and later
Oracle Database Backup Service - Version N/A and later
IBM: Linux on System z
Symptoms
11g Database and ASM hang,
and both DB & ASM Alert logs start getting "WARNING:io_getevents timed out 600 sec" for next 11 hours,
and several hours later, DB Alert gets error "ORA-00020: maximum number of processes (150) exceeded".
Database was non-responsive and ASM was also not responding during 11 hour hang.
There is one "timed out" device error in "/var/log/messages" for device "dasdbt" during 11 hour hang:
Jul 13 22:01:36 xxx kernel: dasd-eckd.157e58: 0.0.0657: cqr 00000000024cfd18 timed out (30s), 255 retries remaining
OSWatcher data was captured during the 11 hour hang period and all the OSWatcher iostat files
showed timed-out device "dasdbt" was the only device running with util = 100% or total "device saturation" for 11+ hours.
Device: rrqm/s wrqm/s r/s w/s rkB/s wkB/s avgrq-sz avgqu-sz await svctm %util
dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
%util
Percentage of CPU time during which I/O requests were issued to the device (bandwidth utilization for the device).
Device saturation occurs when this value is close to 100%.
Search "dasdbt" (58 hits in 1 file)
osw\oswiostat\xxx_iostat_15.07.13.2200.dat (58 hits)
Line 76: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 2.44 0.00 0.00 122.00
zzz ***Mon Jul 13 22:02:09 EDT 2015
Line 168: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 2.00 0.00 0.00 100.00
Line 260: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 4.00 0.00 0.00 100.00
Line 352: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 4.68 0.00 0.00 117.00
===
Line 5136: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 6.00 0.00 0.00 100.00
Line 5228: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 6.00 0.00 0.00 100.00
Line 5320: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 5.94 0.00 0.00 99.00
zzz ***Mon Jul 13 22:59:30 EDT 2015
Search "dasdbt" (32 hits in 1 file)
osw\oswiostat\xxx_iostat_15.07.14.0900.dat (32 hits)
Line 76: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.16 0.00 0.00 101.00
zzz ***Tue Jul 14 09:00:26 EDT 2015
Line 168: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 15.84 0.00 0.00 99.00
Line 260: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.16 0.00 0.00 101.00
Line 352: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.16 0.00 0.00 101.00
===
Line 2560: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 15.84 0.00 0.00 99.00
Line 2652: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
Line 2744: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
Line 2836: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
Line 2928: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
zzz ***Tue Jul 14 09:32:01 EDT 2015
Result from "lsdasd" command:
Device "dasdbt" ( dasd-eckd.157e58: 0.0.0657 )
dasd-eckd.157e58: 0.0.0657:
0.0.0657 active dasdbt 94:284 ECKD 4096 23033MB 5896620
RDA > ASM > Device List
lrwxrwxrwx 1 root root [0x0] Jul 14 13:59 2015 Jul 14 13:59 2015 ASM0657 -> dasdbt1
Disk Partition Information
From /proc/partitions
major minor #blocks name
===
94 284 23586480 dasdbt
94 285 23586384 dasdbt1
Listing of Devices from /dev
===
brw-rw---- 1 root disk [0x105e1c] Jul 14 13:59 2015 Jul 14 13:59 2015 dasdbt
brw-rw---- 1 grid asmadmin [0x105e1d] Jul 17 12:22 2015 Jul 17 12:22 2015 dasdbt1
/dev/.udev/links/ASM0657:
drwxr-xr-x 2 root root 60 Jul 14 13:59 2015 Jul 14 13:59 2015 .
drwxr-xr-x 702 root root 14040 Jul 14 14:00 2015 Jul 14 14:00 2015 ..
lrwxrwxrwx 1 root root 52 Jul 14 13:59 2015 Jul 14 13:59 2015 b94:285 -> /devices/css0/0.0.004d/0.0.0657/block/dasdbt/dasdbt1
/dev/.udev/links/disk\x2fby-id\x2fccw-0X0657:
drwxr-xr-x 2 root root 60 Jul 14 13:59 2015 Jul 14 13:59 2015 .
drwxr-xr-x 702 root root 14040 Jul 14 14:00 2015 Jul 14 14:00 2015 ..
lrwxrwxrwx 1 root root 44 Jul 14 13:59 2015 Jul 14 13:59 2015 b94:284 -> /devices/css0/0.0.004d/0.0.0657/block/dasdbt
/dev/.udev/links/disk\x2fby-id\x2fccw-0X0657-part1:
drwxr-xr-x 2 root root 60 Jul 14 13:59 2015 Jul 14 13:59 2015 .
drwxr-xr-x 702 root root 14040 Jul 14 14:00 2015 Jul 14 14:00 2015 ..
lrwxrwxrwx 1 root root 52 Jul 14 13:59 2015 Jul 14 13:59 2015 b94:285 -> /devices/css0/0.0.004d/0.0.0657/block/dasdbt/dasdbt1
Jul 13 22:01:36 xxx kernel: dasd-eckd.157e58: 0.0.0657: cqr 00000000024cfd18 timed out (30s), 255 retries remaining
OSWatcher data was captured during the 11 hour hang period and all the OSWatcher iostat files
showed timed-out device "dasdbt" was the only device running with util = 100% or total "device saturation" for 11+ hours.
Device: rrqm/s wrqm/s r/s w/s rkB/s wkB/s avgrq-sz avgqu-sz await svctm %util
dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
%util
Percentage of CPU time during which I/O requests were issued to the device (bandwidth utilization for the device).
Device saturation occurs when this value is close to 100%.
Search "dasdbt" (58 hits in 1 file)
osw\oswiostat\xxx_iostat_15.07.13.2200.dat (58 hits)
Line 76: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 2.44 0.00 0.00 122.00
zzz ***Mon Jul 13 22:02:09 EDT 2015
Line 168: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 2.00 0.00 0.00 100.00
Line 260: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 4.00 0.00 0.00 100.00
Line 352: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 4.68 0.00 0.00 117.00
===
Line 5136: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 6.00 0.00 0.00 100.00
Line 5228: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 6.00 0.00 0.00 100.00
Line 5320: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 5.94 0.00 0.00 99.00
zzz ***Mon Jul 13 22:59:30 EDT 2015
Search "dasdbt" (32 hits in 1 file)
osw\oswiostat\xxx_iostat_15.07.14.0900.dat (32 hits)
Line 76: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.16 0.00 0.00 101.00
zzz ***Tue Jul 14 09:00:26 EDT 2015
Line 168: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 15.84 0.00 0.00 99.00
Line 260: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.16 0.00 0.00 101.00
Line 352: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.16 0.00 0.00 101.00
===
Line 2560: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 15.84 0.00 0.00 99.00
Line 2652: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
Line 2744: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
Line 2836: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
Line 2928: dasdbt 0.00 0.00 0.00 0.00 0.00 0.00 0.00 16.00 0.00 0.00 100.00
zzz ***Tue Jul 14 09:32:01 EDT 2015
Result from "lsdasd" command:
Device "dasdbt" ( dasd-eckd.157e58: 0.0.0657 )
dasd-eckd.157e58: 0.0.0657:
0.0.0657 active dasdbt 94:284 ECKD 4096 23033MB 5896620
RDA > ASM > Device List
lrwxrwxrwx 1 root root [0x0] Jul 14 13:59 2015 Jul 14 13:59 2015 ASM0657 -> dasdbt1
Disk Partition Information
From /proc/partitions
major minor #blocks name
===
94 284 23586480 dasdbt
94 285 23586384 dasdbt1
Listing of Devices from /dev
===
brw-rw---- 1 root disk [0x105e1c] Jul 14 13:59 2015 Jul 14 13:59 2015 dasdbt
brw-rw---- 1 grid asmadmin [0x105e1d] Jul 17 12:22 2015 Jul 17 12:22 2015 dasdbt1
/dev/.udev/links/ASM0657:
drwxr-xr-x 2 root root 60 Jul 14 13:59 2015 Jul 14 13:59 2015 .
drwxr-xr-x 702 root root 14040 Jul 14 14:00 2015 Jul 14 14:00 2015 ..
lrwxrwxrwx 1 root root 52 Jul 14 13:59 2015 Jul 14 13:59 2015 b94:285 -> /devices/css0/0.0.004d/0.0.0657/block/dasdbt/dasdbt1
/dev/.udev/links/disk\x2fby-id\x2fccw-0X0657:
drwxr-xr-x 2 root root 60 Jul 14 13:59 2015 Jul 14 13:59 2015 .
drwxr-xr-x 702 root root 14040 Jul 14 14:00 2015 Jul 14 14:00 2015 ..
lrwxrwxrwx 1 root root 44 Jul 14 13:59 2015 Jul 14 13:59 2015 b94:284 -> /devices/css0/0.0.004d/0.0.0657/block/dasdbt
/dev/.udev/links/disk\x2fby-id\x2fccw-0X0657-part1:
drwxr-xr-x 2 root root 60 Jul 14 13:59 2015 Jul 14 13:59 2015 .
drwxr-xr-x 702 root root 14040 Jul 14 14:00 2015 Jul 14 14:00 2015 ..
lrwxrwxrwx 1 root root 52 Jul 14 13:59 2015 Jul 14 13:59 2015 b94:285 -> /devices/css0/0.0.004d/0.0.0657/block/dasdbt/dasdbt1
Cause
To view full details, sign in with your My Oracle Support account. |
|
Don't have a My Oracle Support account? Click to get started! |
In this Document
Symptoms |
Cause |
Solution |
References |