Back to my crashing controller. I have another log here. Could these be bus errors?
Here you can see how it starts. The filesystem is Btrfs, and it crashes mainly when I access it via NFS. A coincidence?
Addendum:
On a "somewhat" older page, other people report the same problem, and they also have NFS enabled. For now I will avoid using NFS and see whether it happens again.
https://bugzilla.redhat.com/show_bug.cgi?id=605444
...
Aug 1 12:58:26 server systemd[1]: systemd-udevd.service: Watchdog timeout (limit 3min)!
Aug 1 12:58:33 server kernel: [410020.373221] INFO: task systemd-udevd:1508 blocked for more than 120 seconds.
Aug 1 12:58:33 server kernel: [410020.373384] Not tainted 4.4.0-31-generic #50-Ubuntu
Aug 1 12:58:33 server kernel: [410020.373493] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 1 12:58:33 server kernel: [410020.373646] systemd-udevd D ffff880128e13938 0 1508 1 0x00000004
Aug 1 12:58:33 server kernel: [410020.373661] ffff880128e13938 0000000000000046 ffff880129c11b80 ffff880128383700
Aug 1 12:58:33 server kernel: [410020.373671] ffff880128e14000 ffff880128e13a90 ffff880128e13a88 ffff880128383700
Aug 1 12:58:33 server kernel: [410020.373680] ffff880128383700 ffff880128e13950 ffffffff81829a25 7fffffffffffffff
Aug 1 12:58:33 server kernel: [410020.373689] Call Trace:
Aug 1 12:58:33 server kernel: [410020.373711] [<ffffffff81829a25>] schedule+0x35/0x80
Aug 1 12:58:33 server kernel: [410020.373721] [<ffffffff8182cb45>] schedule_timeout+0x1b5/0x270
Aug 1 12:58:33 server kernel: [410020.373732] [<ffffffff810ab0e4>] ? check_preempt_curr+0x54/0x90
Aug 1 12:58:33 server kernel: [410020.373740] [<ffffffff810ab139>] ? ttwu_do_wakeup+0x19/0xe0
Aug 1 12:58:33 server kernel: [410020.373749] [<ffffffff810ab29d>] ? ttwu_do_activate.constprop.90+0x5d/0x70
Aug 1 12:58:33 server kernel: [410020.373760] [<ffffffff8182a483>] wait_for_completion+0xb3/0x140
Aug 1 12:58:33 server kernel: [410020.373767] [<ffffffff810ac0b0>] ? wake_up_q+0x70/0x70
Aug 1 12:58:33 server kernel: [410020.373777] [<ffffffff8109b23d>] flush_work+0x10d/0x1c0
Aug 1 12:58:33 server kernel: [410020.373785] [<ffffffff810974b0>] ? destroy_worker+0x90/0x90
Aug 1 12:58:33 server kernel: [410020.373794] [<ffffffff8109b425>] __cancel_work_timer+0xa5/0x1d0
Aug 1 12:58:33 server kernel: [410020.373804] [<ffffffff813d2571>] ? exact_lock+0x11/0x20
Aug 1 12:58:33 server kernel: [410020.373815] [<ffffffff815573ff>] ? kobj_lookup+0x10f/0x160
Aug 1 12:58:33 server kernel: [410020.373823] [<ffffffff8109b583>] cancel_delayed_work_sync+0x13/0x20
Aug 1 12:58:33 server kernel: [410020.373831] [<ffffffff813d34b8>] disk_block_events+0x78/0x80
Aug 1 12:58:33 server kernel: [410020.373841] [<ffffffff81249417>] __blkdev_get+0x67/0x460
Aug 1 12:58:33 server kernel: [410020.373849] [<ffffffff81249c7d>] blkdev_get+0x12d/0x340
Aug 1 12:58:33 server kernel: [410020.373859] [<ffffffff81249f62>] blkdev_open+0x82/0xd0
Aug 1 12:58:33 server kernel: [410020.373868] [<ffffffff8120acdf>] do_dentry_open+0x1ff/0x310
Aug 1 12:58:33 server kernel: [410020.373876] [<ffffffff81249ee0>] ? blkdev_get_by_dev+0x50/0x50
Aug 1 12:58:33 server kernel: [410020.373884] [<ffffffff8120be74>] vfs_open+0x54/0x80
Aug 1 12:58:33 server kernel: [410020.373893] [<ffffffff81217adb>] ? may_open+0x5b/0xf0
Aug 1 12:58:33 server kernel: [410020.373902] [<ffffffff8121b657>] path_openat+0x1b7/0x1330
Aug 1 12:58:33 server kernel: [410020.373912] [<ffffffff813fd656>] ? sprintf+0x56/0x70
Aug 1 12:58:33 server kernel: [410020.373923] [<ffffffff8121d9c1>] do_filp_open+0x91/0x100
Aug 1 12:58:33 server kernel: [410020.373932] [<ffffffff8122b256>] ? __alloc_fd+0x46/0x190
Aug 1 12:58:33 server kernel: [410020.373940] [<ffffffff8120c248>] do_sys_open+0x138/0x2a0
Aug 1 12:58:33 server kernel: [410020.373949] [<ffffffff8120c3ce>] SyS_open+0x1e/0x20
Aug 1 12:58:33 server kernel: [410020.373957] [<ffffffff8182db32>] entry_SYSCALL_64_fastpath+0x16/0x71
Aug 1 12:58:33 server kernel: [410020.373992] INFO: task btrfs-transacti:2886 blocked for more than 120 seconds.
Aug 1 12:58:33 server kernel: [410020.374135] Not tainted 4.4.0-31-generic #50-Ubuntu
Aug 1 12:58:33 server kernel: [410020.374241] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 1 12:58:33 server kernel: [410020.374393] btrfs-transacti D ffff8801290479f8 0 2886 2 0x00000000
Aug 1 12:58:33 server kernel: [410020.374404] ffff8801290479f8 00000c3496194000 ffff880129c10dc0 ffff880128385280
Aug 1 12:58:33 server kernel: [410020.374413] ffff880129048000 ffff88012ec96d00 7fffffffffffffff ffffffff8182a220
Aug 1 12:58:33 server kernel: [410020.374422] ffff880129047b58 ffff880129047a10 ffffffff81829a25 0000000000000000
Aug 1 12:58:33 server kernel: [410020.374431] Call Trace:
Aug 1 12:58:33 server kernel: [410020.374441] [<ffffffff8182a220>] ? bit_wait+0x60/0x60
Aug 1 12:58:33 server kernel: [410020.374450] [<ffffffff81829a25>] schedule+0x35/0x80
Aug 1 12:58:33 server kernel: [410020.374457] [<ffffffff8182cb45>] schedule_timeout+0x1b5/0x270
Aug 1 12:58:33 server kernel: [410020.374537] [<ffffffffc0468ff0>] ? extent_write_cache_pages.isra.31.constprop.51+0x370/0x3d0 [btrfs]
Aug 1 12:58:33 server kernel: [410020.374547] [<ffffffff8182a220>] ? bit_wait+0x60/0x60
Aug 1 12:58:33 server kernel: [410020.374556] [<ffffffff81828f54>] io_schedule_timeout+0xa4/0x110
Aug 1 12:58:33 server kernel: [410020.374565] [<ffffffff8182a23b>] bit_wait_io+0x1b/0x70
Aug 1 12:58:33 server kernel: [410020.374573] [<ffffffff81829dcd>] __wait_on_bit+0x5d/0x90
Aug 1 12:58:33 server kernel: [410020.374585] [<ffffffff8118d04b>] wait_on_page_bit+0xcb/0xf0
Aug 1 12:58:33 server kernel: [410020.374595] [<ffffffff810c3ce0>] ? autoremove_wake_function+0x40/0x40
Aug 1 12:58:33 server kernel: [410020.374604] [<ffffffff8118d163>] __filemap_fdatawait_range+0xf3/0x160
Aug 1 12:58:33 server kernel: [410020.374615] [<ffffffff8118d1e4>] filemap_fdatawait_range+0x14/0x30
Aug 1 12:58:33 server kernel: [410020.374680] [<ffffffffc0462e22>] btrfs_wait_ordered_range+0x72/0x110 [btrfs]
Aug 1 12:58:33 server kernel: [410020.374746] [<ffffffffc048c7ae>] btrfs_wait_cache_io+0x5e/0x1f0 [btrfs]
Aug 1 12:58:33 server kernel: [410020.374756] [<ffffffff811ebada>] ? kmem_cache_alloc+0x1ca/0x1f0
Aug 1 12:58:33 server kernel: [410020.374812] [<ffffffffc04328ce>] btrfs_write_dirty_block_groups+0xae/0x2b0 [btrfs]
Aug 1 12:58:33 server kernel: [410020.374872] [<ffffffffc04c052d>] commit_cowonly_roots+0x218/0x2c2 [btrfs]
Aug 1 12:58:33 server kernel: [410020.374932] [<ffffffffc04470f6>] btrfs_commit_transaction+0x576/0xa90 [btrfs]
Aug 1 12:58:33 server kernel: [410020.374992] [<ffffffffc0442229>] transaction_kthread+0x229/0x240 [btrfs]
Aug 1 12:58:33 server kernel: [410020.375051] [<ffffffffc0442000>] ? btrfs_cleanup_transaction+0x570/0x570 [btrfs]
Aug 1 12:58:33 server kernel: [410020.375060] [<ffffffff810a0808>] kthread+0xd8/0xf0
Aug 1 12:58:33 server kernel: [410020.375068] [<ffffffff810a0730>] ? kthread_create_on_node+0x1e0/0x1e0
Aug 1 12:58:33 server kernel: [410020.375077] [<ffffffff8182decf>] ret_from_fork+0x3f/0x70
Aug 1 12:58:33 server kernel: [410020.375084] [<ffffffff810a0730>] ? kthread_create_on_node+0x1e0/0x1e0
Aug 1 12:58:33 server kernel: [410020.375108] INFO: task nfsd:3428 blocked for more than 120 seconds.
Aug 1 12:58:33 server kernel: [410020.375233] Not tainted 4.4.0-31-generic #50-Ubuntu
Aug 1 12:58:33 server kernel: [410020.375339] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 1 12:58:33 server kernel: [410020.375491] nfsd D ffff880067e8fb00 0 3428 2 0x00000000
Aug 1 12:58:33 server kernel: [410020.375501] ffff880067e8fb00 00000000512560cb ffff880129c11b80 ffff880129eeb700
Aug 1 12:58:33 server kernel: [410020.375510] ffff880067e90000 ffff8800a44249f0 ffff8800a4424800 ffff8800a44249f0
Aug 1 12:58:33 server kernel: [410020.375519] 0000000000000001 ffff880067e8fb18 ffffffff81829a25 ffff8800a89e37a0
Aug 1 12:58:33 server kernel: [410020.375528] Call Trace:
Aug 1 12:58:33 server kernel: [410020.375538] [<ffffffff81829a25>] schedule+0x35/0x80
Aug 1 12:58:33 server kernel: [410020.375597] [<ffffffffc0445fb3>] wait_current_trans.isra.21+0xd3/0x120 [btrfs]
Aug 1 12:58:33 server kernel: [410020.375607] [<ffffffff810c3ca0>] ? wake_atomic_t_function+0x60/0x60
Aug 1 12:58:33 server kernel: [410020.375667] [<ffffffffc04478db>] start_transaction+0x2cb/0x4c0 [btrfs]
Aug 1 12:58:33 server kernel: [410020.375727] [<ffffffffc0447ae8>] btrfs_start_transaction+0x18/0x20 [btrfs]
Aug 1 12:58:33 server kernel: [410020.375790] [<ffffffffc045d6f8>] btrfs_sync_file+0x238/0x3b0 [btrfs]
Aug 1 12:58:33 server kernel: [410020.375803] [<ffffffff8124112b>] vfs_fsync_range+0x4b/0xb0
Aug 1 12:58:33 server kernel: [410020.375836] [<ffffffffc05bc04d>] nfsd_vfs_write+0x14d/0x380 [nfsd]
Aug 1 12:58:33 server kernel: [410020.375870] [<ffffffffc05c81c4>] nfsd4_write+0x1a4/0x200 [nfsd]
Aug 1 12:58:33 server kernel: [410020.375903] [<ffffffffc05ca13a>] nfsd4_proc_compound+0x38a/0x660 [nfsd]
Aug 1 12:58:33 server kernel: [410020.375931] [<ffffffffc05b6e78>] nfsd_dispatch+0xb8/0x200 [nfsd]
Aug 1 12:58:33 server kernel: [410020.375986] [<ffffffffc05671cc>] svc_process_common+0x40c/0x650 [sunrpc]
Aug 1 12:58:33 server kernel: [410020.376036] [<ffffffffc0568593>] svc_process+0x103/0x1c0 [sunrpc]
Aug 1 12:58:33 server kernel: [410020.376063] [<ffffffffc05b68cf>] nfsd+0xef/0x160 [nfsd]
Aug 1 12:58:33 server kernel: [410020.376089] [<ffffffffc05b67e0>] ? nfsd_destroy+0x60/0x60 [nfsd]
Aug 1 12:58:33 server kernel: [410020.376097] [<ffffffff810a0808>] kthread+0xd8/0xf0
Aug 1 12:58:33 server kernel: [410020.376105] [<ffffffff810a0730>] ? kthread_create_on_node+0x1e0/0x1e0
Aug 1 12:58:33 server kernel: [410020.376113] [<ffffffff8182decf>] ret_from_fork+0x3f/0x70
Aug 1 12:58:33 server kernel: [410020.376121] [<ffffffff810a0730>] ? kthread_create_on_node+0x1e0/0x1e0
Aug 1 12:58:33 server kernel: [410020.376178] INFO: task smartctl:31187 blocked for more than 120 seconds.
Aug 1 12:58:33 server kernel: [410020.376311] Not tainted 4.4.0-31-generic #50-Ubuntu
Aug 1 12:58:33 server kernel: [410020.376417] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 1 12:58:33 server kernel: [410020.376569] smartctl D ffff880100973c88 0 31187 31186 0x00000000
Aug 1 12:58:33 server kernel: [410020.376579] ffff880100973c88 0000000000000000 ffffffff81e11500 ffff88001aeb6040
Aug 1 12:58:33 server kernel: [410020.376588] ffff880100974000 ffff880035426180 0000000000000000 0000000000000000
Aug 1 12:58:33 server kernel: [410020.376596] ffff880100973cc0 ffff880100973ca0 ffffffff81829a25 ffff880035ace4d8
Aug 1 12:58:33 server kernel: [410020.376605] Call Trace:
Aug 1 12:58:33 server kernel: [410020.376615] [<ffffffff81829a25>] schedule+0x35/0x80
Aug 1 12:58:33 server kernel: [410020.376632] [<ffffffffc0032dcb>] megasas_issue_blocked_cmd+0x11b/0x200 [megaraid_sas]
Aug 1 12:58:33 server kernel: [410020.376642] [<ffffffff810c3ca0>] ? wake_atomic_t_function+0x60/0x60
Aug 1 12:58:33 server kernel: [410020.376657] [<ffffffffc003a10e>] megasas_mgmt_fw_ioctl+0x3de/0xad0 [megaraid_sas]
Aug 1 12:58:33 server kernel: [410020.376675] [<ffffffffc003a9cb>] megasas_mgmt_ioctl_fw.isra.25+0x1cb/0x230 [megaraid_sas]
Aug 1 12:58:33 server kernel: [410020.376690] [<ffffffffc003aca8>] megasas_mgmt_ioctl+0x28/0x40 [megaraid_sas]
Aug 1 12:58:33 server kernel: [410020.376698] [<ffffffff81220c0f>] do_vfs_ioctl+0x29f/0x490
Aug 1 12:58:33 server kernel: [410020.376707] [<ffffffff8121c944>] ? putname+0x54/0x60
Aug 1 12:58:33 server kernel: [410020.376715] [<ffffffff8120c2cf>] ? do_sys_open+0x1bf/0x2a0
Aug 1 12:58:33 server kernel: [410020.376723] [<ffffffff81220e79>] SyS_ioctl+0x79/0x90
Aug 1 12:58:33 server kernel: [410020.376731] [<ffffffff8182db32>] entry_SYSCALL_64_fastpath+0x16/0x71
Aug 1 12:58:38 server kernel: [410026.028995] sd 2:2:0:0: tag#16 megasas: RESET cmd=0 retries=0
Aug 1 12:58:38 server kernel: [410026.029013] megaraid_sas 0000:01:00.0: [ 0]waiting for 17 commands to complete
Aug 1 12:58:43 server kernel: [410031.048670] megaraid_sas 0000:01:00.0: [ 5]waiting for 17 commands to complete
Aug 1 12:58:48 server kernel: [410036.068305] megaraid_sas 0000:01:00.0: [10]waiting for 17 commands to complete
Aug 1 12:58:54 server kernel: [410041.088123] megaraid_sas 0000:01:00.0: [15]waiting for 17 commands to complete
Aug 1 12:58:59 server kernel: [410046.107942] megaraid_sas 0000:01:00.0: [20]waiting for 17 commands to complete
...
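
To get a quicker overview of a log like the one above, the hung-task reports can be condensed into one entry per blocked task, together with the kernel modules that show up in its call trace. The following is only a rough sketch: the function name `summarize_hung_tasks` and both regular expressions are my own assumptions, fitted to the line format of this 4.4 kernel log, and frames marked with `?` (which the kernel flags as unreliable) are skipped.

```python
import re

# Hung-task header, e.g. "INFO: task nfsd:3428 blocked for more than 120 seconds."
HUNG_TASK = re.compile(
    r"INFO: task (?P<name>\S+):(?P<pid>\d+) blocked for more than (?P<secs>\d+) seconds"
)

# Stack frame, e.g. "[<ffffffffc045d6f8>] btrfs_sync_file+0x238/0x3b0 [btrfs]".
# The optional "? " marks a frame the kernel considers unreliable.
FRAME = re.compile(
    r"\[<[0-9a-f]+>\] (?P<unreliable>\? )?(?P<sym>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?: \[(?P<mod>\w+)\])?"
)


def summarize_hung_tasks(log_text):
    """Return one dict per blocked task: name, pid and modules seen in its trace."""
    tasks = []
    current = None
    for line in log_text.splitlines():
        header = HUNG_TASK.search(line)
        if header:
            # A new report starts; following frames belong to this task.
            current = {"name": header["name"], "pid": int(header["pid"]), "modules": []}
            tasks.append(current)
            continue
        frame = FRAME.search(line)
        if frame and current is not None:
            mod = frame["mod"]
            # Keep only reliable frames that belong to a loadable module.
            if mod and not frame["unreliable"] and mod not in current["modules"]:
                current["modules"].append(mod)
    return tasks
```

Fed with the excerpt above, this should list `nfsd` as waiting inside `btrfs` code and `smartctl` as waiting inside `megaraid_sas`, which at least fits the suspicion that everything is queued up behind the controller. It proves nothing about the root cause, it only makes the pattern easier to see.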