Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DEBUG] Kernel BUG in ocf/metadata/metadata_raw.c:206! with OCF_CLEANING_DEBUG=1 #1620

Open
Deixx opened this issue Mar 10, 2025 · 0 comments
Labels
bug Something isn't working P3-low

Comments

@Deixx
Copy link
Contributor

Deixx commented Mar 10, 2025

Description

When OCF_CLEANING_DEBUG is set to 1, the cleaner thread causes kernel BUG.

Steps to Reproduce

  1. Set OCF_CLEANING_DEBUG=1 in open-cas-linux/ocf/src/cleaning/alru.c
  2. Compile and install OCL
  3. Start cache
  4. Wait for cleaner thread

Logs

[ 2302.255026] ------------[ cut here ]------------
[ 2302.255508] kernel BUG at /root/github/open-cas-linux/modules/cas_cache/src/ocf/metadata/metadata_raw.c:206!
[ 2302.256544] invalid opcode: 0000 [#1] SMP NOPTI
[ 2302.257007] CPU: 10 PID: 76494 Comm: cas_cl_cache1 Tainted: G           O      5.11.16-1.el8.elrepo.x86_64 #1
[ 2302.257948] Hardware name: Red Hat KVM/RHEL-AV, BIOS 1.16.0-4.module_el8+603+e0ca2c01 04/01/2014
[ 2302.258793] RIP: 0010:_raw_ram_access+0x1e/0x20 [cas_cache]
[ 2302.259387] Code: c0 5b 5d 41 5c c3 0f 0b 0f 0b 0f 0b 0f 1f 44 00 00 48 85 f6 74 14 89 d2 48 3b 56 18 73 0c 8b 46 0c 48 0f af c2 48 03 46 40 c3 <0f> 0b 0f 1f 44 00 00 89 f2 48 89 d0 48 3b 57 18 73 06 31 d2 f7 77
[ 2302.261164] RSP: 0018:ff5a3a3ec8767dd0 EFLAGS: 00010246
[ 2302.261673] RAX: ffffffffc08e9370 RBX: ff5a3a3ec878b000 RCX: ff5a3a3ec84f1000
[ 2302.262346] RDX: 00000000002772c0 RSI: ff5a3a3ec84ed3a0 RDI: ff5a3a3ec8239000
[ 2302.263027] RBP: 0000000067ce9cbf R08: 0014e0e5e1577606 R09: ff43b22b9a86d000
[ 2302.263700] R10: 0000000000000001 R11: 0000021801194331 R12: ff5a3a3ec8239000
[ 2302.264369] R13: 00000000002772c0 R14: ff5a3a3ec878b058 R15: ff5a3a3ec8239080
[ 2302.265050] FS:  0000000000000000(0000) GS:ff43b2335fd00000(0000) knlGS:0000000000000000
[ 2302.265811] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2302.266351] CR2: 00007f0106701080 CR3: 00000008f4a3c002 CR4: 0000000000771ee0
[ 2302.267033] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 2302.267777] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 2302.268458] PKRU: 55555554
[ 2302.268721] Call Trace:
[ 2302.268970]  cleaning_alru_perform_cleaning+0x201/0x650 [cas_cache]
[ 2302.269641]  ocf_cleaner_run+0x11c/0x160 [cas_cache]
[ 2302.270126]  _cas_cleaner_thread+0xf4/0x210 [cas_cache]
[ 2302.270719]  ? wait_woken+0x80/0x80
[ 2302.271089]  ? _cas_cleaner_complete+0x20/0x20 [cas_cache]
[ 2302.271626]  kthread+0x116/0x130
[ 2302.271959]  ? kthread_park+0x80/0x80
[ 2302.272315]  ret_from_fork+0x1f/0x30
[ 2302.272679] Modules linked in: cas_cache(O) rfkill intel_rapl_msr intel_rapl_common nfit libnvdimm kvm_intel kvm iTCO_wdt intel_pmc_bxt ppdev iTCO_vendor_support irqbypass virtio_gpu rapl virtio_dma_buf drm_kms_helper i2c_i801 parport_pc pcspkr drm joydev virtio_balloon lpc_ich i2c_smbus parport syscopyarea sysfillrect sysimgblt fb_sys_fops xfs libcrc32c nvme_tcp nvme_fabrics nvme ahci crct10dif_pclmul libahci crc32_pclmul crc32c_intel nvme_core libata t10_pi ghash_clmulni_intel serio_raw virtio_net virtio_blk net_failover failover sunrpc dm_mirror dm_region_hash dm_log dm_mod fuse [last unloaded: cas_cache]
[ 2302.277830] ---[ end trace 899de942041a5303 ]---
[ 2302.278294] RIP: 0010:_raw_ram_access+0x1e/0x20 [cas_cache]
[ 2302.278865] Code: c0 5b 5d 41 5c c3 0f 0b 0f 0b 0f 0b 0f 1f 44 00 00 48 85 f6 74 14 89 d2 48 3b 56 18 73 0c 8b 46 0c 48 0f af c2 48 03 46 40 c3 <0f> 0b 0f 1f 44 00 00 89 f2 48 89 d0 48 3b 57 18 73 06 31 d2 f7 77
[ 2302.280646] RSP: 0018:ff5a3a3ec8767dd0 EFLAGS: 00010246
[ 2302.281157] RAX: ffffffffc08e9370 RBX: ff5a3a3ec878b000 RCX: ff5a3a3ec84f1000
[ 2302.281840] RDX: 00000000002772c0 RSI: ff5a3a3ec84ed3a0 RDI: ff5a3a3ec8239000
[ 2302.282531] RBP: 0000000067ce9cbf R08: 0014e0e5e1577606 R09: ff43b22b9a86d000
[ 2302.283225] R10: 0000000000000001 R11: 0000021801194331 R12: ff5a3a3ec8239000
[ 2302.283924] R13: 00000000002772c0 R14: ff5a3a3ec878b058 R15: ff5a3a3ec8239080
[ 2302.284611] FS:  0000000000000000(0000) GS:ff43b2335fd00000(0000) knlGS:0000000000000000
[ 2302.285384] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2302.285951] CR2: 00007f0106701080 CR3: 00000008f4a3c002 CR4: 0000000000771ee0
[ 2302.286637] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 2302.287317] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 2302.288009] PKRU: 55555554
[ 2302.288274] Kernel panic - not syncing: Fatal exception
[ 2302.288930] Kernel Offset: 0x1be00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 2302.289955] ---[ end Kernel panic - not syncing: Fatal exception ]---

Your Environment

  • OpenCAS version (commit hash or tag): 24.09.0.0932.master d4de219
  • OCF version: a12496320c1852b71ae91b3688e6f1d4697c7fd4 (with OCF_CLEANING_DEBUG=1)
  • Operating System: CentOS Stream 8 (VM)
  • Kernel version: 5.11.16-1.el8.elrepo.x86_64
@Deixx Deixx added bug Something isn't working P3-low labels Mar 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working P3-low
Projects
None yet
Development

No branches or pull requests

1 participant