zwave-js-ui crashing after a few days online #4098
Comments
I am experiencing the same issue. Currently running |
@kpanchen thanks for your issue, did you also try 9.27.8? Detecting the exact update that broke this is the first step to fixing the issue |
@codemannh are you using pkg as well or another deploy method? |
I’m using the docker container. Not sure if that’s what you mean.
|
No, will try today. Note that it takes a few days before it starts crashing, so I will report when I have a result. 9.27.7 has worked perfectly since I reported the issue and went back to it. Thank you!! |
And I was a bit wrong, 9.27.8 crashed pretty much right away. Went back to 9.27.7. Logs attached. |
I wanted to chime in here to say we have been experiencing the same issue, it seems, since sometime in December, although to be fair our installation is on Home Assistant. Regrettably, we don’t know if that started after a Z-Wave JS UI update, but we do know that we updated the Home Assistant add-on on 10 December 2024, and I believe we updated on that day from 3.17 to 3.18. These are the version numbers of the HA add-on, and I am not certain what those version numbers would correspond to in the case of Z-Wave JS UI. Since then we have been facing Z-Wave JS UI restarts – which we can ascertain are happening because we see them in the log, like so:
However, we know when it is happening, and have pinned it down to two causes in our case:
a) Whenever a device dies. This happens in our network to some devices on the physical periphery of the network. We then have an HA automation which pings them and brings them back to life almost instantaneously, and this has worked decently well for a couple of years. That automation also sends us a notification on our mobile apps. However, we have noticed that since early December, whenever we get one of those notifications, the entire Z-Wave network goes down after a while, and then from the log we can see that Z-Wave JS UI restarted. An HA restart is usually the only way to recover from this, as sometimes it keeps restarting, and even if it starts successfully, the integration with HA does not work until an HA restart.
b) Whenever we open the Z-Wave JS UI interface from within the HA mobile app. There is a good chance that even doing this simple thing causes Z-Wave JS UI to restart, and therefore the same issues as in point a) ensue. This mostly happens from the app; from the browser it seems to work better (just a feeling, cannot be too sure).
In the log, when this is happening, I can see a lot of connection attempts that seem to fail, but I am not qualified enough to understand what that means:
Anyhow, I hope this helps identify the root cause of the problem. |
@AlCalzone any clue about any changes that could cause this starting from https://github.com/zwave-js/node-zwave-js/releases/tag/v14.3.7 ? |
It can only be this change then:
Maybe the MQTT portion is doing something with the event emitter I haven't found during my search? Because I haven't noticed any issues, but I'm also not using MQTT. |
The strange thing is that it doesn't crash immediately but only after some days, so I have no clue. I never got any unexpected close on my side, and I'm using MQTT as well. @kpanchen @codemannh are you using MQTT, or do you have the MQTT gateway disabled? I will release a new version now that should catch more exceptions, so maybe we get some insight into the issue when it happens again. |
These logs show crashes after just a few seconds: |
I have MQTT disabled and go straight into Home Assistant. As one other person noted, I have a couple of devices that are a ways from my controller, and the failure seems to occur when they take a long time to update status or respond to control. Sometimes it happens when I'm not looking directly at it, so without log information it's hard to say whether that is the case all the time. |
Yes, I'm using MQTT with OpenHab. My setup is very straightforward, everything is on the same box: Aeotec Z-Wave Z-Stick 7 ZWA010, Ubuntu latest release / patches, standalone package for zwave-js-ui, Mosquitto, OpenHab binding for MQTT. Everything was rock stable before that release. I'll try a new release and let you know. K. |
Could you guys try with 9.30.0 please and send me a log when the issue happens? We may have some more details about the error now |
Updated my environment to run 9.30 |
Updated to 9.30.1, will report! Thank you very much for all the hard work!! |
Ok LMK :) |
Just a quick update: I don't want to speak too soon, but 9.30.1 has been running fine since I updated 4 days ago. I've attached logs from today just in case you want to see if anything was caught by the exception handling you added. Will keep monitoring. Thank you, |
Unfortunately not so lucky here. My monitor identified that zwavejsui crashed overnight. I still see nothing in the logs written by zwavejsui other than device-related entries, and they just stopped at the time of the crash. |
@AlCalzone In @kpanchen's logs I see many nonce errors. No clue if that could highlight something. @codemannh What's the exact time the application crashed? |
Hi Daniel,
The crash happened at 12:38 AM central US time.
|
@codemannh I have a feeling the issue is related to log rotation. Do you see a log like |
Tried to reply to the email. I was incorrect in the statement I made there. The line "Cleaning up log files" is in the log at 00:44:00. There do not appear to be any issues with the logs, though. |
As a test, could you please try disabling the ZUI log to file in the settings and keep only the zwavejs one enabled? Also, given that you are using Docker, could you share your docker-compose file? |
Mine seems to work fine, I don't see any obvious abnormalities now. I didn't have time to look at the logs; let me know if you need/want me to test something else or provide more information. K. |
@kpanchen thanks for your feedback! If it works maybe the issue is fixed and the issue faced by @codemannh is related to something else |
Just crashed again. Today it was at 11:50AM local time. Is there something specific I should look at to identify the cause? |
Maybe you could find some more information in the Docker logs |
When the container crashes all the logs are gone. That was why I turned on the log to file. I've been actively monitoring the logs since yesterday, not seeing anything unusual. All entries are device related as well as my Uptime Kuma http ping to see if the container is up and running. |
@codemannh I suggest enabling JSON logging in Docker Compose; it could be that the exception for some reason is not picked up, so it will never be available in the ZUI logs. See https://docs.docker.com/engine/logging/configure/#supported-logging-drivers and https://www.squadcast.com/blog/docker-compose-logs#storing-docker-compose-logs |
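For reference, a minimal sketch of what that suggestion could look like in a compose file. The service name, image tag, restart policy and size limits are assumptions for illustration and are not taken from anyone's setup in this thread; only the `logging` keys are the point here.

```yaml
# Hypothetical compose fragment: service name, image tag and limits are assumptions.
services:
  zwave-js-ui:
    image: zwavejs/zwave-js-ui:latest
    restart: unless-stopped
    logging:
      driver: "json-file"   # Docker's default logging driver, set explicitly here
      options:
        max-size: "10m"     # rotate the log file once it reaches 10 MB
        max-file: "5"       # keep at most 5 rotated files per container
```

As long as the container is restarted rather than recreated, `docker compose logs --timestamps zwave-js-ui` should then still show what the process wrote to stdout/stderr just before it went down, including anything that never reached the ZUI log file.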
Checklist
Deploy method
PKG executable
Z-Wave JS UI version
9.29.1
ZwaveJS version
14.3.7
Describe the bug
Any version after 9.27.7, including 9.29.1, starts crashing after a few days of working normally. I attempted to update to versions 9.28.0 and 9.29.0; both versions started crashing after running normally for a couple of days. I returned to version 9.27.7, which worked fine for 4 weeks with no issues. After updating to version 9.29.1, it crashed again after 2 days of running normally.
Logs_20250119.zip
To Reproduce
No particular pattern noted; the updated version just crashes after 2-3 days of running normally.
Expected behavior
New versions work.
Additional context
Had a quick look at the log; unfortunately I'm not sure what to check there. However, from the attached logs: on 2025-01-18 everything worked fine, and the log ended at the end of the day and was archived. I'm not sure at what time on 2025-01-19 it started crashing; for whatever reason the zwave-ui log is missing and starts at 2025-01-19 22:47:14.701. From that log it is evident that the UI was restarted multiple times without any particular reason; it looked like a refresh on the console.
The last crash event happened at 2025-01-19 23:12:58.661, and the UI was stopped manually.
2025-01-19 23:17:49.213 - rollback to version 9.27.7, which has worked fine since. (Marked by keyword "rollback" in the attached logs)
Note: there was a change some time ago where a required pointer to the configuration (or something like this) was missing and the UI was not starting at all; I'm not sure if this is the same issue or something new.
20250119_rollback.zip
No response