Laptop insanity , gpu errors galore

\n studio-striking\n

Message boards : Number crunching : Laptop insanity , gpu errors galore
Message board moderation

To post messages, you must log in.

AuthorMessage
.clair.

Send message
Joined: 2 Dec 15
Posts: 14
Credit: 422,065,673
RAC: 23,488
Message 8439 - Posted: 2 Apr 2023, 20:51:03 UTC

Over the last day or so , unknown to me , my old laptop gpu went mad and started killing work in 12 seconds
648 error workunits by the time I noticed its pending doom
I have stopped it now and am doing a full disk scan just in case its the disk , though a reboot may be all it needs
BUT
Has anyone seen a workunit reissued to the same computer after it got "error while computing"
there are several of them that where reissued to the same delinquent system and it mashed them the second time as well
for instance - https://moowrap.net/workunit.php?wuid=153667905
I know workunits can be reissued to the same system if they get lost in some way
probably a driver crash or the app got in a knot etc
and it was not overheating .
ID: 8439 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Link
Avatar

Send message
Joined: 11 Feb 14
Posts: 109
Credit: 7,625,123
RAC: 5,351
Message 8440 - Posted: 3 Apr 2023, 9:15:59 UTC - in response to Message 8439.  

Has anyone seen a workunit reissued to the same computer after it got "error while computing"

Yes, many times.
ID: 8440 · Rating: 0 · rate: Rate + / Rate - Report as offensive
mikey
Avatar

Send message
Joined: 22 Jun 11
Posts: 2080
Credit: 1,826,336,240
RAC: 828
Message 8441 - Posted: 3 Apr 2023, 18:16:06 UTC - in response to Message 8439.  

Over the last day or so , unknown to me , my old laptop gpu went mad and started killing work in 12 seconds
648 error workunits by the time I noticed its pending doom
I have stopped it now and am doing a full disk scan just in case its the disk , though a reboot may be all it needs
BUT
Has anyone seen a workunit reissued to the same computer after it got "error while computing"
there are several of them that where reissued to the same delinquent system and it mashed them the second time as well
for instance - https://moowrap.net/workunit.php?wuid=153667905
I know workunits can be reissued to the same system if they get lost in some way
probably a driver crash or the app got in a knot etc
and it was not overheating .


A simple reboot may indeed fix it as the error message says:
[Apr 02 19:57:11 UTC] No AMD STREAM compatible devices found
[Apr 02 19:57:11 UTC] Device ID 0 exceed number of detected devices (0), ignored
[Apr 02 19:57:11 UTC] No crunchers to start. Quitting...

It means it lost the gpu someplace
ID: 8441 · Rating: 0 · rate: Rate + / Rate - Report as offensive
.clair.

Send message
Joined: 2 Dec 15
Posts: 14
Credit: 422,065,673
RAC: 23,488
Message 8442 - Posted: 3 Apr 2023, 18:26:54 UTC

After the reboot the gpu was missing , boinc startup messages "no useable gpu`s found" so that gave me a clue
so went looking and found that in windows device manager the gpu was disabled , hey wot!! , how did that happen [well its a computer and they sometimes do that stupid stuff]
so re enable it and after watching the screen go black and flash weird stuff , for far longer than was healthy for me and another reboot , it does now work
So back to crunching .
ID: 8442 · Rating: 0 · rate: Rate + / Rate - Report as offensive
Skillz

Send message
Joined: 12 May 17
Posts: 11
Credit: 1,141,049,439
RAC: 2,328,900
Message 8443 - Posted: 3 Apr 2023, 23:53:49 UTC

I'd blame Windows update.
ID: 8443 · Rating: 0 · rate: Rate + / Rate - Report as offensive

Message boards : Number crunching : Laptop insanity , gpu errors galore


 
Copyright © 2011-2024 Moo! Wrapper Project