PrimeGrid
Please visit donation page to help the project cover running costs for this month

Toggle Menu

Join PrimeGrid

Returning Participants

Community

Leader Boards

Results

Other

drummers-lowrise

Advanced search

Message boards : Proth Prime Search : openclPPSsieveMAC failing with "process got signal 4"

Author Message
Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 75861 - Posted: 24 Apr 2014 | 12:16:50 UTC
Last modified: 24 Apr 2014 | 12:18:26 UTC

Hi

do you know why this happens on many PPS (Sieve) v1.39 (openclPPSsieveMAC) WUs, after a more or less long or short time ? :

<core_client_version>7.2.33</core_client_version> <![CDATA[ <message> process got signal 4 </message> <stderr_txt> Sieve started: 389543304000000000 <= p < 389543313000000000 Thread 0 starting Detected 160 multiprocessors (800 SPUs) on device 0. Device 0 is a AMD Radeon HD 4850. </stderr_txt> ]]>


I also have a bigger number of WUs that are working fine on the same Mac.

Thanks.

Profile Michael GoetzProject donor
Volunteer moderator
Project administrator
Project scientist
Avatar
Send message
Joined: 21 Jan 10
Posts: 12195
ID: 53948
Credit: 168,883,595
RAC: 158,745
The "Shut up already!" badge:  This loud mouth has mansplained on the forums over 10 thousand times!  Sheesh!!!Discovered the World's First GFN-19 prime!!!Discovered 1 mega primeFound 1 prime in the 2018 Tour de PrimesFound 1 prime in the 2019 Tour de Primes321 LLR Ruby: Earned 2,000,000 credits (2,063,182)Cullen LLR Ruby: Earned 2,000,000 credits (2,005,249)ESP LLR Ruby: Earned 2,000,000 credits (2,001,789)Generalized Cullen/Woodall LLR Ruby: Earned 2,000,000 credits (2,115,831)PPS LLR Ruby: Earned 2,000,000 credits (2,768,012)PSP LLR Ruby: Earned 2,000,000 credits (2,632,269)SoB LLR Sapphire: Earned 20,000,000 credits (30,035,643)SR5 LLR Turquoise: Earned 5,000,000 credits (7,691,131)SGS LLR Ruby: Earned 2,000,000 credits (2,011,264)TRP LLR Ruby: Earned 2,000,000 credits (2,433,520)Woodall LLR Ruby: Earned 2,000,000 credits (2,176,414)321 Sieve Gold: Earned 500,000 credits (824,488)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,170,256)Generalized Cullen/Woodall Sieve Turquoise: Earned 5,000,000 credits (5,059,304)PPS Sieve Sapphire: Earned 20,000,000 credits (20,110,788)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,035,522)TRP Sieve (suspended) Ruby: Earned 2,000,000 credits (2,051,121)AP 26/27 Turquoise: Earned 5,000,000 credits (7,086,053)GFN Emerald: Earned 50,000,000 credits (60,578,808)PSA Jade: Earned 10,000,000 credits (10,038,118)
Message 75862 - Posted: 24 Apr 2014 | 12:48:45 UTC - in response to Message 75861.

This is all I could dig up on this error:

http://setiathome.berkeley.edu/forum_thread.php?id=62653

That, in turn, links to this: http://boincfaq.mundayweb.com/index.php?view=283&sessionID=eb1ac27bebc93ba4353ee9d8146776fb.

Bottom line is that signal 4, in this instance, probably indicates a problem with the computer. It's also possible it's the video driver.


____________
Please do not PM me with support questions. Ask on the forums instead. Thank you!

My lucky number is 75898524288+1

Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 75905 - Posted: 25 Apr 2014 | 16:03:06 UTC

Mmmm, so my Mac "sometimes doesn't like PPS"... too bad when it happens after a long time of computation, not a big deal when it crashed right away...

Thanks for the answer anyway.

Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 79584 - Posted: 19 Sep 2014 | 11:27:26 UTC

Gee, now it turns out that I have a lot of Sierpinski Problem ESP/PSP/SoB (Sieve) v1.02 Wus that fail with the same error recently, that are not GPU, so nothing to do with my old ATI...

Also some PPS (Sieve) v1.40 (openclPPSsieveMAC) continue to fail and some of both apps are running OK (here and here).

And I'm collatz GPU WUs without such issues, plus a big number of other CPU projects...

So if the answer is still "it's my computer's fault" is not really logical, don't you think ?

Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 79601 - Posted: 19 Sep 2014 | 20:10:17 UTC

Very weird issue, if you look at this file (http://cl.ly/Xcw2) (let me know if the share doesn't work or if you can't open ods spreadsheets) you'll see that the failure get concentrated in time frames, then all works fine for several hours, then a group of WUs fail together...

Profile Michael GoetzProject donor
Volunteer moderator
Project administrator
Project scientist
Avatar
Send message
Joined: 21 Jan 10
Posts: 12195
ID: 53948
Credit: 168,883,595
RAC: 158,745
The "Shut up already!" badge:  This loud mouth has mansplained on the forums over 10 thousand times!  Sheesh!!!Discovered the World's First GFN-19 prime!!!Discovered 1 mega primeFound 1 prime in the 2018 Tour de PrimesFound 1 prime in the 2019 Tour de Primes321 LLR Ruby: Earned 2,000,000 credits (2,063,182)Cullen LLR Ruby: Earned 2,000,000 credits (2,005,249)ESP LLR Ruby: Earned 2,000,000 credits (2,001,789)Generalized Cullen/Woodall LLR Ruby: Earned 2,000,000 credits (2,115,831)PPS LLR Ruby: Earned 2,000,000 credits (2,768,012)PSP LLR Ruby: Earned 2,000,000 credits (2,632,269)SoB LLR Sapphire: Earned 20,000,000 credits (30,035,643)SR5 LLR Turquoise: Earned 5,000,000 credits (7,691,131)SGS LLR Ruby: Earned 2,000,000 credits (2,011,264)TRP LLR Ruby: Earned 2,000,000 credits (2,433,520)Woodall LLR Ruby: Earned 2,000,000 credits (2,176,414)321 Sieve Gold: Earned 500,000 credits (824,488)Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (4,170,256)Generalized Cullen/Woodall Sieve Turquoise: Earned 5,000,000 credits (5,059,304)PPS Sieve Sapphire: Earned 20,000,000 credits (20,110,788)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Amethyst: Earned 1,000,000 credits (1,035,522)TRP Sieve (suspended) Ruby: Earned 2,000,000 credits (2,051,121)AP 26/27 Turquoise: Earned 5,000,000 credits (7,086,053)GFN Emerald: Earned 50,000,000 credits (60,578,808)PSA Jade: Earned 10,000,000 credits (10,038,118)
Message 79606 - Posted: 19 Sep 2014 | 23:35:26 UTC - in response to Message 79601.

Very weird issue, if you look at this file (http://cl.ly/Xcw2) (let me know if the share doesn't work or if you can't open ods spreadsheets) you'll see that the failure get concentrated in time frames, then all works fine for several hours, then a group of WUs fail together...


That's indicative of an "event" happening that's messing up the computer.

The event could be software (say, an antivirus scan running and deleting necessary files), a user event (perhaps "World of Warcraft" messes up the GPU for crunching), or a hardware failure.

I had the same thing happen a few days ago. I have a 100% super reliable computer (it's sort of new), and suddenly I had a set of tasks terminate with computation errors. A look at the Windows event log showed clearly what was happening -- the hard drive had started to fail. (It ended well, as the replacement drive -- and SSD this time -- arrived today so all should be good now.) The disks were reused from the previous computer, so they're not so new.

If you're on Windows, look in the event log to see if there's anything interesting happening around the time of the failures. If you're on Linux or Mac, there should be an equivalent type of logging you can examine.

____________
Please do not PM me with support questions. Ask on the forums instead. Thank you!

My lucky number is 75898524288+1

Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 79613 - Posted: 20 Sep 2014 | 8:43:44 UTC

Since yesterday 15:30 all the WUs (CPU and GPU) have been crunching with no problem.

I had a look at the Mac console : I can find crash reports for both apps (here and here) but the general event log seems to be limited to the current day, I don't know how to query further in the past.

Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 79647 - Posted: 22 Sep 2014 | 10:10:08 UTC

I had another set of 8 CPU WUs failing at the same time Saturday evening at the same moment, I think it did correspond to a moment when I close the session (but didn't switch the computer off, and my boinc is running as a service on the Mac) and opened it again and then found out they had failed.

Since then nonce has failed again.

Profile [AF>Le_Pommier] Jerome_C2005
Send message
Joined: 5 Mar 08
Posts: 25
ID: 19866
Credit: 45,182,928
RAC: 156,269
321 LLR Bronze: Earned 10,000 credits (17,012)Cullen LLR Bronze: Earned 10,000 credits (33,523)ESP LLR Bronze: Earned 10,000 credits (16,943)Generalized Cullen/Woodall LLR Bronze: Earned 10,000 credits (10,082)PSP LLR Bronze: Earned 10,000 credits (70,981)SoB LLR Bronze: Earned 10,000 credits (98,093)SR5 LLR Bronze: Earned 10,000 credits (10,202)TRP LLR Bronze: Earned 10,000 credits (11,035)Woodall LLR Bronze: Earned 10,000 credits (97,373)PPS Sieve Sapphire: Earned 20,000,000 credits (40,278,655)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Silver: Earned 100,000 credits (411,318)AP 26/27 Bronze: Earned 10,000 credits (44,473)GFN Ruby: Earned 2,000,000 credits (4,077,683)
Message 79656 - Posted: 22 Sep 2014 | 18:52:46 UTC

Another crash list this morning, all 8 CPU + 1 GPU WUs did crash at the same time, after 50 mins of calculation.

22/09/2014 11:33:39,484 WindowServer[237]: disable_update_timeout: UI updates were forcibly disabled by application "Finder" for over 1.00 seconds. Server has re-enabled them.
22/09/2014 11:33:39,644 WindowServer[237]: common_reenable_update: UI updates were finally reenabled by application "Finder" after 1.16 seconds (server forcibly re-enabled them after 1.00 seconds)
22/09/2014 11:33:54,233 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK3_0.pkg>
22/09/2014 11:33:54,233 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK3_1.pkg>
22/09/2014 11:33:57,501 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK3_1_2.pkg>
22/09/2014 11:33:58,192 virusbarriers[325]: Task timeout: </usr/bin/gzip -d Payload.gz>
22/09/2014 11:34:01,951 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK3_1_3.pkg>
22/09/2014 11:34:04,878 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK3_2.pkg>
22/09/2014 11:34:08,008 ReportCrash[60833]: Saved crash report for primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin[51745] version ??? to /Library/Logs/DiagnosticReports/primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin_2014-09-22-113407-1_iMac-Famile-Cadet.crash
22/09/2014 11:34:08,129 ReportCrash[60833]: Saved crash report for primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin[51960] version ??? to /Library/Logs/DiagnosticReports/primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin_2014-09-22-113407_iMac-Famile-Cadet.crash
22/09/2014 11:34:08,134 ReportCrash[60833]: Saved crash report for primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin[52156] version ??? to /Library/Logs/DiagnosticReports/primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin_2014-09-22-113407-2_iMac-Famile-Cadet.crash
22/09/2014 11:34:08,482 ReportCrash[60833]: Saved crash report for primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin[52313] version ??? to /Library/Logs/DiagnosticReports/primegrid_sr2sieve_wrapper_1.02_x86_64-apple-darwin_2014-09-22-113408-3_iMac-Famile-Cadet.crash
22/09/2014 11:34:08,482 ReportCrash[60833]: Saved crash report for primegrid_tpsieve_1.40_x86_64-apple-darwin__openclPPSsieveMAC[51920] version ??? to /Library/Logs/DiagnosticReports/primegrid_tpsieve_1.40_x86_64-apple-darwin__openclPPSsieveMAC_2014-09-22-113407_iMac-Famile-Cadet.crash

22/09/2014 11:34:11,044 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK4_0.pkg>
22/09/2014 11:34:11,044 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK4_1.pkg>
22/09/2014 11:34:15,996 virusbarriers[325]: Task timeout: </usr/bin/xar -xf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_4x0UhH/Packages/iPhoneSDK4_2.pkg>
22/09/2014 11:34:16,413 virusbarriers[325]: Task timeout: </usr/bin/gzip -d Payload.gz>
22/09/2014 11:34:20,351 virusbarriers[325]: Task timeout: </bin/pax -rf /var/folders/zz/zyxvpxvq6csfxvn_n0000000000000/T/.vbfolder_pCM0ch/Payload -s ,^//*,./,>
22/09/2014 11:34:22,259 virusbarriers[325]: Task timeout: </usr/bin/gzip -d Payload.gz>

Message boards : Proth Prime Search : openclPPSsieveMAC failing with "process got signal 4"

[Return to PrimeGrid main page]
DNS Powered by DNSEXIT.COM
Copyright © 2005 - 2019 Rytis Slatkevičius (contact) and PrimeGrid community. Server load 1.37, 1.92, 1.84
Generated 24 Mar 2019 | 1:08:55 UTC