PrimeGrid
Please visit donation page to help the project cover running costs for this month

Toggle Menu

Join PrimeGrid

Returning Participants

Community

Leader Boards

Results

Other

drummers-lowrise

Advanced search

Message boards : Problems and Help : Genefer v2.05 (cudaGFN) segmentation violation

Author Message
felixonmars
Avatar
Send message
Joined: 24 Apr 13
Posts: 3
ID: 218806
Credit: 9,655,439
RAC: 0
321 LLR Bronze: Earned 10,000 credits (11,272)Cullen LLR Bronze: Earned 10,000 credits (11,134)SoB LLR Bronze: Earned 10,000 credits (40,082)PPS Sieve Amethyst: Earned 1,000,000 credits (1,988,890)GFN Turquoise: Earned 5,000,000 credits (7,594,789)
Message 69556 - Posted: 2 Oct 2013 | 16:21:06 UTC

I'm continuously getting this error for Genefer v2.05 (cudaGFN) WUs:

SIGSEGV: segmentation violation
Stack trace (15 frames):
../../projects/www.primegrid.com/primegrid_genefer_3_1_2_1_2.05_i686-pc-linux-gnu__cudaGFN(boinc_catch_signal+0x1e8)[0x805ce7c]
linux-gate.so.1(__kernel_sigreturn+0x0)[0xf77ed400]
/usr/lib32/libcuda.so(+0x1a704e)[0xf505304e]
/usr/lib32/libcuda.so(+0x177dbc)[0xf5023dbc]
/usr/lib32/libcuda.so(+0x177f3b)[0xf5023f3b]
/usr/lib32/libcuda.so(+0x178507)[0xf5024507]
/usr/lib32/libcuda.so(+0x750e0)[0xf4f210e0]
/usr/lib32/libcuda.so(cuModuleLoadFatBinary+0x5f)[0xf4f07edf]
./libcudart.so.3(+0x2304a)[0xf5c9a04a]
./libcudart.so.3(+0x1833d)[0xf5c8f33d]
./libcudart.so.3(+0x1c80f)[0xf5c9380f]
./libcudart.so.3(+0x1e0ef)[0xf5c950ef]
./libcudart.so.3(+0x1657c)[0xf5c8d57c]
./libcudart.so.3(cudaMalloc+0x56)[0xf5cb4156]
../../projects/www.primegrid.com/primegrid_genefer_3_1_2_1_2.05_i686-pc-linux-gnu__cudaGFN[0x805277c]

Nvidia driver version for libcuda.so: 325.15-1
Kernel: Linux 3.11.3
System is Arch Linux x86_64 with latest boinc stable: 7.0.65.

I can complete PPS (Sieve) v1.39 (cudaPPSsieve) WUs, so it's more likely something wrong in the Genefer program itself.

Thanks for any help!

Related results:
http://www.primegrid.com/result.php?resultid=486718990
http://www.primegrid.com/result.php?resultid=486416016
http://www.primegrid.com/result.php?resultid=487761735
http://www.primegrid.com/result.php?resultid=488075097

Profile Gary Craig
Volunteer tester
Avatar
Send message
Joined: 30 Dec 09
Posts: 3213
ID: 52890
Credit: 1,005,618,748
RAC: 0
Discovered 1 mega prime321 LLR Ruby: Earned 2,000,000 credits (2,893,273)Cullen LLR Ruby: Earned 2,000,000 credits (2,440,687)ESP LLR Turquoise: Earned 5,000,000 credits (5,738,876)Generalized Cullen/Woodall LLR Turquoise: Earned 5,000,000 credits (6,292,626)PPS LLR Turquoise: Earned 5,000,000 credits (9,648,951)PSP LLR Turquoise: Earned 5,000,000 credits (5,653,927)SoB LLR Jade: Earned 10,000,000 credits (10,558,341)SR5 LLR Turquoise: Earned 5,000,000 credits (5,748,705)SGS LLR Ruby: Earned 2,000,000 credits (3,335,713)TRP LLR Jade: Earned 10,000,000 credits (12,602,818)Woodall LLR Ruby: Earned 2,000,000 credits (2,282,622)321 Sieve (suspended) Gold: Earned 500,000 credits (740,566)Cullen/Woodall Sieve (suspended) Emerald: Earned 50,000,000 credits (59,788,598)Generalized Cullen/Woodall Sieve (suspended) Ruby: Earned 2,000,000 credits (2,143,068)PPS Sieve Double Gold: Earned 500,000,000 credits (524,673,938)Sierpinski (ESP/PSP/SoB) Sieve (suspended) Jade: Earned 10,000,000 credits (10,130,821)TRP Sieve (suspended) Jade: Earned 10,000,000 credits (10,074,710)AP 26/27 Sapphire: Earned 20,000,000 credits (43,842,888)GFN Double Silver: Earned 200,000,000 credits (224,648,943)PSA Emerald: Earned 50,000,000 credits (62,378,755)
Message 69559 - Posted: 2 Oct 2013 | 20:40:49 UTC

Based on your kernel version (3.11.3), you're really "bleeding edge"... I think the version of ubuntu that incorporates that kernel, 13.10, has only been available for a few days and is still called "beta".

I'm not pointing fingers; just noting a possible issue. I couldn't say if the root cause is the OS or Genefer. I will say that I'm running that very same version of GeneferCUDA on both ubuntu 12.10 and 13.04 (two different boxes) with no problems. Do you have an older version of the OS to run on? Assuming "no", is anyone else running (or failing!) with the 3.11 kernel?

--Gary

felixonmars
Avatar
Send message
Joined: 24 Apr 13
Posts: 3
ID: 218806
Credit: 9,655,439
RAC: 0
321 LLR Bronze: Earned 10,000 credits (11,272)Cullen LLR Bronze: Earned 10,000 credits (11,134)SoB LLR Bronze: Earned 10,000 credits (40,082)PPS Sieve Amethyst: Earned 1,000,000 credits (1,988,890)GFN Turquoise: Earned 5,000,000 credits (7,594,789)
Message 70209 - Posted: 20 Oct 2013 | 13:29:37 UTC - in response to Message 69559.

Thank you, I tried to downgrade my graphics driver from 325.15 to 304.108 and the problem solved. The 325.15 driver also causes my other BOINC projects (DistrRTgen and Einstein) run much slower, so I guess we should blame nvidia for this :D

felixonmars
Avatar
Send message
Joined: 24 Apr 13
Posts: 3
ID: 218806
Credit: 9,655,439
RAC: 0
321 LLR Bronze: Earned 10,000 credits (11,272)Cullen LLR Bronze: Earned 10,000 credits (11,134)SoB LLR Bronze: Earned 10,000 credits (40,082)PPS Sieve Amethyst: Earned 1,000,000 credits (1,988,890)GFN Turquoise: Earned 5,000,000 credits (7,594,789)
Message 70938 - Posted: 16 Nov 2013 | 9:42:02 UTC

Hi, yet another update for this issue, in case anyone want to know:

After upgrading to the new latest stable driver 331.20, the crash no longer exist.

Message boards : Problems and Help : Genefer v2.05 (cudaGFN) segmentation violation

[Return to PrimeGrid main page]
DNS Powered by DNSEXIT.COM
Copyright © 2005 - 2021 Rytis Slatkevičius (contact) and PrimeGrid community. Server load 0.49, 0.97, 1.27
Generated 6 May 2021 | 21:54:34 UTC