PrimeGrid
Please visit donation page to help the project cover running costs for this month

Toggle Menu

Join PrimeGrid

Returning Participants

Community

Leader Boards

Results

Other

drummers-lowrise

Advanced search

Message boards : Number crunching : GPU tasks faster when PPS runs

Author Message
Profile BurProject donor
Volunteer tester
Avatar
Send message
Joined: 25 Feb 20
Posts: 332
ID: 1241833
Credit: 22,611,276
RAC: 0
321 LLR Gold: Earned 500,000 credits (538,216)Cullen LLR Amethyst: Earned 1,000,000 credits (1,169,946)ESP LLR Gold: Earned 500,000 credits (636,842)Generalized Cullen/Woodall LLR Silver: Earned 100,000 credits (212,232)PPS LLR Gold: Earned 500,000 credits (883,715)PSP LLR Gold: Earned 500,000 credits (663,928)SoB LLR Silver: Earned 100,000 credits (217,346)SR5 LLR Gold: Earned 500,000 credits (531,229)SGS LLR Amethyst: Earned 1,000,000 credits (1,042,382)TRP LLR Gold: Earned 500,000 credits (561,429)Woodall LLR Gold: Earned 500,000 credits (781,741)321 Sieve Ruby: Earned 2,000,000 credits (2,107,153)PPS Sieve Amethyst: Earned 1,000,000 credits (1,045,010)AP 26/27 Ruby: Earned 2,000,000 credits (2,470,273)GFN Turquoise: Earned 5,000,000 credits (7,129,018)PSA Silver: Earned 100,000 credits (244,815)
Message 145299 - Posted: 17 Nov 2020 | 18:32:27 UTC

I'm running GFN17-mega on a 1660 Super. So far the CPU (i5-4790K) was running Cullen, Woodall or SoB in MT (4 threads). I had a GFN throughput of 200 per 24 h.

Now in preparation for the challenge I switched to PPS-mega and throughput increased to 215 per 24 h.

HT is enabled in the BIOS, BOINC is set to 50% CPU and I assigned cores 0,2,4,6 to LLR2 task. GFN task was assigned all cores.

Why is the GPU task that strongly affected by the choice of subproject? All are using 4 threads. Is it the smaller FFT size of PPS-mega? More importantly, can anything be done to minimize the impact of the CPU task on the GPU?
____________
Primes: 1281979 & 12+8+1979 & 1+2+8+1+9+7+9 & 1^2+2^2+8^2+1^2+9^2+7^2+9^2 & 12*8+19*79 & 12^8-1979 & 1281979 + 4 (cousin prime)

Yves Gallot
Volunteer developer
Project scientist
Send message
Joined: 19 Aug 12
Posts: 644
ID: 164101
Credit: 305,010,093
RAC: 0
GFN Double Silver: Earned 200,000,000 credits (305,010,093)
Message 145305 - Posted: 17 Nov 2020 | 20:18:06 UTC - in response to Message 145299.
Last modified: 17 Nov 2020 | 20:19:08 UTC

If the FFT data size is larger than L3 cache size then the L3 cache is continually cleared.
The GPU driver still has to run (to control execution of OpenCL code) and rather than reading data from L3 cache it must tackle LLR in order to read memory.

Profile BurProject donor
Volunteer tester
Avatar
Send message
Joined: 25 Feb 20
Posts: 332
ID: 1241833
Credit: 22,611,276
RAC: 0
321 LLR Gold: Earned 500,000 credits (538,216)Cullen LLR Amethyst: Earned 1,000,000 credits (1,169,946)ESP LLR Gold: Earned 500,000 credits (636,842)Generalized Cullen/Woodall LLR Silver: Earned 100,000 credits (212,232)PPS LLR Gold: Earned 500,000 credits (883,715)PSP LLR Gold: Earned 500,000 credits (663,928)SoB LLR Silver: Earned 100,000 credits (217,346)SR5 LLR Gold: Earned 500,000 credits (531,229)SGS LLR Amethyst: Earned 1,000,000 credits (1,042,382)TRP LLR Gold: Earned 500,000 credits (561,429)Woodall LLR Gold: Earned 500,000 credits (781,741)321 Sieve Ruby: Earned 2,000,000 credits (2,107,153)PPS Sieve Amethyst: Earned 1,000,000 credits (1,045,010)AP 26/27 Ruby: Earned 2,000,000 credits (2,470,273)GFN Turquoise: Earned 5,000,000 credits (7,129,018)PSA Silver: Earned 100,000 credits (244,815)
Message 145335 - Posted: 18 Nov 2020 | 18:08:02 UTC - in response to Message 145305.

Ok, thanks. The throughput increased to 235 vs 200.

But there's nothing that can be done besides having a larger L3 cache, I guess?
____________
Primes: 1281979 & 12+8+1979 & 1+2+8+1+9+7+9 & 1^2+2^2+8^2+1^2+9^2+7^2+9^2 & 12*8+19*79 & 12^8-1979 & 1281979 + 4 (cousin prime)

Yves Gallot
Volunteer developer
Project scientist
Send message
Joined: 19 Aug 12
Posts: 644
ID: 164101
Credit: 305,010,093
RAC: 0
GFN Double Silver: Earned 200,000,000 credits (305,010,093)
Message 145353 - Posted: 18 Nov 2020 | 21:13:11 UTC - in response to Message 145335.

But there's nothing that can be done besides having a larger L3 cache, I guess?

Yes.
i7-4790K 4 cores, L3 8 MB i5-10600K 6 cores, L3 12 MB i7-10700K 8 cores, L3 16 MB Ryzen 5 3600X/5600X 6 cores, L3 32 MB Ryzen 7 3800X/5800X 8 cores, L3 32 MB

Profile j.sheridanProject donor
Volunteer tester
Send message
Joined: 21 Mar 11
Posts: 737
ID: 91622
Credit: 1,267,016,406
RAC: 0
321 LLR Turquoise: Earned 5,000,000 credits (7,879,763)Cullen LLR Turquoise: Earned 5,000,000 credits (6,828,213)ESP LLR Turquoise: Earned 5,000,000 credits (5,055,512)Generalized Cullen/Woodall LLR Turquoise: Earned 5,000,000 credits (6,457,215)PPS LLR Turquoise: Earned 5,000,000 credits (8,645,607)PSP LLR Turquoise: Earned 5,000,000 credits (6,104,395)SoB LLR Jade: Earned 10,000,000 credits (10,245,712)SR5 LLR Turquoise: Earned 5,000,000 credits (6,730,319)SGS LLR Turquoise: Earned 5,000,000 credits (5,334,462)TRP LLR Jade: Earned 10,000,000 credits (10,009,531)Woodall LLR Turquoise: Earned 5,000,000 credits (6,638,546)321 Sieve Sapphire: Earned 20,000,000 credits (20,019,388)Cullen/Woodall Sieve (suspended) Double Silver: Earned 200,000,000 credits (265,102,350)PPS Sieve Double Gold: Earned 500,000,000 credits (546,715,203)TRP Sieve (suspended) Turquoise: Earned 5,000,000 credits (5,801,812)AP 26/27 Emerald: Earned 50,000,000 credits (61,259,536)GFN Double Silver: Earned 200,000,000 credits (249,675,265)
Message 145355 - Posted: 18 Nov 2020 | 21:37:46 UTC - in response to Message 145353.

But there's nothing that can be done besides having a larger L3 cache, I guess?

Yes.
i7-4790K 4 cores, L3 8 MB i5-10600K 6 cores, L3 12 MB i7-10700K 8 cores, L3 16 MB Ryzen 5 3600X/5600X 6 cores, L3 32 MB Ryzen 7 3800X/5800X 8 cores, L3 32 MB


your ryzen numbers aren't right from a performance perspective:

3600X - L3 2x16MB
5600X - L3 32 MB
3800X - L3 2x16MB
5800X - L3 32 MB

Profile BurProject donor
Volunteer tester
Avatar
Send message
Joined: 25 Feb 20
Posts: 332
ID: 1241833
Credit: 22,611,276
RAC: 0
321 LLR Gold: Earned 500,000 credits (538,216)Cullen LLR Amethyst: Earned 1,000,000 credits (1,169,946)ESP LLR Gold: Earned 500,000 credits (636,842)Generalized Cullen/Woodall LLR Silver: Earned 100,000 credits (212,232)PPS LLR Gold: Earned 500,000 credits (883,715)PSP LLR Gold: Earned 500,000 credits (663,928)SoB LLR Silver: Earned 100,000 credits (217,346)SR5 LLR Gold: Earned 500,000 credits (531,229)SGS LLR Amethyst: Earned 1,000,000 credits (1,042,382)TRP LLR Gold: Earned 500,000 credits (561,429)Woodall LLR Gold: Earned 500,000 credits (781,741)321 Sieve Ruby: Earned 2,000,000 credits (2,107,153)PPS Sieve Amethyst: Earned 1,000,000 credits (1,045,010)AP 26/27 Ruby: Earned 2,000,000 credits (2,470,273)GFN Turquoise: Earned 5,000,000 credits (7,129,018)PSA Silver: Earned 100,000 credits (244,815)
Message 145391 - Posted: 19 Nov 2020 | 17:48:00 UTC

Unfortunately it seems the i7-4790K is about the fastest you can go with the LGA 1150 socket. And I don't feel like replacing the mainboard...
____________
Primes: 1281979 & 12+8+1979 & 1+2+8+1+9+7+9 & 1^2+2^2+8^2+1^2+9^2+7^2+9^2 & 12*8+19*79 & 12^8-1979 & 1281979 + 4 (cousin prime)

Post to thread

Message boards : Number crunching : GPU tasks faster when PPS runs

[Return to PrimeGrid main page]
DNS Powered by DNSEXIT.COM
Copyright © 2005 - 2023 Rytis Slatkevičius (contact) and PrimeGrid community. Server load 0.00, 0.00, 0.00
Generated 23 Mar 2023 | 15:26:36 UTC