Author |
Message |
John Honorary cruncher
Joined: 21 Feb 06 Posts: 2875 ID: 2449 Credit: 2,681,934 RAC: 0
|
A thread to discuss the anonymous platform for tpsieve.
____________
|
|
|
|
CUDA
I checked the option to run two threads concurrently to increase the workload on my GTX295.
http://www.primegrid.com/results.php?hostid=95767&offset=20&show_names=0&state=3&appid=9
<app_info>
    <app>
        <name>pps_sr2sieve</name>
        <user_friendly_name>Proth Prime Search (Sieve)</user_friendly_name>
    </app>
    <file_info>
        <name>primegrid_tpsieve_1.35_windows_intelx86__cuda23.exe</name>
        <executable/>
    </file_info>
    <file_info>
        <name>cudart.dll</name>
    </file_info>
    <app_version>
        <app_name>pps_sr2sieve</app_name>
        <version_num>135</version_num>
        <file_ref>
            <file_name>primegrid_tpsieve_1.35_windows_intelx86__cuda23.exe</file_name>
            <main_program/>
        </file_ref>
        <file_ref>
            <file_name>cudart.dll</file_name>
            <open_name>cudart.dll</open_name>
        </file_ref>
        <plan_class>cuda23</plan_class>
        <avg_ncpus>0.050000</avg_ncpus>
        <max_ncpus>0.050000</max_ncpus>
        <coproc>
            <type>CUDA</type>
            <count>0.500000</count>
        </coproc>
    </app_version>
</app_info>
GTX295:
GPU clock (geometric domain) 710 MHz
GPU clock (shader domain) 1512 MHz
Memory clock 783 MHz (stock: 1025 MHz)
System: Microsoft Windows 7 Professional
One thread per GPU:
GPU load ~86% to ~88%, runtime ~980 sec to ~1020 sec
Two threads per GPU:
GPU load ~95% to ~98%, runtime ~1780 sec to ~1820 sec for both WUs together = ~900 sec per WU.
Conclusion: ~10% less runtime per WU
The same test on a 9800GT (GPU load of 95% with one thread per GPU) neither increased the load nor reduced the runtime, but it did increase the CPU time.
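The arithmetic behind the ~10% figure can be reproduced with a short Python sketch. It uses the midpoints of the runtime ranges quoted above; the function names are made up for illustration and are not part of any BOINC tool.

```python
def per_wu_seconds(wall_seconds: float, concurrent_tasks: int) -> float:
    """Effective time per work unit when several tasks share one GPU."""
    return wall_seconds / concurrent_tasks


def saving_percent(baseline: float, candidate: float) -> float:
    """Relative reduction of candidate versus baseline, in percent."""
    return 100.0 * (baseline - candidate) / baseline


# Midpoints of the ranges reported in the post above.
one_thread = per_wu_seconds((980 + 1020) / 2, 1)    # one WU at a time
two_threads = per_wu_seconds((1780 + 1820) / 2, 2)  # two WUs sharing the GPU

print(f"one thread:  {one_thread:.0f} sec per WU")
print(f"two threads: {two_threads:.0f} sec per WU")
print(f"saving:      {saving_percent(one_thread, two_threads):.1f}%")
```

The same two functions can be fed the 9800GT numbers to confirm that a card already at ~95% load gains nothing from a second thread.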
____________
Member of Crunching Family
http://crunching-family.at/ |
|
|
|
Where can I get an optimized app in order to play with app_info.xml? Or should I just create app_info.xml in the project folder? |
|
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 13513 ID: 53948 Credit: 237,712,514 RAC: 0
|
Where can I get an optimized app in order to play with app_info.xml? Or should I just create app_info.xml in the project folder?
There are no "optimized" apps, like you find at SETI -- pretty much, the stock apps here are as optimized as they get. If someone comes up with a better app, it becomes the official app here very quickly.
The reason for the app_info files is that people are having trouble getting a lot of GPUs to work without app_info. It's an ongoing problem.
____________
My lucky number is 75898524288+1 |
|
|
|
OK, clear :-)
So, should I just create app_info.xml in the project folder? I'm running CUDA on Linux |
|
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 13513 ID: 53948 Credit: 237,712,514 RAC: 0
|
So, should I just create app_info.xml in the project folder? I'm running CUDA on Linux
If you're running a Fermi card (GTX 4xx series or GTX 5xx), yes. If you're running an older nVidia card, no app_info is needed. Also, it appears you need app_info to get any ATI/AMD GPU to work.
I'm guessing you've never done an app_info before, so be warned that the app_info applies to ALL subprojects. The file will need to contain info for every subproject you want to run, including the CPU apps, and you'll need to stay on top of when new versions of the software are released and update your app_info accordingly.
____________
My lucky number is 75898524288+1 |
|
|
|
I'm running a pre-Fermi card (GTX275), but I'd like to play around with app_info. What I do not like about PrimeGrid is that my rig's interface on Linux became very slow in terms of user response (other people report the same issue on Windows). I'm comparing it with GPUGRID, where GPU usage is barely 70%. Ideally, I'd like to have two app_info files and switch between them: one with slower crunching speed for when I'm using my rig, and another for max speed when I'm away and do not really care about response. I think a small bash script will work fine for this.
I also tried to set up PrimeGrid as a backup project for Milky Way (MW) on my rig with ATI cards but, as you mentioned, I got no WUs. What should I do in terms of app_info in order to get WUs? Maybe let's try to make an app_info file for the community. I'm ready to be a tester :-)
Trust me, I know what app_info is about. At least on MW and Collatz :-) So, I'm ready to play :-)
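The two-file idea could look roughly like the sketch below, written in Python rather than bash. The profile file names (app_info.fast.xml, app_info.responsive.xml) and the project directory are hypothetical, and as far as I know the client only reads app_info.xml at startup, so a client restart after switching is assumed to be needed.

```python
import shutil
from pathlib import Path


def switch_profile(project_dir: Path, profile: str) -> Path:
    """Copy app_info.<profile>.xml over app_info.xml in project_dir.

    The profile naming scheme is an assumption made for this sketch;
    BOINC itself only knows about app_info.xml.
    """
    source = project_dir / f"app_info.{profile}.xml"
    if not source.is_file():
        raise FileNotFoundError(f"no such profile file: {source}")
    target = project_dir / "app_info.xml"
    shutil.copyfile(source, target)
    return target


# Example call (hypothetical path, adjust to the local BOINC data directory):
# switch_profile(Path("/var/lib/boinc/projects/www.primegrid.com"), "responsive")
```

Keeping both profiles next to app_info.xml means the active file is always a plain copy, so switching back is just another call with the other profile name.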
____________
|
|
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 13513 ID: 53948 Credit: 237,712,514 RAC: 0
|
Gotcha -- and I feel your pain with regards to display sluggishness. It's the same on my Win7 machine with a GTX280.
What I do is set BOINC to not use the GPU if the computer is in use, with a short 60 second timeout. Then I set BOINC to override that and always run the GPU. I can live with the sluggishness most of the time, but if I'm watching video or playing a game, I set BOINC back to using the GPU according to preferences so it automatically shuts off when I'm using the system.
As you probably know, the "problem" you and I both share is that the app is incredibly efficient, running the GPU at nearly 100% utilization. That often makes the user display very sluggish.
I *think* what you can do is use some flags on the command line, via app_info, to mess up some of the tuning parameters on the tpsieve app. You'll need to check the older posts in the ppsieve testing thread to get some hints as to what parameters you can try. Or maybe Ken or somebody else can chime in with suggestions. I haven't done any testing on the cuda app in several months, so I don't remember which parameters are worth trying.
____________
My lucky number is 75898524288+1 |
|
|
|
And one more thing: as far as I understand, nVidia is way better than ATI in terms of speed, right? Although GFLOPs are higher for ATI... For instance, I saw that here
____________
|
|
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 13513 ID: 53948 Credit: 237,712,514 RAC: 0
|
And one more thing: as far as I understand, nVidia is way better than ATI in terms of speed, right? Although GFLOPs are higher for ATI... For instance, I saw that here
Yes and no. For ppsieve/tpsieve, as of right now the answer is yes. The application for CUDA works far better than the openCL application.
In general, the ATI cards certainly don't have such a large disadvantage, and may even be faster (that changes week by week, almost). But for this application, with the software available today, nVidia is the better choice right now.
____________
My lucky number is 75898524288+1 |
|
|
|
It's a good idea to click "do not use GPU" in BOINC Manager :-)
But I'd also like the PrimeGrid app to run 24/7 and, only when I need my rig, leave something like 10% of the GPU for the operating system (OS). It's literally a pain in the @$$ even during web surfing to wait some time to see the page, even with a 15MB/s channel :-)
I'll try to find that topic a bit later and play with different parameters. But any clues from the project development team are very welcome and appreciated :-) I do not think this should be "CIA top secret" :-)
____________
|
|
|
|
Frankly, my plan was to buy two ATI 6970 cards for the MW rig (it's for crunching only, so I don't mind running Windows there) and one 6970 for my main rig on Linux. Since MW does not have an OpenCL app yet, my plan was to run Collatz until one is available. But now I want to crunch for PrimeGrid and set up PrimeGrid as the backup project on my crunching rig.
So, I wonder whether there is a chance that the OpenCL app will become as fast as the CUDA one, and what I should buy for PrimeGrid: a 6970 or a 580/570?
____________
|
|
|
|
I quickly estimated credits per day (in thousands) based on the link above. Here's what I got:
GTX 580 - 322
GTX 275 - 183 (my card)
GTX 460 - 153
58x0 - 95
Pretty much clear.
____________
|
|
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 13513 ID: 53948 Credit: 237,712,514 RAC: 0
|
I quickly estimated credits per day (in thousands) based on the link above. Here's what I got:
GTX 580 - 322
GTX 275 - 183 (my card)
GTX 460 - 153
58x0 - 95
Pretty much clear.
Clearer than you think:
I think you underestimated the GTX 580. The example you looked at probably was running two WU's simultaneously on the 580s in order to keep the GPU running at 100%. So its throughput and credits are actually double of what you calculated. The same could be true of the 460 in your example.
____________
My lucky number is 75898524288+1 |
|
|
|
Well, the GTX 460 is slower than the GTX 275 (907.2 GFLOPs vs 1010.880 GFLOPs, according to the wiki), and it is slower by more or less the same proportion in that comparison.
So it looks like my comparison is pretty accurate, OR Steve has room for improvement :-)
But I really should consider a 580/570. The 570 price looks to be $349, which looks attractive :-)
____________
|
|
|
STE\/E Volunteer tester
Joined: 10 Aug 05 Posts: 573 ID: 103 Credit: 3,631,269,523 RAC: 0
|
1.2 Mil for a Pair of 580's & about 600k for a pair of 460's >>> http://stats.free-dc.org/stats.php?page=user&proj=pgrid&name=103
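These figures line up reasonably well with the doubling suggested earlier in the thread. A rough Python check, assuming the earlier per-card estimates (322 and 153 thousand credits per day) counted only one WU at a time and that each card actually runs two WUs concurrently:

```python
# Per-card estimates from earlier in the thread, in thousands of credits
# per day, assumed here to count only one WU at a time.
SINGLE_TASK_ESTIMATE = {"GTX 580": 322, "GTX 460": 153}


def pair_estimate(k_per_day: float, tasks_per_gpu: int = 2, gpus: int = 2) -> float:
    """Scale a one-task estimate to several concurrent tasks and several cards."""
    return k_per_day * tasks_per_gpu * gpus


for card, estimate in SINGLE_TASK_ESTIMATE.items():
    print(f"pair of {card}: ~{pair_estimate(estimate):.0f}k credits/day")
```

Under those assumptions a pair of 580s comes out near 1.29 million and a pair of 460s near 0.61 million, close to the 1.2 million and 600k quoted above.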
____________
|
|
|
STE\/E Volunteer tester
Joined: 10 Aug 05 Posts: 573 ID: 103 Credit: 3,631,269,523 RAC: 0
|
Well, the GTX 460 is slower than the GTX 275
So it looks like I really should consider a 580/570
A GTX 460 is faster than a GTX 275 by at least 100k a day ...
____________
|
|
|
|
1.2 Mil for a Pair of 580's & about 600k for a pair of 460's >>> http://stats.free-dc.org/stats.php?page=user&proj=pgrid&name=103
Thanks Steve :-) Looks like I need to get a 580/570
BTW, are you using app_info.xml or not? If so, could you please publish it?
____________
|
|
|
STE\/E Volunteer tester
Joined: 10 Aug 05 Posts: 573 ID: 103 Credit: 3,631,269,523 RAC: 0
|
I've been running this app_info.xml on my 460 & 580 boxes; it's the only way I know of to get work. Cost-wise, a pair of 460's is cheaper than a single 580 and will do just as much work ...
<app_info>
    <app>
        <name>pps_sr2sieve</name>
        <user_friendly_name>Proth Prime Search (Sieve)</user_friendly_name>
    </app>
    <file_info>
        <name>primegrid_tpsieve_1.35_windows_intelx86__cuda23.exe</name>
        <executable/>
    </file_info>
    <app_version>
        <app_name>pps_sr2sieve</app_name>
        <version_num>135</version_num>
        <plan_class>cuda23</plan_class>
        <avg_ncpus>0.05</avg_ncpus>
        <max_ncpus>1</max_ncpus>
        <flops>1.0e11</flops>
        <coproc>
            <type>CUDA</type>
            <count>0.5</count>
        </coproc>
        <cmdline></cmdline>
        <file_ref>
            <file_name>primegrid_tpsieve_1.35_windows_intelx86__cuda23.exe</file_name>
            <main_program/>
        </file_ref>
    </app_version>
</app_info>
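Since a hand-edited app_info.xml is easy to get subtly wrong, a small consistency check can help before restarting the client. The Python sketch below is not a BOINC tool; it only checks two things I am assuming matter: that every app_version names a declared app, and that every file_ref names a declared file_info. The sample is a trimmed copy of the file above.

```python
import xml.etree.ElementTree as ET
from typing import List

SAMPLE = """
<app_info>
  <app><name>pps_sr2sieve</name></app>
  <file_info>
    <name>primegrid_tpsieve_1.35_windows_intelx86__cuda23.exe</name>
    <executable/>
  </file_info>
  <app_version>
    <app_name>pps_sr2sieve</app_name>
    <version_num>135</version_num>
    <file_ref>
      <file_name>primegrid_tpsieve_1.35_windows_intelx86__cuda23.exe</file_name>
      <main_program/>
    </file_ref>
  </app_version>
</app_info>
"""


def check_app_info(xml_text: str) -> List[str]:
    """Return a list of cross-reference problems found in an app_info document."""
    root = ET.fromstring(xml_text)
    declared_apps = {app.findtext("name") for app in root.findall("app")}
    declared_files = {fi.findtext("name") for fi in root.findall("file_info")}
    problems = []
    for version in root.findall("app_version"):
        app_name = version.findtext("app_name")
        if app_name not in declared_apps:
            problems.append(f"app_version refers to undeclared app {app_name!r}")
        for ref in version.findall("file_ref"):
            file_name = ref.findtext("file_name")
            if file_name not in declared_files:
                problems.append(f"file_ref refers to undeclared file {file_name!r}")
    return problems


print(check_app_info(SAMPLE) or "no cross-reference problems found")
```

A check like this only catches naming mismatches; it says nothing about whether the listed files actually exist in the project folder.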
____________
|
|
|
|
Thanks a lot, Steve :-) I believe this should go somewhere in the FAQ
BTW, did you OC your cards, or are they running at stock clocks?
____________
|
|
|
|
Trying to make sense of all the posts dealing with ATI cards, app_info.xml and different platforms. I've noticed that there are several versions of the app_info file being shared and from what I've been able to decipher the app_info file takes control of the PG jobs being sent. However, with so many different posts it was hard to see which one I needed and exactly what folder I needed to put it in. I'm running Windows XP with an ATI Radeon HD 2400XT.
My request is to ask one of you to post the app_info file for the 32bit XP and the commands to get PPS Sieve running along with PPS LLR and SG LLR and what folder to place it in. I think once I get a file that works with these I can figure out how to add or remove jobs. The most important part is getting the GPU up and running to test the PPS Sieve jobs.
Any help would be greatly appreciated - Rick |
|
|
Vato Volunteer tester
Joined: 2 Feb 08 Posts: 785 ID: 18447 Credit: 263,436,450 RAC: 0
|
FYI The ATI app uses OpenCL, which is only currently supported on HD4xxx and upwards.
____________
|
|
|
|
Great :(
Thanks for the info |
|
|
Michael Goetz Volunteer moderator Project administrator
Joined: 21 Jan 10 Posts: 13513 ID: 53948 Credit: 237,712,514 RAC: 0
|
If you're running a Fermi card, you probably should read this thread.
At least a few people seem to suddenly have success running Fermi without app_info.
____________
My lucky number is 75898524288+1 |
|
|
Rytis Volunteer moderator Project administrator
Joined: 22 Jun 05 Posts: 2649 ID: 1 Credit: 26,363,112 RAC: 0
|
I believe it is now possible to run ATI cards without app_info.
____________
|
|
|
rroonnaalldd Volunteer developer Volunteer tester
Joined: 3 Jul 09 Posts: 1213 ID: 42893 Credit: 34,634,263 RAC: 0
|
I believe it is now possible to run ATI cards without app_info.
Would it be possible to set lower values for "<avg_ncpus>0.55</avg_ncpus>" and "<max_ncpus>1</max_ncpus>"?
On a host with a single GPU this value is no problem, but on a dual-GPU host it blocks one CPU core. An idle CPU core would be wasted because the work is done by the GPU only.
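The effect described here can be illustrated with a deliberately simplified model in Python. It assumes the client adds up avg_ncpus over all running GPU tasks and keeps that many whole cores away from CPU work; the real BOINC scheduler is more involved, so treat this only as an illustration of why 0.55 hurts on two GPUs and 0.05 does not.

```python
import math


def cores_withheld(gpu_tasks: int, avg_ncpus: float) -> int:
    """Whole CPU cores kept away from CPU work under the simplified model."""
    return math.floor(gpu_tasks * avg_ncpus)


for label, tasks, ncpus in [
    ("single GPU, avg_ncpus 0.55", 1, 0.55),
    ("dual GPU,   avg_ncpus 0.55", 2, 0.55),
    ("dual GPU,   avg_ncpus 0.05", 2, 0.05),
]:
    print(f"{label}: {cores_withheld(tasks, ncpus)} core(s) withheld")
```

In this model only the dual-GPU case at 0.55 crosses the one-core threshold, which matches the behaviour reported in the post.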
____________
Best wishes. Knowledge is power. by jjwhalen
|
|
|
Rytis Volunteer moderator Project administrator
Joined: 22 Jun 05 Posts: 2649 ID: 1 Credit: 26,363,112 RAC: 0
|
I believe it is now possible to run ATI cards without app_info.
Would it be possible to set lower values for "<avg_ncpus>0.55</avg_ncpus>" and "<max_ncpus>1</max_ncpus>"?
On a host with a single GPU this value is no problem, but on a dual-GPU host it blocks one CPU core. An idle CPU core would be wasted because the work is done by the GPU only.
I changed something, is it better now?
____________
|
|
|
|
My 4 core Vista64 box w/ hd-4890 is crunching away
w/ no app_info.xml file
ppsieve GPU: running (0.31 CPUs + 1.00 ATI GPUs)
4x Sophie CPU WU's
Thanks Rytis
Steve |
|
|
|
I believe it is now possible to run ATI cards without app_info.
Would it be possible to set lower values for "<avg_ncpus>0.55</avg_ncpus>" and "<max_ncpus>1</max_ncpus>"?
On a host with a single GPU this value is no problem, but on a dual-GPU host it blocks one CPU core. An idle CPU core would be wasted because the work is done by the GPU only.
If you like doing PSA work, you could run a single-thread PSA job to use that idle core. Of course, it would be best if boinc used an accurate value, if that hasn't been fixed already... :-)
--Gary |
|
|
|
My 4 core Vista64 box w/ hd-4890 is crunching away
w/ no app_info.xml file
ppsieve GPU: running (0.31 CPUs + 1.00 ATI GPUs)
4x Sophie CPU WU's
Thanks Rytis
Steve
1st WU in, I hit a "Maximum elapsed time exceeded"
http://www.primegrid.com/result.php?resultid=206481258 |
|
|
Scott Brown Volunteer moderator Project administrator Volunteer tester Project scientist
Joined: 17 Oct 05 Posts: 2165 ID: 1178 Credit: 8,777,295,508 RAC: 0
|
I believe it is now possible to run ATI cards without app_info.
Would it be possible to set lower values for "<avg_ncpus>0.55</avg_ncpus>" and "<max_ncpus>1</max_ncpus>"?
On a host with a single GPU this value is no problem, but on a dual-GPU host it blocks one CPU core. An idle CPU core would be wasted because the work is done by the GPU only.
I changed something, is it better now?
The fix works for ATI, but would it be possible also to do this for the NVidia cards/CUDA application?
____________
141941*2^4299438-1 is prime!
|
|
|
|
How do you get this to work with linux without an app_info?
I get the error below, since one of the requirements of the ATI Stream SDK is setting a couple of environment variables (LD_LIBRARY_PATH, ATISTREAMSDKROOT and ATISTREAMSDKSAMPLESROOT). I know the standalone app works.
So where should these variables be set so that BOINC picks them up correctly?
Stderr output
<core_client_version>6.10.56</core_client_version>
<![CDATA[
<message>
process exited with code 106 (0x6a, -150)
</message>
<stderr_txt>
Unrecognized XML in parse_init_data_file: hostid
Skipping: 161880
Skipping: /hostid
Unrecognized XML in parse_init_data_file: starting_elapsed_time
Skipping: 0.000000
Skipping: /starting_elapsed_time
Sieve started: 3329098000000000 <= p < 3329101000000000
Thread 0 starting
Error: Getting Platforms. (clGetPlatformsIDs)
Please (re)install OpenCL as described at
http://developer.amd.com/gpu/ATIStreamSDK/assets/ATI_Stream_SDK_Installation_Notes.pdf
called boinc_finish
</stderr_txt>
]]>
|
|
|
|
I believe it is now possible to run ATI cards without app_info.
WUs are now downloading without app_info.xml too, but there are some wrong estimates of the time / flops required. (Yes, I know that we are talking about integers here, but BOINC talks about the flops required to complete a WU. If you underestimate or overestimate them, you can have a lot of trouble, as aqua@home did in the past, and I think that this is the case here.) |
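To show why a wrong flops figure matters, here is a small Python sketch of how I understand the client derives its time estimate and its time limit: the workunit's rsc_fpops_est and rsc_fpops_bound divided by the app version's flops. The flops value is the one from the app_info.xml posted earlier in the thread; the two workunit numbers are made up for illustration.

```python
def estimated_runtime_seconds(rsc_fpops_est: float, flops: float) -> float:
    """Runtime the client expects for a task, under the assumed formula."""
    return rsc_fpops_est / flops


def max_elapsed_seconds(rsc_fpops_bound: float, flops: float) -> float:
    """Elapsed time after which the task is aborted, under the assumed formula."""
    return rsc_fpops_bound / flops


FLOPS = 1.0e11            # <flops> from the app_info.xml earlier in the thread
RSC_FPOPS_EST = 1.6e14    # hypothetical workunit estimate
RSC_FPOPS_BOUND = 1.6e15  # hypothetical workunit bound

print(f"estimated runtime: {estimated_runtime_seconds(RSC_FPOPS_EST, FLOPS):.0f} sec")
print(f"abort limit:       {max_elapsed_seconds(RSC_FPOPS_BOUND, FLOPS):.0f} sec")
```

If the flops figure is far too high for the card, both numbers shrink, and a slow GPU can run into the abort limit, which would be consistent with the "Maximum elapsed time exceeded" error reported above.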
|
|
|
I believe it is now possible to run ATI cards without app_info.
Would it be possible to set lower values for "<avg_ncpus>0.55</avg_ncpus>" and "<max_ncpus>1</max_ncpus>"?
On a host with a single GPU this value is no problem, but on a dual-GPU host it blocks one CPU core. An idle CPU core would be wasted because the work is done by the GPU only.
I changed something, is it better now?
I have been running an app_info for both ATI and CUDA cards with a setting of 0.05 CPUs, and haven't had any problems. In fact, the CUDA WUs seem to run faster, by a minute or so....
____________
|
|
|
|
How do you get this to work with linux without an app_info?
I get the error below, since one of the requirements of the ATI Stream SDK is setting a couple of environment variables (LD_LIBRARY_PATH, ATISTREAMSDKROOT and ATISTREAMSDKSAMPLESROOT). I know the standalone app works.
So where should these variables be set so that BOINC picks them up correctly?
As an update in case it helps, I just created an entry in the /etc/profile.d directory and fixed the SELinux AVC denial message on my Fedora 14 system. The ATI 5830 took 1600 secs (27 minutes).
Thanks all for getting it to work! |
|
|
Vato Volunteer tester
Joined: 2 Feb 08 Posts: 785 ID: 18447 Credit: 263,436,450 RAC: 0
|
How do you get this to work with linux without an app_info?
I get the error below, since one of the requirements of the ATI Stream SDK is setting a couple of environment variables (LD_LIBRARY_PATH, ATISTREAMSDKROOT and ATISTREAMSDKSAMPLESROOT). I know the standalone app works.
So where should these variables be set so that BOINC picks them up correctly?
I make sure they are set when I launch the BOINC client, and they are then correctly inherited by any apps launched by the BOINC client.
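One way to do that is a small launcher that builds the environment and then starts the client, so every science app inherits it. The Python sketch below only builds the environment; the SDK install path, the lib/x86_64 subdirectory and the `boinc` executable name are assumptions and will differ between systems.

```python
import os
from typing import Dict, Mapping


def boinc_environment(sdk_root: str, base: Mapping[str, str]) -> Dict[str, str]:
    """Return a copy of base with the three ATI Stream SDK variables set."""
    env = dict(base)
    env["ATISTREAMSDKROOT"] = sdk_root
    env["ATISTREAMSDKSAMPLESROOT"] = sdk_root
    # Assumed library location inside the SDK; check the local install.
    lib_dir = os.path.join(sdk_root, "lib", "x86_64")
    existing = env.get("LD_LIBRARY_PATH", "")
    env["LD_LIBRARY_PATH"] = lib_dir + (os.pathsep + existing if existing else "")
    return env


# Example launch (hypothetical paths and executable name):
# import subprocess
# subprocess.Popen(["boinc"],
#                  env=boinc_environment("/opt/ati-stream-sdk", os.environ),
#                  cwd="/var/lib/boinc")
```

The SDK library directory is put first in LD_LIBRARY_PATH so it wins over any other copy already on the path; whatever was there before is kept after it.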
____________
|
|
|
|
....
<flops>1.0e11</flops>
....
What does flops mean?
Is it for the CPU, the GPU, or something else?
Does anyone know what it means?
In my case, the processing time increased with two threads... :( |
|
|
|
Ah...
I'm stupid...
The problem is SLI :( |
|
|
|
I get a computation error on startup for Proth Prime Search (Sieve) V1.35; the WUs seem to stop after a few seconds.
Not too much of a problem, I just abort the WU in question.
The WUs for Proth Prime Search (Sieve) V1.36 appear to run OK.
Is there anything I can do to prevent downloads of Proth Prime Search (Sieve) V1.35 WUs?
I am running a Windows 7, 64 Bit Intel Dual Core machine with a Radeon HD4570 GPU.
Any suggestions and help would be most welcome.
Thanks
____________
|
|
|