Message boards :
Number crunching :
OpenCL versus Stream/CAL on distributed.net / Moo client.
Message board moderation
Author | Message |
---|---|
Send message Joined: 4 May 11 Posts: 27 Credit: 112,091,698 RAC: 0 |
There is an upcoming distributed.net OpenCL client in the pipe to support HD7k GPUs. I tested it on my HD6950 (800 MHz, but with reactivated shaders like on HD6970) with 12.6 drivers and I have one good news and one bad news. The good one : performances are identical with Stream/CAL client and with OpenCL one :) Here are the numbers with Stream/CAL client : [Oct 14 21:36:18 UTC] RC5-72: Summary: 6 packets (332.00 stats units) And then with OpenCL one : [Oct 14 21:52:08 UTC] RC5-72: Summary: 6 packets (351.00 stats units) The bad news is : like any other OpenCL application that I know about, it requires one free CPU core to feed the GPU or you'll get pathetic performances :/ With OpenCL client, with BOINC client shut down : [Oct 14 21:52:22 UTC] RC5-72: using core #0 (CL ANSI 1-pipe). And with BOINC client running on the CPU (8 SIMAP cores), with OpenCL client still : [Oct 14 21:54:04 UTC] RC5-72: using core #0 (CL ANSI 1-pipe). This time, BOINC client shut down and with Stream/CAL client : [Oct 14 21:58:01 UTC] RC5-72: using core #0 (IL 4-pipe c). And finally, with BOINC client running (8 SIMAP cores),with Stream/CAL again : [Oct 14 21:58:46 UTC] RC5-72: using core #0 (IL 4-pipe c). I guess you'll have to set up the OpenCL application in Moo to run with 1 CPU + x GPUs ... |
Send message Joined: 2 May 11 Posts: 57 Credit: 250,035,598 RAC: 0 |
How did you get it to work with the Boinc Client, I couldn't get it to work ??? Thanks |
Send message Joined: 4 May 11 Posts: 27 Credit: 112,091,698 RAC: 0 |
I didn't run it under BOINC. I did my tests with the distributed.net clients. |
Send message Joined: 3 May 11 Posts: 8 Credit: 15,002,506 RAC: 0 |
Give it to us! Please! In POEM there is the same problem or worse - few cpu cores needed to feed 7770 so one core in Moo! won't be a problem! :) |
Send message Joined: 22 Jun 11 Posts: 2080 Credit: 1,844,066,440 RAC: 94,284 |
Give it to us! Please! DistRTgen too, one cpu core for each gpu, both AMD and Nvidia need a cpu free to feed the gpu. I only have a single unit running on each gpu, so can't say if you run more then one unit at a time if another cpu core would be needed or not. |
Send message Joined: 22 Oct 11 Posts: 4 Credit: 3,145,103 RAC: 42 |
one additional point, the CPU usage for a result is 100% in boincclient the settings for Distributed.net Client v1.03 (cuda31) are set to 0,2C and 1NV can you set this please to 1C and 1NV This is done on POEM, and it will prevent other results getting problems with the time line while not able to use the CPU when a Distributed.net Client v1.03 (cuda31) result is running. edit: extract from POEM in client_state.xml <app_version> <app_name>poemcl</app_name> <version_num>105</version_num> <platform>windows_intelx86</platform> <avg_ncpus>1.000000</avg_ncpus> <max_ncpus>1.000000</max_ncpus> <flops>955157838839.538820</flops> <plan_class>opencl_nvidia_100</plan_class> <api_version>7.1.0</api_version> <file_ref> <file_name>poemcl_1.5_windows_intelx86__opencl_nvidia_100</file_name> <main_program/> </file_ref> <coproc> <type>NVIDIA</type> <count>1.000000</count> </coproc> <gpu_ram>268435456.000000</gpu_ram> </app_version> and now I found the settings for Moo! Wrapper <app_version> <app_name>dnetc</app_name> <version_num>103</version_num> <platform>windows_intelx86</platform> <avg_ncpus>0.200000</avg_ncpus> <max_ncpus>0.200000</max_ncpus> <flops>117464525973.684710</flops> <plan_class>cuda31</plan_class> <api_version>6.13.12</api_version> <file_ref> <file_name>dnetc_wrapper_1.3_windows_intelx86__cuda31.exe</file_name> <main_program/> </file_ref> <file_ref> <file_name>dnetc518-win32-x86-cuda31.exe</file_name> <copy_file/> </file_ref> <file_ref> <file_name>dnetc-gpu-1.3.ini</file_name> <open_name>dnetc.ini</open_name> <copy_file/> </file_ref> <file_ref> <file_name>job-cuda31-1.00.xml</file_name> <open_name>job.xml</open_name> <copy_file/> </file_ref> <file_ref> <file_name>cudart32_31_9.dll</file_name> <copy_file/> </file_ref> <coproc> <type>NVIDIA</type> <count>1.000000</count> </coproc> <gpu_ram>33554432.000000</gpu_ram> </app_version> Matthias |
Send message Joined: 22 Jun 11 Posts: 2080 Credit: 1,844,066,440 RAC: 94,284 |
one additional point, the CPU usage for a result is 100% Boinc is a funky bird sometimes, although they use the same software most projects do not talk among themselves about how they write their own version of the software they use for crunching. Boinc would like Office, lots of people use it but most use different formulas. I am NOT saying they don't, but most don't on a regular basis. |
Send message Joined: 22 Oct 11 Posts: 4 Credit: 3,145,103 RAC: 42 |
one additional point, the CPU usage for a result is 100% Thats right ;-) Could change the settings locally by app_config.xml Now it could work when Primgrid has finished "high priority working" I'll see Matthias |
Send message Joined: 22 Oct 11 Posts: 4 Credit: 3,145,103 RAC: 42 |
It's working like expected. using Boinc 7.0.52 Matthias |
Send message Joined: 31 Mar 13 Posts: 3 Credit: 326,936 RAC: 0 |
ATI <app_version> <app_name>dnetc</app_name> <version_num>103</version_num> <platform>windows_intelx86</platform> <avg_ncpus>0.050000</avg_ncpus> <max_ncpus>0.050000</max_ncpus> <flops>19368306992.516346</flops> <plan_class>ati14</plan_class> <api_version>6.13.12</api_version> Nvidia <app_version> <app_name>dnetc</app_name> <version_num>103</version_num> <platform>windows_intelx86</platform> <avg_ncpus>0.200000</avg_ncpus> <max_ncpus>0.200000</max_ncpus> <flops>117464525973.684710</flops> <plan_class>cuda31</plan_class> <api_version>6.13.12</api_version> If you compare the client_state.xml files it will probably less CPU intensive on ATI cards than Nvidia cards. JMHO |