だ・か・らっ、Dia“l”yだってばさ!

これは『戯れ言』です。また、“Diary”ではなく“Dialy”です。つまり、日記に似て非なるものです。 所謂『日記』ではありません。お間違えの無いようお願いします。(^^;A

CUDA 2.1用のclient

[AMD64/CUDA-2.1]  	v2.9105.511 (beta7)   	2009-07-08 

これも出てました。
ってことで、CUDAは2.1に落としてありますので、早速benchmark!

$ ./dnetc -bench rc5-72

distributed.net client for CUDA 2.1 on Linux Copyright 1997-2009, distributed.net
Please visit http://www.distributed.net/ for up-to-date contest information.
Start the client with '-help' for a list of valid command line options.


dnetc v2.9105-511-CTL-09070517-*dev* for CUDA 2.1 on Linux (Linux 2.6.24-24-generic).
Please provide the *entire* version descriptor when submitting bug reports.
The distributed.net bug report pages are at http://bugs.distributed.net/

[Jul 08 11:09:38 UTC] RC5-72: using core #0 (CUDA 1-pipe 64-thd).
[Jul 08 11:09:56 UTC] RC5-72: Benchmark for core #0 (CUDA 1-pipe 64-thd)
                      0.00:00:16.18 [247,694,351 keys/sec]
[Jul 08 11:09:56 UTC] RC5-72: using core #1 (CUDA 1-pipe 128-thd).
[Jul 08 11:10:15 UTC] RC5-72: Benchmark for core #1 (CUDA 1-pipe 128-thd)
                      0.00:00:16.63 [220,072,163 keys/sec]
[Jul 08 11:10:15 UTC] RC5-72: using core #2 (CUDA 1-pipe 256-thd).
[Jul 08 11:10:34 UTC] RC5-72: Benchmark for core #2 (CUDA 1-pipe 256-thd)
                      0.00:00:15.65 [276,395,318 keys/sec]
[Jul 08 11:10:34 UTC] RC5-72: using core #3 (CUDA 2-pipe 64-thd).
[Jul 08 11:10:53 UTC] RC5-72: Benchmark for core #3 (CUDA 2-pipe 64-thd)
                      0.00:00:16.26 [252,964,814 keys/sec]
[Jul 08 11:10:53 UTC] RC5-72: using core #4 (CUDA 2-pipe 128-thd).
[Jul 08 11:11:13 UTC] RC5-72: Benchmark for core #4 (CUDA 2-pipe 128-thd)
                      0.00:00:16.88 [255,588,400 keys/sec]
[Jul 08 11:11:13 UTC] RC5-72: using core #6 (CUDA 4-pipe 64-thd).
[Jul 08 11:11:30 UTC] RC5-72: Benchmark for core #6 (CUDA 4-pipe 64-thd)
                      0.00:00:14.38 [299,956,407 keys/sec]
[Jul 08 11:11:30 UTC] RC5-72: using core #7 (CUDA 4-pipe 128-thd).
[Jul 08 11:11:48 UTC] RC5-72: Benchmark for core #7 (CUDA 4-pipe 128-thd)
                      0.00:00:14.98 [289,344,000 keys/sec]
[Jul 08 11:11:48 UTC] RC5-72: using core #9 (CUDA 1-pipe 64-thd busy wait).
[Jul 08 11:12:02 UTC] RC5-72: Benchmark for core #9 (CUDA 1-pipe 64-thd busy wait)
                      0.00:00:11.78 [375,131,087 keys/sec]
[Jul 08 11:12:02 UTC] RC5-72: using core #10 (CUDA 1-pipe 64-thd sleep 100us).
[Jul 08 11:12:22 UTC] RC5-72: Benchmark for core #10 (CUDA 1-pipe 64-thd sleep 100us)
                      0.00:00:17.02 [141,129,853 keys/sec]
[Jul 08 11:12:22 UTC] RC5-72: using core #11 (CUDA 1-pipe 64-thd sleep dynamic).
[Jul 08 11:12:41 UTC] RC5-72: Benchmark for core #11 (CUDA 1-pipe 64-thd sleep dynamic)
                      0.00:00:16.94 [188,539,787 keys/sec]
[Jul 08 11:12:41 UTC] RC5-72 benchmark summary :
                      Default core : #0 (CUDA 1-pipe 64-thd)
                      Fastest core : #9 (CUDA 1-pipe 64-thd busy wait)
[Jul 08 11:12:41 UTC] Core #9 is significantly faster than the default core.
                      The CUDA core selection has been made as a tradeoff between cor ...
                      and responsiveness of the graphical desktop.
                      Please file a bug report along with the output of -cpuinfo
                      only if the the faster core selection does not degrade graphics ...

beta 6よりはちょっと遅いけど、だいぶマシ。