だ・か・らっ、Dia“l”yだってばさ!

これは『戯れ言』です。また、“Diary”ではなく“Dialy”です。つまり、日記に似て非なるものです。 所謂『日記』ではありません。お間違えの無いようお願いします。(^^;A

Benchmark: x86/CUDA-2.1 v2.9105.511 (beta7) on CUDA 2.1

C 7な9800GTX+でもbenchやってみました。

$ ./dnetc-cuda/dnetc -bench rc5-72

distributed.net client for CUDA 2.1 on Linux Copyright 1997-2009, distributed.net
Please visit http://www.distributed.net/ for up-to-date contest information.
Start the client with '-help' for a list of valid command line options.


dnetc v2.9105-511-CTL-09070517-*dev* for CUDA 2.1 on Linux (Linux 2.6.24-24-generic).
Please provide the *entire* version descriptor when submitting bug reports.
The distributed.net bug report pages are at http://bugs.distributed.net/
Using email address (distributed.net ID) 'sr@hyper.cx'

[Jul 10 10:15:13 UTC] RC5-72: using core #0 (CUDA 1-pipe 64-thd).
[Jul 10 10:15:32 UTC] RC5-72: Benchmark for core #0 (CUDA 1-pipe 64-thd)
                      0.00:00:16.47 [135,351,044 keys/sec]
[Jul 10 10:15:32 UTC] RC5-72: using core #1 (CUDA 1-pipe 128-thd).
[Jul 10 10:15:52 UTC] RC5-72: Benchmark for core #1 (CUDA 1-pipe 128-thd)
                      0.00:00:16.94 [145,057,938 keys/sec]
[Jul 10 10:15:52 UTC] RC5-72: using core #2 (CUDA 1-pipe 256-thd).
[Jul 10 10:16:11 UTC] RC5-72: Benchmark for core #2 (CUDA 1-pipe 256-thd)
                      0.00:00:16.86 [198,321,364 keys/sec]
[Jul 10 10:16:11 UTC] RC5-72: using core #3 (CUDA 2-pipe 64-thd).
[Jul 10 10:16:31 UTC] RC5-72: Benchmark for core #3 (CUDA 2-pipe 64-thd)
                      0.00:00:16.76 [201,744,051 keys/sec]
[Jul 10 10:16:31 UTC] RC5-72: using core #4 (CUDA 2-pipe 128-thd).
[Jul 10 10:16:50 UTC] RC5-72: Benchmark for core #4 (CUDA 2-pipe 128-thd)
                      0.00:00:16.75 [183,950,568 keys/sec]
[Jul 10 10:16:50 UTC] RC5-72: using core #6 (CUDA 4-pipe 64-thd).
[Jul 10 10:17:09 UTC] RC5-72: Benchmark for core #6 (CUDA 4-pipe 64-thd)
                      0.00:00:15.89 [271,058,423 keys/sec]
[Jul 10 10:17:09 UTC] RC5-72: using core #7 (CUDA 4-pipe 128-thd).
[Jul 10 10:17:28 UTC] RC5-72: Benchmark for core #7 (CUDA 4-pipe 128-thd)
                      0.00:00:16.93 [250,003,664 keys/sec]
[Jul 10 10:17:28 UTC] RC5-72: using core #9 (CUDA 1-pipe 64-thd busy wait).
[Jul 10 10:17:43 UTC] RC5-72: Benchmark for core #9 (CUDA 1-pipe 64-thd busy wait)
                      0.00:00:12.24 [353,436,972 keys/sec]
[Jul 10 10:17:43 UTC] RC5-72: using core #10 (CUDA 1-pipe 64-thd sleep 100us).
[Jul 10 10:18:02 UTC] RC5-72: Benchmark for core #10 (CUDA 1-pipe 64-thd sleep 100us)
                      0.00:00:16.70 [81,282,145 keys/sec]
[Jul 10 10:18:02 UTC] RC5-72: using core #11 (CUDA 1-pipe 64-thd sleep dynamic).
[Jul 10 10:18:21 UTC] RC5-72: Benchmark for core #11 (CUDA 1-pipe 64-thd sleep dynamic)
                      0.00:00:16.50 [137,084,520 keys/sec]
[Jul 10 10:18:21 UTC] RC5-72 benchmark summary :
                      Default core : #0 (CUDA 1-pipe 64-thd)
                      Fastest core : #9 (CUDA 1-pipe 64-thd busy wait)
[Jul 10 10:18:21 UTC] Core #9 is significantly faster than the default core.
                      The CUDA core selection has been made as a tradeoff between cor ...
                      and responsiveness of the graphical desktop.
                      Please file a bug report along with the output of -cpuinfo
                      only if the the faster core selection does not degrade graphics ...

やはりbeta 6より遅いですね。
当たり前なんですが・・・ orz