CUDAのVer.を2.1にしてみた
$ ./dnetc -bench rc5-72 distributed.net client for CUDA 2.2 on Linux Copyright 1997-2009, distributed.net Please visit http://www.distributed.net/ for up-to-date contest information. Start the client with '-help' for a list of valid command line options. dnetc v2.9105-511-CTL-09070517-*dev* for CUDA 2.2 on Linux (Linux 2.6.24-24-generic). Please provide the *entire* version descriptor when submitting bug reports. The distributed.net bug report pages are at http://bugs.distributed.net/ Using email address (distributed.net ID) 'sr@hyper.cx' [Jul 08 08:49:04 UTC] RC5-72: using core #0 (CUDA 1-pipe 64-thd). [Jul 08 08:49:23 UTC] RC5-72: Benchmark for core #0 (CUDA 1-pipe 64-thd) 0.00:00:16.10 [130,162,542 keys/sec] [Jul 08 08:49:23 UTC] RC5-72: using core #1 (CUDA 1-pipe 128-thd). [Jul 08 08:49:42 UTC] RC5-72: Benchmark for core #1 (CUDA 1-pipe 128-thd) 0.00:00:16.17 [150,149,637 keys/sec] [Jul 08 08:49:42 UTC] RC5-72: using core #2 (CUDA 1-pipe 256-thd). [Jul 08 08:50:01 UTC] RC5-72: Benchmark for core #2 (CUDA 1-pipe 256-thd) 0.00:00:16.72 [172,018,357 keys/sec] [Jul 08 08:50:01 UTC] RC5-72: using core #3 (CUDA 2-pipe 64-thd). [Jul 08 08:50:20 UTC] RC5-72: Benchmark for core #3 (CUDA 2-pipe 64-thd) 0.00:00:16.82 [152,721,912 keys/sec] [Jul 08 08:50:20 UTC] RC5-72: using core #4 (CUDA 2-pipe 128-thd). [Jul 08 08:50:38 UTC] RC5-72: Benchmark for core #4 (CUDA 2-pipe 128-thd) 0.00:00:16.14 [142,300,154 keys/sec] [Jul 08 08:50:38 UTC] RC5-72: using core #6 (CUDA 4-pipe 64-thd). [Jul 08 08:50:58 UTC] RC5-72: Benchmark for core #6 (CUDA 4-pipe 64-thd) 0.00:00:16.40 [181,006,169 keys/sec] [Jul 08 08:50:58 UTC] RC5-72: using core #7 (CUDA 4-pipe 128-thd). [Jul 08 08:51:17 UTC] RC5-72: Benchmark for core #7 (CUDA 4-pipe 128-thd) 0.00:00:16.76 [162,057,058 keys/sec] [Jul 08 08:51:17 UTC] RC5-72: using core #9 (CUDA 1-pipe 64-thd busy wait). [Jul 08 08:51:36 UTC] RC5-72: Benchmark for core #9 (CUDA 1-pipe 64-thd busy wait) 0.00:00:16.73 [194,831,658 keys/sec] [Jul 08 08:51:36 UTC] RC5-72: using core #10 (CUDA 1-pipe 64-thd sleep 100us). [Jul 08 08:51:55 UTC] RC5-72: Benchmark for core #10 (CUDA 1-pipe 64-thd sleep 100us) 0.00:00:16.38 [104,859,117 keys/sec] [Jul 08 08:51:55 UTC] RC5-72: using core #11 (CUDA 1-pipe 64-thd sleep dynamic). [Jul 08 08:52:15 UTC] RC5-72: Benchmark for core #11 (CUDA 1-pipe 64-thd sleep dynamic) 0.00:00:16.81 [130,969,338 keys/sec] [Jul 08 08:52:15 UTC] RC5-72 benchmark summary : Default core : #0 (CUDA 1-pipe 64-thd) Fastest core : #9 (CUDA 1-pipe 64-thd busy wait) [Jul 08 08:52:15 UTC] Core #9 is significantly faster than the default core. The CUDA core selection has been made as a tradeoff between core speed and responsiveness of the graphical desktop. Please file a bug report along with the output of -cpuinfo only if the the faster core selection does not degrade graphics performance.
微妙に改善された。
しかし、まだまだ遅い・・・ orz
【for CUDA 2.2】と宣言しておきながら、CUDA 2.1でも動くって、導誉?