v2.9105.512 (beta8) on 9500GT
早速、9500GTでbeta8を試してみた。
CUDAを2.1から2.2にあげる。
ちゃんとCUDA Clientを停止して、bench!
$ ./dnetc -shutdown dnetc: 1 distributed.net client was shutdown. 4 failures (Operation not permitted). siryu@kanu:~$ ./dnetc -bench rc5-72 distributed.net client for CUDA 2.2 on Linux Copyright 1997-2009, distributed.net Please visit http://www.distributed.net/ for up-to-date contest information. Start the client with '-help' for a list of valid command line options. dnetc v2.9105-512-CTL-09072609-*dev* for CUDA 2.2 on Linux (Linux 2.6.24-22-generic). Please provide the *entire* version descriptor when submitting bug reports. The distributed.net bug report pages are at http://bugs.distributed.net/ Using email address (distributed.net ID) 'sr@hyper.cx' [Jul 27 01:00:12 UTC] RC5-72: using core #0 (CUDA 1-pipe 64-thd). [Jul 27 01:00:31 UTC] RC5-72: Benchmark for core #0 (CUDA 1-pipe 64-thd) 0.00:00:16.47 [39,152,027 keys/sec] [Jul 27 01:00:31 UTC] RC5-72: using core #1 (CUDA 1-pipe 128-thd). [Jul 27 01:00:50 UTC] RC5-72: Benchmark for core #1 (CUDA 1-pipe 128-thd) 0.00:00:16.06 [39,115,880 keys/sec] [Jul 27 01:00:50 UTC] RC5-72: using core #2 (CUDA 1-pipe 256-thd). [Jul 27 01:01:09 UTC] RC5-72: Benchmark for core #2 (CUDA 1-pipe 256-thd) 0.00:00:16.71 [38,557,712 keys/sec] [Jul 27 01:01:09 UTC] RC5-72: using core #3 (CUDA 2-pipe 64-thd). [Jul 27 01:01:29 UTC] RC5-72: Benchmark for core #3 (CUDA 2-pipe 64-thd) 0.00:00:17.06 [39,285,038 keys/sec] [Jul 27 01:01:29 UTC] RC5-72: using core #4 (CUDA 2-pipe 128-thd). [Jul 27 01:01:49 UTC] RC5-72: Benchmark for core #4 (CUDA 2-pipe 128-thd) 0.00:00:16.61 [34,490,187 keys/sec] [Jul 27 01:01:49 UTC] RC5-72: using core #6 (CUDA 4-pipe 64-thd). [Jul 27 01:02:08 UTC] RC5-72: Benchmark for core #6 (CUDA 4-pipe 64-thd) 0.00:00:16.32 [39,530,311 keys/sec] [Jul 27 01:02:08 UTC] RC5-72: using core #7 (CUDA 4-pipe 128-thd). [Jul 27 01:02:27 UTC] RC5-72: Benchmark for core #7 (CUDA 4-pipe 128-thd) 0.00:00:16.57 [34,583,977 keys/sec] [Jul 27 01:02:27 UTC] RC5-72: using core #9 (CUDA 1-pipe 64-thd busy wait). [Jul 27 01:02:47 UTC] RC5-72: Benchmark for core #9 (CUDA 1-pipe 64-thd busy wait) 0.00:00:16.47 [39,152,242 keys/sec] [Jul 27 01:02:47 UTC] RC5-72: using core #10 (CUDA 1-pipe 64-thd sleep 100us). [Jul 27 01:03:06 UTC] RC5-72: Benchmark for core #10 (CUDA 1-pipe 64-thd sleep 100us) 0.00:00:16.48 [39,129,715 keys/sec] [Jul 27 01:03:06 UTC] RC5-72: using core #11 (CUDA 1-pipe 64-thd sleep dynamic). [Jul 27 01:03:25 UTC] RC5-72: Benchmark for core #11 (CUDA 1-pipe 64-thd sleep dynamic) 0.00:00:16.47 [39,152,398 keys/sec] [Jul 27 01:03:25 UTC] RC5-72 benchmark summary : Default core : #0 (CUDA 1-pipe 64-thd) Fastest core : #6 (CUDA 4-pipe 64-thd) [Jul 27 01:03:25 UTC] Core #6 is marginally faster than the default core. Testing variability might lead to pick one or the other.
ここで確認したときは、
core #1 (CUDA 2-pipe 64-thd)を使って、
77,781,297 keys/sec
でしたので、やはり速度は1/2ですかねぇ。
芳しく御座いませんねぇ。
面倒なので、このまま放置。
9800GTX+は、beta7のままってことで。