Link to corresponding reddit article here.
So this is just a quick follow-up to my previous article on the TITAN V. This time I’ve decided to try to benchmark the rest of the whitelisted GPU projects that give NVIDIA work units, and of course I’ll be running my 1080 Ti at the same time for all tests again. See my previous article if you’d like some more details on how the TITAN V stacks up to the 1080 Ti.
Testing setup
So the testing rig is of course exactly the same.
Component | Description |
---|---|
CPU | Intel i7-4790K overclocked to 4.6GHz |
CPU Cooler | NZXT Kraken X62 |
Memory | 2 x 8GB G.Skill TridentX DDR3 @ 2400MHz |
Motherboard | ASUS Sabertooth Z87 |
Storage | Samsung 850 EVO 1TB, 1 x 3TB WD Green & Black |
GPU 1 | NVIDIA TITAN V |
GPU 2 | ASUS GeForce GTX 1080 Ti STRIX OC |
PSU | Corsair HX1000i |
Case | Coolermaster MasterCase 5 Pro |
Testing was conducted at 27C ambient and both GPUs were overclocked using the below settings in MSI Afterburner 4.40 using NVIDIA's 390.77 driver.
Benchmarks
For each section, I’ll be showing a screenshot of:
• The BOINC Manager window, showing the completion time for the TITAN and the relative progress of the task running simultaneously on the 1080 Ti.
• HWiNFO64, to show power draw and other statistics.
• MSI Afterburner’s sensors to show GPU usage, power draw (as a percentage of TDP) and temperatures over time.
Asteroids & SETI
I couldn’t get Asteroids to attach to my BOINC Manager, and SETI refused to send me NVIDIA work units for whatever reason. So we’re 0 for 2 on that score.
Einstein & GPUGrid
Einstein didn’t seem to like the TITAN, and any tasks running on the TITAN spit out computation errors, and when I suspended the project, it actually crashed the GPU drivers on one occasion. So not an ideal result, but I can’t seem to get the Einstein work units to run properly on the TITAN. Removing the overclock on the TITAN did not make a difference.
Similar to Einstein, for GPUGrid any work units that attempted to run on the TITAN resulted in a computation error almost immediately. Likewise, removing the overclock did not help either.
Moo Wrapper
So, I know Moo isn’t on the whitelist any more, but I thought I’d toss it in here as a bonus since the previous four projects didn’t work. Similar to Enigma from the previous article, both the TITAN and 1080 Ti crunch at the same rate, indicating a bottleneck elsewhere.
Amicable Numbers
Now I intentionally saved the best for last here, as these results are somewhat surprising. For Amicable, I’m using the 10^20 GPU work units with a kernel size of 22. Now it’s worth noting that Amicable sends batches of work units, that vary in their time to complete. It seems the main way to distinguish between batches is by looking at this number in the green highlight here in the work unit name.
Henceforth I’ll distinguish them by this number, so for example, Amicable 518. In my testing I encountered three types of work units: 518, 3148 and 6141. I have no doubt that there probably countless other types, so unfortunately this data won’t be 100% representative of total Amicable output.
Amicable 518
Amicable 3148
Amicable 6141
So just like with Collatz, the TITAN seems to outperform the 1080 Ti by a factor of three or more. Power consumption is still significantly lower than the 1080 Ti, averaging about 60W less. Of particular note is the exceptionally short time to compute the 518 & 6141 WUs on the TITAN.
Results
TITAN V | Time to complete | Est. Max RAC | Est. Max Mag | Power Draw (W) | Mag/W |
---|---|---|---|---|---|
Amicable 518 WU | ~0:38 | 15,543,337 | 598.73 | 176 | 3.402 |
Amicable 3148 WU | ~4:00 | 2,461,029 | 94.80 | 166 | 0.571 |
Amicable 6141 WU | ~0:31 | 19,053,123 | 733.92 | 184 | 3.989 |
Moo Wrapper | ~9:50 | 899,731 | - | 137 | - |
GTX 1080Ti | Time to complete | Est. Max RAC | Est. Max Mag | Power Draw (W) | Mag/W |
---|---|---|---|---|---|
Amicable 518 WU | ~2:10 | 4,543,437 | 175.01 | 242 | 0.723 |
Amicable 3148 WU | ~11:20 | 868,598 | 33.46 | 268 | 0.125 |
Amicable 6141 WU | ~2:01 | 4,881,379 | 188.03 | 234 | 0.804 |
Moo Wrapper | ~9:50 | 899,731 | - | 260 | - |
The estimated max RAC for each project was calculated as:
- n = number of WUs completed simultaneously
- c = credit per work unit
- t = time to completion for work unit(s) in seconds
The estimated max magnitude was calculated from multiplying mag/RAC by the above figure. Mag/RAC was calculated from analysing project statistics on gridcoinstats.eu’s website. These estimations are of course a rough approximation, and may not be entirely accurate.
Amicable, the best project for the TITAN?
So initially I was going to publish this article on the day after I published my previous TITAN article, but I was unhappy with the conclusion I had drawn from the data above, and thought the Amicable results warranted some further investigation. So to get a much more accurate assessment of estimated RAC for Amicable, I left my TITAN crunching Amicable for 24 hours straight, with no interruptions. I also changed the app_config.xml to allow two work units to be run simultaneously to slightly increase output, because it appeared there was a small downtime between work units where the GPU wasn’t in use. So to calculate the estimated RAC, we’ll find the difference in total credit, add validation pending work units (assuming they will all validate, which they probably will), and then we’ll have total credit delta, from which we can estimate magnitude.
Total credit at start | 36,901,773.1 |
---|---|
Total credit at end | 40,962,472 |
Add validation pending work units | 2,044,021 |
Total credit delta (est. max RAC) | 6,104,720 |
Est. Max Mag from credit delta | 235.15 |
Average Power draw | 180W |
Magnitude per watt | 1.306 |
At 235.15 magnitude, Amicable looks to be the most profitable project for the TITAN. Let’s collate all the data in a new table to compare the results:
Project | Est. Max Mag | Mag/W |
---|---|---|
Amicable | 235.15 | 1.306 |
Collatz | 175.68 | 0.960 |
Primegrid | 162.98 | 0.858 |
Milkyway | 177.54 | 1.365 |
Milkyway 16 WU | 329.71 | - |
Enigma | 56.81 | 0.394 |
Now I’ve included a “Milkyway 16 WU” entry in there to roughly estimate what the magnitude would be if Milkyway actually worked properly, and I was able to run 16 WUs concurrently, just to show the potential of the TITAN. So currently, looks like Amicable is the most profitable, whilst Milkyway with 8 WUs is just barely more efficient.
Conclusion
According to these numbers, Amicable’s probably the best project for the TITAN at the moment, given that Milkyway doesn’t push it to its full potential. The magnitude estimate here is roughly 60 higher than any of the other projects, so sorry Milkyway, looks like I’ll be discovering Amicable pairs until your NVIDIA app becomes more optimised.
What I see there is a lot more than just 3 listed types of work units for Amicable Numbers. I expect these numbers might be batch numbers rather than WU types (worth to find out), but those numbers don't seem to be in order. See also AN WU optimization.
These are some huge RAC numbers, lol. I have expected for 1080 Ti to reach ~1M RAC, 4M surprised me, but you haven't re-tested it. 6M for Titan (after re-test) seems reasonable.
Ah I didn't mean to imply types of work units like PPS-Sieve vs GFN on PrimeGrid, I guess the word I was looking for was batches. I think you're absolutely correct, and I'm going to update the article with that correction.
I do remember reading that article a while back, I guess 518 and 6141 in this article are examples of more GPU-optimised work units, whilst the 3148 is less so.
OK, I have 1st hand explanation for WU naming in the Amicable Project.
Example: amicable_10_20_20198_1519206302.310434_177
amicable_10_20 - app name
20198 - PID of the process that generated this WU
1519206302.310434 - timestamp
177 - number in the batch
Conclusion - It's only a coincidence than 518 WUs took much shorter time than 3148 WUs.