Latest Update: 02.04.2021
Hello Chess Friends,
As usual, I'm pleased to announce that,
I've managed to organize another EvalFile competition!
Note: the previous EvalFile tour can be viewed here: Results
There, 1520.bin achieved the best performance!
Several months have passed since the previous tour,
So now let's see which NN files are the strongest...
Note that for the opening book,
I used a new Balsa suite, which is based on previous NNUE wins
Sure, my main target was simply to reduce the draw percentage
According to current tests, draws are reduced by approx. 5%, not bad
Anyhow, the overall draw rate is still very high in the case of NNUE
I mean between engines of close strength (3700 Elo) using the same EvalFile
Btw, to be clearer, the draw problem here is not in the openings,
The real problem is related to the Top NNUE chess engines...
E.g. via NNUE, draws are roughly 15% higher...that's too much!
Usually, we can see wins only with very critical openings...
For example, the overall NNUE draw percentage is close to 95%
That means in every 100 games, we see roughly 5 wins
Sure, I am referring to Blitz, but with Slow time controls
I expect to see almost 100% draws, e.g. with Slow TC 40/120:
Maybe a few wins may appear with very weak openings...
In other words,
I hope to see a real 'Hero' programmer
Who will manage to release a Top NNUE engine that produces
Roughly 70-75% draws, sure, under Blitz, Rapid... conditions!
And then there is no doubt that,
3700 Elo NNUE competitions will be much more exciting...!
I'd also like to point out that
I had the EvalFiles played by two NNUE engines:
Cf EXT 291120 and SugaR AI 1.80
The reason why I preferred the older CF EXT engine is that
Cf EXT 291120 produces a lower draw percentage than all Top...
Also in this EvalFile tour,
Cf EXT 291120 produced approx. 3% fewer draws than SugaR AI 1.80
Note: These stats are based on the Top 10 EvalFiles (playing each other..)
But please bear this in mind too, in the case of NON-NNUE: Results
Cf EXT 291120 produces close to 70% draws (e.g. playing itself...)!
That means approx. 20% fewer draws...unbelievable draw numbers!
Meanwhile, a NNUE Duel (2m+1s): CF EXT vs SugaR:
And as we see,
Cf EXT 291120's engine strength is still not dead...!
What does it mean? Not much Elo progress from the latest
SF based engine releases...at least under these conditions!
1 CF EXT 291120 +6/-6/=214 50.00% 113.0/226
2 SugaR AI 1.80 +6/-6/=214 50.00% 113.0/226
Some more notes about the above Duel match:
Both engines played with the same net: 62ef826d1a6d.nnue
SugaR AI 1.80 played with its default Contempt,
Whereas CF EXT 291120 played with Contempt 40
Duel's overall draw percentage (based on 226 games): 95%
Plus, I ran another Draw test: CF EXT (playing itself)
Which can be useful for anyone who is interested:
CF EXT 291120 C40 (via 62ef826d1a6d) produced: 93% draws
These draw statistics are based on 366 games
And, as I mentioned before,
Via NN file: CF EXT's draws are approx. 20% higher...!
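
For anyone who wants to re-check the Duel numbers, here is a minimal Python sketch. It takes the +6/-6/=214 record from the Duel table and derives the points, the score and the draw percentage. The helper names match_stats and elo_diff are only illustrative (they do not come from any testing tool), and the Elo conversion assumes the standard logistic Elo model, which is not necessarily how the ratings in this tour were computed.

```python
import math

def match_stats(wins, losses, draws):
    """Return (games, points, score, draw_rate) for one side of a match."""
    games = wins + losses + draws
    points = wins + 0.5 * draws          # a draw is worth half a point
    score = points / games               # fraction of available points
    draw_rate = draws / games
    return games, points, score, draw_rate

def elo_diff(score):
    """Elo difference implied by a score fraction (logistic model, assumption)."""
    if not 0.0 < score < 1.0:
        raise ValueError("score must be strictly between 0 and 1")
    return 400.0 * math.log10(score / (1.0 - score))

# Record of CF EXT 291120 (and of SugaR AI 1.80) in the Duel: +6/-6/=214
games, points, score, draw_rate = match_stats(6, 6, 214)

print(f"games={games} points={points} score={score:.2%}")
print(f"draw rate={draw_rate:.1%}")      # 214 / 226, which rounds to 95%
print(f"implied Elo difference={elo_diff(score):.1f}")
```

With a 50.00% score the implied Elo difference is zero, which is the "no much progress" point of the Duel in numbers.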
And now let's get back to the current NNUE tour,
As EvalFile players,
I've picked mainly the latest NN releases (per author) Net
Btw, not sure exactly why, but some NN files didn't work..
I mean with both engines (Cf EXT 291120, SugaR AI 1.80), e.g.:
FatFritz2_v1, 2edce6db11bb, 9ac267288b8f, b700127fe341
20f7dbe4b33b, 57bacd2f041e, 9aaa2abe4015, b20e264c6f88
However, I have good news,
Most of the files I tried to test worked flawlessly...
And here is the final 'EvalFile' ranking:
The Winner: nn-62ef826d1a6d.nnue - Congrats to SFisGOD!
Note also that nn-62ef826d1a6d.nnue was released on 28.10.2020
 # PLAYER        : RATING POINTS GAMES   (%)
 1 62ef826d1a6d  : 3692.7  313.0   576 54.3%
 2 371e294c6117  : 3687.1  307.0   576 53.3%
 3 66a8461e0dd6  : 3686.4  307.0   576 53.3%
 4 cb26f10b1fd9  : 3685.1  305.5   576 53.0%
 5 ae5925b37cc9  : 3681.4  302.5   576 52.5%
 6 82f30097537a  : 3680.5  301.5   576 52.3%
 7 20200914-1520 : 3675.0  296.5   576 51.5%
 8 30f6ddc579c2  : 3664.9  288.0   576 50.0%
 9 9ed9e3a70a1b  : 3660.2  283.5   576 49.2%
10 1de20d6fd769  : 3659.4  282.5   576 49.0%
11 011f4b2f4629  : 3601.8  163.5   396 41.3%
12 c193f6be9438  : 3534.0  125.5   396 31.7%
All EvalFiles were played by CF EXT 291120 and SugaR AI 1.80
The matches ran in parallel, so for maximum performance/strength
Several Arena Chess GUIs were used, and for each EvalFile player
A separate copy of the engine was installed, for more information: Details
011f4b2f4629 and c193f6be9438 played fewer games, the reason:
The tour's main target is testing NN files close to 3700 Elo points!
Current overall draw percentage (based on 3276 games) is 85%
Here I think the Balsa (Wins) Opening Suite played a BIG role..!
Another factor is that some of the NN files are weaker...
When both sides use the same NN file...the draw numbers go higher...
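
As a quick consistency check of the ranking table, here is a small Python sketch. It recomputes each score percentage from POINTS and GAMES, and confirms that the per-player game counts add up to the 3276 games quoted above. The rows are copied from the table; the names TABLE, score_percent and total_games are only illustrative, and the RATING column is not recomputed here.

```python
# (player, points, games) copied from the final EvalFile ranking table
TABLE = [
    ("62ef826d1a6d", 313.0, 576),
    ("371e294c6117", 307.0, 576),
    ("66a8461e0dd6", 307.0, 576),
    ("cb26f10b1fd9", 305.5, 576),
    ("ae5925b37cc9", 302.5, 576),
    ("82f30097537a", 301.5, 576),
    ("20200914-1520", 296.5, 576),
    ("30f6ddc579c2", 288.0, 576),
    ("9ed9e3a70a1b", 283.5, 576),
    ("1de20d6fd769", 282.5, 576),
    ("011f4b2f4629", 163.5, 396),
    ("c193f6be9438", 125.5, 396),
]

def score_percent(points, games):
    """Score as a percentage of the points available."""
    return 100.0 * points / games

def total_games(rows):
    """Each game appears in two players' rows, so halve the sum."""
    return sum(games for _, _, games in rows) // 2

for name, points, games in TABLE:
    print(f"{name:<13} {score_percent(points, games):5.1f}%")

print("total games :", total_games(TABLE))
# Every game hands out exactly one point, so the points must sum to the same total.
print("total points:", sum(points for _, points, _ in TABLE))
```

Both totals come out at 3276, so the table is consistent with the overall game count.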
And as a last note,
Running this kind of testing is full of pain..not so easy...
I mean a lot of details, attention, free time etc. are required....
Even publishing the current online results takes a lot of free time!
But for a chess LOVER:
All this effort should not be counted as a BIG problem, right?!)
And many thanks for your interest...!)