Testing...

...is a somewhat complicated topic, also mentioned in the section "Ratings lists". To find out if a new revision of Aristarch is better than my reference version, I play 144 games under the following conditions:

This testing method is still unreliable as 144 games are not enough to find out small improvements or disimprovements. But it is necessary to find a compromise between testing time and development cycle.

Sometimes I send an engine to external testers, but unfortunately this does not help much, because I can only compare these results to exactly the same test conditions, which are normally not available or out-dated. Especially helpful are testers who have defined testing conditions and are able to play lots of games.

An important point is that testing in practical games is not the only way to determin playing strength: