Critical difference diagram for 14 classifiers included in the new evaluation. Solid bars indicate cliques, within which there is no significant difference in rank. Tests are performed with the sign rank test using the Holm correction. Top clique of four classifiers represent the state of the art in Spring 2020.