data set: hom CV-loop best_performing_clusters 0 'YN', 'GF', 'VQ', 'MKC', 'L', 'DT' 1 'YNW', 'K', 'HLPR', 'MC', 'GVT', 'S', 'A' 2 'Y', 'NFQDGEI', 'KMHC', 'LSPT', 'R' 3 'YNDQ', 'LSV', 'CK', 'M', 'GF', 'RP', 'H' 4 'Y', 'N', 'KM', 'DEA', 'FCVRQ', 'L' 5 'NY', 'KLM', 'C', 'AE', 'RH', 'TGQF', 'S' 6 'YNWFI', 'M', 'KLPR', 'CEH', 'SA' 7 'YF', 'N', 'KMC', 'GVDW', 'LH', 'QEA', 'R' 8 'YN', 'GDI', 'KH', 'MC', 'LR', 'FAEQT', 'W' 9 'NF', 'Y', 'KCM', 'LPR', 'SH' number of clusters per CV-loop: [6, 7, 5, 7, 6, 7, 5, 7, 7, 5] mean length: 6.2 (0.9) cluster sizes: [2, 2, 2, 3, 1, 2] [3, 1, 4, 2, 3, 1, 1] [1, 7, 4, 4, 1] [4, 3, 2, 1, 2, 2, 1] [1, 1, 2, 3, 5, 1] [2, 3, 1, 2, 2, 4, 1] [5, 1, 4, 3, 2] [2, 1, 3, 4, 2, 3, 1] [2, 3, 2, 2, 2, 5, 1] [2, 1, 3, 3, 2] mean cluster sizes per CV-loop: 2.0 2.1 3.4 2.1 2.2 2.1 3.0 2.3 2.4 2.2 data set: het CV-loop best_performing_clusters 0 'Y', 'GFQ', 'HM', 'W', 'NPV', 'ER' 1 'NFA', 'Y', 'GD', 'M' 2 'Y', 'EMH', 'WVQD', 'GF' 3 'Y', 'GFWD', 'MHI', 'RE' 4 'YD', 'GFWV', 'HCRM', 'SP', 'L', 'IE' 5 'YWV', 'IH', 'GFDQN', 'KMR', 'LC', 'S', 'EP' 6 'YF', 'GQDV', 'MKR', 'EN' 7 'YVQ', 'GFN', 'HEM', 'WL' 8 'FPA', 'YGD', 'RSK', 'MI', 'L' 9 'GFWDQ', 'Y', 'ME', 'HIK', 'L' number of clusters per CV-loop: [6, 4, 4, 4, 6, 7, 4, 4, 5, 5] mean length: 4.9 (1.0) cluster sizes: [1, 3, 2, 1, 3, 2] [3, 1, 2, 1] [1, 3, 4, 2] [1, 4, 3, 2] [2, 4, 4, 2, 1, 2] [3, 2, 5, 3, 2, 1, 2] [2, 4, 3, 2] [3, 3, 3, 2] [3, 3, 3, 2, 1] [5, 1, 2, 3, 1] mean cluster sizes per CV-loop: 2.0 1.8 2.5 2.5 2.5 2.6 2.8 2.8 2.4 2.4