Modern methods for ab initio prediction of protein structures typically explore multiple simulated conformations, called decoys, to find the best native-like conformations. To limit the search space, clustering algorithms are routinely used to group similar decoys, based on the hypothesis that the largest group of similar decoys will be the closest to the native state. In this paper a novel clustering algorithm, called Graded Possibilistic c-medoids, is proposed and applied to a decoy selection problem. As it will be shown, the added flexibility of the graded possibilistic framework allows an effective selection of the best decoys with respect to similar methods based on medoids - that is on the most central points belonging to each cluster. The proposed algorithm has been compared with other c-medoids algorithms and also with SPICKER on real data, the large majority of times outperforming both.
Decoy clustering through graded possibilistic c-medoids
FERONE, Alessio;MARATEA, Antonio
2017-01-01
Abstract
Modern methods for ab initio prediction of protein structures typically explore multiple simulated conformations, called decoys, to find the best native-like conformations. To limit the search space, clustering algorithms are routinely used to group similar decoys, based on the hypothesis that the largest group of similar decoys will be the closest to the native state. In this paper a novel clustering algorithm, called Graded Possibilistic c-medoids, is proposed and applied to a decoy selection problem. As it will be shown, the added flexibility of the graded possibilistic framework allows an effective selection of the best decoys with respect to similar methods based on medoids - that is on the most central points belonging to each cluster. The proposed algorithm has been compared with other c-medoids algorithms and also with SPICKER on real data, the large majority of times outperforming both.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.