Current ab initio methods for structure–prediction of proteins explore multiple simulated conformations, called de- coys, to generate families of folds, one of which is the closest to the native one. To limit the exploration of the conformational space, clustering algorithms are routinely applied to group similar decoys and then finding the most plausible cluster centroid, based on the hypothesis that there are more low–energy conformations surrounding the native fold than the others; nevertheless different clustering algorithms, or different parameters, are likely to output different partitions of the input data and choosing only one of the possible solutions can be too restrictive and unreliable. meta–clustering algorithms allow to reconcile multiple clustering solutions by grouping them into meta-clusters (i.e. clusters of clusterings), so that similar partitions are grouped in the same meta–cluster. In this paper the use of meta–clustering is proposed for the selection of lowest energy decoys, testing the Rough Graded Possibilistic c-medoids clustering algorithm for both baseline clustering and meta–clustering. Preliminary tests on real data suggest that meta–clustering is effective in reducing the sensitivity to parameters of the clustering algorithm and to expand the explored space.

Decoy Meta–Clustering Through Rough Graded Possibilistic C-Medoids

Ferone, Alessio
;
Maratea, Antonio
2020-01-01

Abstract

Current ab initio methods for structure–prediction of proteins explore multiple simulated conformations, called de- coys, to generate families of folds, one of which is the closest to the native one. To limit the exploration of the conformational space, clustering algorithms are routinely applied to group similar decoys and then finding the most plausible cluster centroid, based on the hypothesis that there are more low–energy conformations surrounding the native fold than the others; nevertheless different clustering algorithms, or different parameters, are likely to output different partitions of the input data and choosing only one of the possible solutions can be too restrictive and unreliable. meta–clustering algorithms allow to reconcile multiple clustering solutions by grouping them into meta-clusters (i.e. clusters of clusterings), so that similar partitions are grouped in the same meta–cluster. In this paper the use of meta–clustering is proposed for the selection of lowest energy decoys, testing the Rough Graded Possibilistic c-medoids clustering algorithm for both baseline clustering and meta–clustering. Preliminary tests on real data suggest that meta–clustering is effective in reducing the sensitivity to parameters of the clustering algorithm and to expand the explored space.
2020
978-1-7281-4384-2
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11367/84550
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact