Clustering, Assessment and Validation: an application to gene expression data

IRIS

In this work a multi-step approach for clustering assessment, visualization and data validation is introduced. Three main approaches for data clustering are used and compared: K-means, Self Organizing Maps and Probabilistic Principal Surfaces. A model explorer approach with different similarity measures is used to obtain the best parameters of the methods. The approach is used to identify genes periodically expressed in tumors related to the human cell cycle. Finally, clusters are validated by using GO Term information.

Clustering, Assessment and Validation: an application to gene expression data

CIARAMELLA, Angelo;S. COCOZZA;F. IORIO;G. MIELE;F. NAPOLITANO;M. PINELLI;G. RAICONI;R. TAGLIAFERRI

2007-01-01

Abstract

In this work a multi-step approach for clustering assessment, visualization and data validation is introduced. Three main approaches for data clustering are used and compared: K-means, Self Organizing Maps and Probabilistic Principal Surfaces. A model explorer approach with different similarity measures is used to obtain the best parameters of the methods. The approach is used to identify genes periodically expressed in tumors related to the human cell cycle. Finally, clusters are validated by using GO Term information.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2007
			
	Codice ISBN
	
				142441380X
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11367/29106

Citazioni

ND

4

0

social impact