Studies on the Clustering Algorithm for Analyzing Gene Expression Data with a Bidirectional Penalty

Publication: Contribution to journal/Conference contribution in journal/Contribution to newspaper, Journal article, Research, peer-reviewed

  • Hu Yang, Central University of Finance & Economics, Sch Informat
  • Xiaoqin Liu

This article reports a new clustering method, based on the k-means algorithm, for high-dimensional gene expression data. The proposed approach uses bidirectional penalties that constrain both the number of clusters and the cluster centroids, so that it simultaneously determines the unknown number of clusters and handles the large amounts of noise in gene expression data. Numerical studies indicate that this algorithm not only performs better in clustering but is also comparable to other approaches in its ability to obtain the correct number of clusters and the correct signal features. Finally, we apply the proposed approach to two benchmark gene expression datasets. These analyses again indicate that the proposed algorithm performs well in clustering high-dimensional gene expression data with an unknown number of clusters.
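
The abstract does not give the objective function or the update rules, so the following is only a rough illustration of the general idea and not the authors' algorithm. It is a minimal sketch that assumes two stand-in mechanisms: soft-thresholding of centroid coordinates (an L1-style penalty that zeroes out noise features) and merging of centroids that lie closer than a tolerance (a crude substitute for a penalty on the number of clusters). The names `penalized_kmeans`, `soft_threshold`, `lam`, `merge_tol`, and `k_max` are hypothetical and do not come from the article.

```python
import numpy as np


def soft_threshold(x, lam):
    """Shrink each entry of x toward zero by lam (L1-style shrinkage)."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)


def penalized_kmeans(X, k_max=6, lam=0.3, merge_tol=2.0, n_iter=50, seed=0):
    """Illustrative k-means variant; NOT the method from the article.

    Assumed stand-ins for the two penalties:
      * centroid coordinates are soft-thresholded by `lam`, so features
        that carry no cluster signal are driven to exactly zero;
      * centroids closer than `merge_tol` are merged, so the procedure
        can start from `k_max` clusters and end with fewer.
    """
    rng = np.random.default_rng(seed)
    # Center the features so that a zero centroid coordinate means "no signal".
    X = X - X.mean(axis=0)
    # Initialize centroids from k_max distinct data points.
    centroids = X[rng.choice(len(X), size=k_max, replace=False)].copy()

    for _ in range(n_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        dist = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)

        # Update step: shrunken cluster means; empty clusters are dropped.
        updated = []
        for k in range(len(centroids)):
            members = X[labels == k]
            if len(members) > 0:
                updated.append(soft_threshold(members.mean(axis=0), lam))

        # Merge step: keep a centroid only if it is far from all kept ones.
        kept = []
        for c in updated:
            if all(np.linalg.norm(c - m) > merge_tol for m in kept):
                kept.append(c)
        centroids = np.array(kept)

    # Final assignment against the surviving centroids.
    dist = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return dist.argmin(axis=1), centroids
```

Called on a samples-by-genes matrix, the function returns one label per sample and the surviving centroids. The number of rows in the centroid array is the estimated number of clusters, and columns that are zero in every centroid correspond to features treated as noise. The values of `lam` and `merge_tol` are arbitrary here and would need tuning for real expression data.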

Original language: English
Journal: Journal of Computational Biology
Volume: 24
Issue: 7
Pages (from-to): 689-698
Number of pages: 10
ISSN: 1066-5277
Status: Published - Jul. 2017

ID: 121438287