Skip to main content

Advertisement

Log in

Gene ordering in partitive clustering using microarray expressions

  • Published:
Journal of Biosciences Aims and scope Submit manuscript

Abstract

A central step in the analysis of gene expression data is the identification of groups of genes that exhibit similar expression patterns. Clustering and ordering the genes using gene expression data into homogeneous groups was shown to be useful in functional annotation, tissue classification, regulatory motif identification, and other applications. Although there is a rich literature on gene ordering in hierarchical clustering framework for gene expression analysis, there is no work addressing and evaluating the importance of gene ordering in partitive clustering framework, to the best knowledge of the authors. Outside the framework of hierarchical clustering, different gene ordering algorithms are applied on the whole data set, and the domain of partitive clustering is still unexplored with gene ordering approaches. A new hybrid method is proposed for ordering genes in each of the clusters obtained from partitive clustering solution, using microarry gene expressions. Two existing algorithms for optimally ordering cities in travelling salesman problem (TSP), namely, FRAG_GALK and Concorde, are hybridized individually with self organizing MAP to show the importance of gene ordering in partitive clustering framework. We validated our hybrid approach using yeast and fibroblast data and showed that our approach improves the result quality of partitive clustering solution, by identifying subclusters within big clusters, grouping functionally correlated genes within clusters, minimization of summation of gene expression distances, and the maximization of biological gene ordering using MIPS categorization. Moreover, the new hybrid approach, finds comparable or sometimes superior biological gene order in less computation time than those obtained by optimal leaf ordering in hierarchical clustering solution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

GA:

genetic algorithm

MIPS:

Munich Information for Protein Sequences

NF:

nearest-neighbor

SOM:

self-organizing map

TSP:

traveling salesman problem

References

  • Applegate D, Bixby R, Chvtal V and Cook W 2003 Concorde Package. [Online], www.tsp.gatech.edu/concorde/downloads/codes/src/co031219.tgz

  • Bar-Joseph Z, Gifford D K and Jaakkola T S 2001 Fast optimal leaf ordering for hierarchical clustering; Bioinformatics 17 2229

    Google Scholar 

  • Biedl T, Brejov B, Demaine E D, Hamel A M and Vinar T 2001 Optimal arrangement of leaves in the tree representing hierarchical clustering of gene expression data (Technical report, Department of Computer Science, University of Waterloo)

  • Eisen M B, Spellman P T, Brown P O and Botstein D 1998 Cluster analysis and display of genome-wide expression patterns; Proc. Natl Acad. Sci., USA 95 14863–14868

    Article  PubMed  CAS  Google Scholar 

  • Iyer V R, Eisen M B, Ross D T, Schuler G, Moore T, Lee J C F, Trent J M, Staudt L M et al 1999 The transcriptional program in the response of human fibroblasts to serum; Science 283 83–87

    Article  PubMed  CAS  Google Scholar 

  • Pal S K, Bandyopadhyay S and Ray S S 2006 Evolutionary computation in bioinformatics: A review; IEEE Trans. Systems Man Cybernetics Part C 36 601–615

    Article  Google Scholar 

  • Ray S S, Bandyopadhyay S and Pal S K 2007 Genetic operators for combinatorial optimization in TSP and microarray gene ordering; Appl. Intelligence 26 183–195

    Article  Google Scholar 

  • Sharan R, Maron-Katz A and Shamir R 2003 CLICK and EXPANDER: a system for clustering and visualizing gene expression data; Bioinformatics 19 1787–1799

    Article  PubMed  CAS  Google Scholar 

  • Sherlock G, Hernandez-Boussard T, Kasarskis A, Binkley G, Matese J C, Dwight S S, Kaloper M, Weng S et al 2001 The Stanford microarray database; Nucleic Acids Res. 29 152–155

    Article  PubMed  CAS  Google Scholar 

  • Tamayo P, Slonim D, Mesirov J, Zhu Q, Kitareewan S, Dmitrovsky E, Lander E S and Golub T R 1999 Interpreting patterns of gene expression with self-organizing maps: Methods and application to hematopoietic differentiation; Proc. Natl. Acad. Sci. USA 96 2907–2912

    Article  PubMed  CAS  Google Scholar 

  • Tsai H K, Yang J M, Tsai Y F and Kao C Y 2004 An evolutionary approach for gene expression patterns; IEEE Trans. Info. Tech. Biomed. 8 69–78

    Article  Google Scholar 

  • Venet D 2003 MatArray: a Matlab toolbox for microarray data; Bioinformatics 19 659–660

    Article  PubMed  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shubhra Sankar Ray.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ray, S.S., Bandyopadhyay, S. & Pal, S.K. Gene ordering in partitive clustering using microarray expressions. J Biosci 32 (Suppl 1), 1019–1025 (2007). https://doi.org/10.1007/s12038-007-0101-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12038-007-0101-5

Keywords

Navigation