DGL global strategies in DNA microarray gene expression analysis and data mining for human blood cancers
Faculty of Computing, Health and Science
School of Computer and Information Science
Computation is required to extract meaningful information from the large amount of data generated by gene expression profiling [1, 2, 3]. Most of the algorithms commonly applied to microarray data analysis have been correlation-based approaches known as cluster analysis. For example, an efficient two-way clustering algorithm was applied to a colon cancer data set consisting of the expression patterns of different cell types. Gene expression in 40 tumour and 22 normal colon tissue samples was analysed across 2000 genes. Cluster analysis groups the genes in microarray data according to the similarity of their expression patterns. Genes that cluster together are likely to be functionally linked and warrant closer examination. Although cluster analysis has been widely accepted for analysing patterns of gene expression, the methods developed may not be able to fully extract the information from microarray data corrupted by high-dimensional noise. If the noise from the genes that are irrelevant is not sufficiently...
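As a rough illustration of the correlation-based cluster analysis described above, the following minimal sketch groups genes whose expression profiles are strongly correlated. It is not the two-way clustering algorithm used in the colon cancer study: it uses a simple threshold on Pearson correlation with single-linkage grouping, chosen here only for brevity. The function names (`pearson`, `correlation_clusters`), the threshold of 0.9, and the four toy profiles are all hypothetical and are not taken from any data set mentioned in the text.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0  # a flat profile carries no correlation information
    return cov / (sx * sy)


def correlation_clusters(profiles, threshold=0.9):
    """Link genes whose correlation is at least `threshold`;
    each connected group of linked genes forms one cluster."""
    names = list(profiles)
    parent = {g: g for g in names}  # union-find forest over gene names

    def find(g):
        # Follow parent links to the cluster representative,
        # shortening the path along the way.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if pearson(profiles[a], profiles[b]) >= threshold:
                parent[find(a)] = find(b)  # merge the two clusters

    clusters = {}
    for g in names:
        clusters.setdefault(find(g), []).append(g)
    return sorted(sorted(members) for members in clusters.values())


# Hypothetical toy data: four genes measured in four samples.
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.1, 4.0, 6.2, 8.1],
    "geneC": [4.0, 3.0, 2.0, 1.0],
    "geneD": [8.0, 6.1, 3.9, 2.0],
}
clusters = correlation_clusters(profiles)
print(clusters)
```

In this toy example, geneA and geneB rise together across the samples while geneC and geneD fall together, so the sketch returns two clusters. Because every gene contributes to the pairwise correlations, a real matrix on the scale of 62 samples by 2000 genes would also carry the noise of irrelevant genes into the grouping, which is the limitation the paragraph above points to.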