Softwares:
1.
Spanning-tree Progression Analysis of Density-normalized Events (SPADE)
2.
Sample Progression Discovery (SPD)
3.
Spectral Analysis for Class Discovery and Classification (SPACC)
4.
Fast Calculation of Pairwise Mutual Information Based on Kernel
Estimation
5.
TreeVis: A MATLAB-based Tool for Tree
Visualization
กก
1. Spanning-tree Progression Analysis of Density-normalized Events (SPADE)
Flow cytometry captures the heterogeneity of biological systems by providing multiparametric measurements of individual cells. Traditional analysis of flow cytometry datasets is often a subjective process that requires prior knowledge of the biological system. We present a novel analytical approach, Spanning-tree Progression Analysis of Density-normalized Events (SPADE), that uncovers an underlying cellular hierarchy from flow cytometry data, without requiring prior knowledge.
The software can be downloaded here
2. Sample Progression Discovery (SPD)
In contrast to the majority of existing methods which focus on identifying differences between sample groups (i.e. normal vs. cancer, treated vs. control), SPD aims to identify an underlying progression among individual samples, both within and across sample groups. We view SPD as a hypothesis generation tool, when applied to datasets where the progression is unclear. For example, when applied to a microarray dataset of cancer samples, SPD would assume that the cancer samples collected from individual patients represent different stages during an intrinsic progression underlying cancer development. The inferred relationship among the samples may therefore indicate a pathway or hierarchy of cancer progression, which serves as a hypothesis to be tested.
The software can be downloaded
here 3.
Spectral Analysis for Class Discovery and Classification (SPACC) Classification methods are mainly divided into two categories.
Unsupervised methods do not use class labels. They have the ability to
discover new classes, but lack ways to identify new classes
systematically. In supervised methods, the class label information plays
such an important role that supervised methods do not have the ability
to discover new classes. I developed a novel classification method,
SPACC, where the key idea is to make the class label information play a
less important role. SPACC is the first that performs class discovery
and classification simultaneously.
The software can be downloaded
here 4.
Fast Calculation of Pairwise Mutual Information Based on Kernel
Estimation We present a new software implementation for more efficiently
computing the mutual information for all pairs of genes from gene
expression microarrays. Computation of the mutual information is a
necessary first step in various information theoretic approaches for
reconstructing gene regulatory networks from microarray data. When the
mutual information is estimated by kernel methods, computing the
pairwise mutual information is quite time-consuming. Our implementation
significantly reduces the computation time. For an example data set of 9563 genes and 336 samples, the
current available software for ARACNE requires 142 hours to compute the
mutual information between all possible pairs of genes, whereas our
implementation requires 1.6 hours. The Matlab code is available at
here 5.
TreeVis: A MATLAB-based Tool for Tree Visualization Many analyses of biological data produce results in the form of tree
structures. Generating easily interpretable layouts to visualize
these tree structures is a non-trivial task. We present a new
visualization algorithm, TreeVis, to generate clear
two-dimensional layouts of complex tree
structures. The Matlab code is available at
here