TY - JOUR
T1 - A simple and exact Laplacian clustering of complex networking phenomena
T2 - Application to gene expression profiles
AU - Kim, Choongrak
AU - Cheon, Mookyung
AU - Kang, Minho
AU - Chang, Iksoo
PY - 2008/3/18
Y1 - 2008/3/18
N2 - Unraveling of the unified networking characteristics of complex networking phenomena is of great interest yet a formidable task. There is currently no simple strategy with a rigorous framework. Using an analogy to the exact algebraic property for a transition matrix of a master equation in statistical physics, we propose a method based on a Laplacian matrix for the discovery and prediction of new classes in the unsupervised complex networking phenomena where the class of each sample is completely unknown. Using this proposed Laplacian approach, we can simultaneously discover different classes and determine the identity of each class. Through an illustrative test of the Laplacian approach applied to real datasets of gene expression profiles, leukemia data [Golub TR, et al. (1999) Science 286:531-537], and lymphoma data [Alizadeh AA, et al. (2000) Nature 403:503-511], we demonstrate that this approach is accurate and robust with a mathematical and physical realization. It offers a general framework for characterizing any kind of complex networking phenomenon in broad areas irrespective of whether they are supervised or unsupervised.
AB - Unraveling of the unified networking characteristics of complex networking phenomena is of great interest yet a formidable task. There is currently no simple strategy with a rigorous framework. Using an analogy to the exact algebraic property for a transition matrix of a master equation in statistical physics, we propose a method based on a Laplacian matrix for the discovery and prediction of new classes in the unsupervised complex networking phenomena where the class of each sample is completely unknown. Using this proposed Laplacian approach, we can simultaneously discover different classes and determine the identity of each class. Through an illustrative test of the Laplacian approach applied to real datasets of gene expression profiles, leukemia data [Golub TR, et al. (1999) Science 286:531-537], and lymphoma data [Alizadeh AA, et al. (2000) Nature 403:503-511], we demonstrate that this approach is accurate and robust with a mathematical and physical realization. It offers a general framework for characterizing any kind of complex networking phenomenon in broad areas irrespective of whether they are supervised or unsupervised.
KW - Complex networking phenomenon
KW - Leukemia
KW - Lymphoma
UR - http://www.scopus.com/inward/record.url?scp=41949091653&partnerID=8YFLogxK
U2 - 10.1073/pnas.0708598105
DO - 10.1073/pnas.0708598105
M3 - Article
C2 - 18337496
AN - SCOPUS:41949091653
SN - 0027-8424
VL - 105
SP - 4083
EP - 4087
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 11
ER -