DARYA CHUDOVA


Home Department

Information and Computer Science

Thesis Advisor
Padhraic Smyth - Information and Computer Science

Co-thesis Advisor
G. Wesley Hatfield – Microbiology and Molecular Genetics, College of Medicine

Email


Darya Chudova is studying the problem of joint reconstruction of structure and dynamics of regulatory sub-networks in yeast from available experimental data. Smyth’s lab presents a family of probabilistic models and inference algorithms that attempt to simultaneously recover the structure of interactions between the nodes in the network and to quantify these interactions. She makes specific assumptions about the structure of interactions and the dynamics in order to constrain the set of probable models. The models assume a small number of tightly interacting "master" genes in the core of the network that govern the dynamics of the system. The rest of the genes follow the dynamics of their "master" nodes with some time lag, without influencing the core dynamics. Given the structure of the network, the dynamics is described by a system of differential equations that can be modeled with a neural network. She uses Bayesian approach and allow to incorporate prior knowledge about (1) the out-degrees of the "master" genes, (2) specific interactions that take place (known pathways) and (3) general structure of the regulation matrix (known co-regulated families of genes etc.). The approach extends commonly used techniques that de-couple structure from dynamics, and perform clustering of expression profiles, followed by modeling the dynamics within the learned clusters.

The computational challenge of the problem is in simultaneous recovery of structure of the interactions (discrete optimization problem) and dynamics of the system (continuous optimization problem). She, in collaboration with Dr. Mjolsness’ lab developed the Gibbs sampler to address this problem, and demonstrated that it can successfully recover the data generating models on synthetic data sets with characteristics similar to that of real gene expression data. At present, are working on gathering applicable prior information and applying the models to the gene expression time course data in yeast. She plans to incorporate the binding ratio data and protein/protein interaction data as a future step.


Publications


D. Chudova, P. Smyth, Anaylsis of pattern discovery in sequences using Bayes error rate framework. (Acccepted for publication in the Data Mining and Kowledge Discovery journal.)

N. Kh. Rozov, V. G. Sushko, D. I. Chudova. Differential equations with degenerate coefficient by order derivative. Fundamentalnaya I prikladnaya matematika (Fundamental and applied mathematics), 1998. v. 4, # 3, pp.1063-1095 (In Russian)

D. Speights, J. Brodsky and D. Chudova. Using Neural Networks to Predict Claim Duration in the Presence of Right Censoring and Covariates. Casual Actuarial Society Forum, Winter 1999


Conference Proceedings


Probabilistic Models for Joint Clustering and Time-Warping of Multidimensional Curves. D. Chudova, S. Gaffney, P. Smyth. Submitted to UAI-03 (Uncertainty in Artificial Intelligence) conference.

Translation-invariant clustering of multidimensional curves. D. Chudova, S. Gaffney, E. Mjolsness, P. Smyth, Submitted to ACM SIGKDD-03 conference.

Pattern discovery in sequences under a Markov assumption. D. Chudova and P. Smyth. Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Edmonton, 2002.
Unsupervised identification of sequential patterns under a Markov assumption. D. Chudova and P. Smyth. Presented at the ACM SIGKDD 2001 Workshop on Temporal Data Mining, San Francisco, 2001.

Towards scalable support vector machines using squashing. D.Pavlov, D. Chudova and P. Smyth. Proceedings of the Sixths ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Boston, 2000

Benchmarking of Different Modifications of the Cascade Correlation Algorithm. D. I. Chudova, S. A. Dolenko, Yu. V. Orlov, D. Yu. Pavlov, I. G. Persiantsev. Proc. 3rd International Conference on Adaptive Computing in Design and Manufacture, 1998, pp.339-344.

Development of a Statistics Based System for Fluorescent Diagnostics of Organic Pollution in Water. Yu.V.Orlov, I.G.Persiantsev, D.I.Chudova, D.Yu.Pavlov, S.M.Babichenko. Proc. of the 3rd EARSeL Workshop on Lidar Remote Sensing of Land and Sea, 1997, pp.157-162.


 
 
 
    Back to Predoctoral Trainees  
     
© copyright 2003. Institute for Genomics and Bioinformatics | 949.824.9296 tel | 949.824.9813 fax | igb@ics.uci.edu