NEW: The ChIPseeqer paper is out and highlighted as

This is a partial list of possible projects in my group (contact Olivier Elemento for more details on any of these projects)

GENOMIC DATA MINING AND DISPLAY

  • A PAGE-based framework for systematic interpretation of gene expression studies
  • Displaying genomic data in the CAVE virtual environment
  • Clustering large scale heterogeneous genomic datasets using information theory
  • Explaining gene expression using synergistically informative pathways (extension to PAGE)
  • Software library for information-theoretic analysis of genomic data
  • Integrating FIRE, FIRE-pro and PAGE in one tool with GUI (Java or C++)
  • Measure clustering quality using PAGE and FIRE
  • Software/wiki to draw/distribute/simulate/etc gene networks
  • Improving discovery of orthologs using genetic algorithms

    DISEASE-ORIENTED COMPUTATIONAL BIOLOGY

  • Studying the co-expression network of GPCRs, predict drug specificity
  • Identifying A-to-I editing targets in the brain using data integration
  • Predicting methylation patterns in lymphoma
  • Predicting AID targets in B cells using data integration

    REGULATORY ELEMENTS AND GENE/PROTEIN REGULATION

  • Integrating conservation into the FIRE motif discovery process
  • Understanding the interplay between regulatory elements and nucleosomes
  • Discovering enhancers using comparative genomics
  • Discovering boundary elements
  • Dissecting the structure of core promoters
  • Discovering motifs involved in splicing using comparative genomics
  • Predicting the functional consequence of regulatory mutations
  • Detecting selection in human promoters
  • Extracting regulatory sequences from the literature using text mining
  • Discovering protein motifs using network-level conservation
  • An information-theorerical approach for learning regulatory programs
  • Finding informative RNA structural elements in 5' and 3'UTRs
  • Explaining conservation using FIRE (systematic discovery of contextual regulatory sequences)
  • Reconstructing a cancer-oriented regulatory network using comparative genomics
  • Studying the link between motif affinity and function
  • Identifying regulators from motifs and large expression datasets
  • Revealing motifs underlying chromatin changes and epigenomic modifications
  • Understanding mRNA and protein abundance (as opposed to gene expression differences)
  • Discovering motifs with a role in dynamic nucleosome positioning
  • Learning predictive model of regulation using complete feature integration
  • An integrative framework for explaining gene expression (learning motifs and rules at the same time)
  • Evolution of the transcription machinery in vertebrates and mammals and euks