Computational Regulatory Genomics
We are in the Department of Biomedical Engineering and the McKusick-Nathans Institute of Genetic Medicine at Johns Hopkins University. You can apply for graduate study in my lab through BME or the Ph.D. program in Human Genetics.
Research Interests:The ultimate goal of our research is to understand how gene regulatory information is encoded in genomic DNA sequence.
We have recently made significant progress in understanding how DNA sequence features specify cell-type specific mammalian enhancer activity by using kmer-based SVM machine learning approaches. For details, see:
- A method to predict the impact of regulatory variants from DNA sequence. Lee D, Gorkin DU, Baker M, Strober BJ, Asoni AL, McCallion AS, Beer, MA. Nature Genetics 2015.
- Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features. Ghandi M, Lee D, Mohammad-Noori M, and Beer MA. 2014. PLOS Computational Biology. July 17, 2014.
- Mammalian Enhancer Prediction. Lee D, Beer MA. 2014. Genome Analysis: Current Procedures and Applications. Horizon Press
- Robust k-mer Frequency Estimation Using Gapped k-mers. Ghandi M, Mohammad-Noori M, and Beer MA. 2013. Journal of Mathematical Biology 69:469-500.
- kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic datasets. Fletez-Brant C*, Lee D*, McCallion AS and Beer MA. 2013. Nucleic Acids Research 41: W544–W556.
- Integration of ChIP-seq and Machine Learning Reveals Enhancers and a Predictive Regulatory Sequence Vocabulary in Melanocytes. Gorkin DU, Lee D, Reed X, Fletez-Brant C, Blessling SL, Loftus SK, Beer MA, Pavan WJ, and McCallion AS. 2012. Genome Research 22:2290-2301.
- Discriminative prediction of mammalian enhancers from DNA sequence. Lee D, Karchin R, and Beer MA. 2011. Genome Research 21:2167-2180.
Our work uses functional genomics DNase-seq, ATAC-seq, ChIP-seq, RNA-seq, MPRA, Hi-C, and chromatin state data to computationally identify combinations of transcription factor binding sites which operate to define the activity of cell-type specific enhancers. We are currently focused on:
- improving on the SVM methodology by including more general sequence features and constraints
- predicting the impact of SNPs on enhancer activity (delta-SVM) and GWAS association for specific diseases
- experimentally assessing the predicted impact of regulatory element mutation in mammalian cells
- systematically determining regulatory element logic from ENCODE human and mouse data
- using this sequence based regulatory code to assess common modes of regulatory element evolution and variation
We are located in the McKusick-Nathans Institute of Genetic Medicine, and the Department of Biomedical Engineering, which has long been a leader in the development of rigorous quantitative modeling of biological systems, and is a natural home for graduate studies in Bioinformatics and Computational Biology at Johns Hopkins, including research in Genomics, Systems Biology, Machine Learning, and Network Modeling.
About Computational Biology in JHU Biomedical Engineering:
The Department of Biomedical Engineering has long been a leader in the development of rigorous quantitative modeling of biological systems, and is a natural home for graduate studies in Bioinformatics and Computational Biology at Johns Hopkins. Students with backgrounds in Physics, Mathematics, Computer Science and Engineering are encouraged to apply. Opportunities for research include: Computational Medicine, Genomics, Systems Biology, Machine Learning, and Network Modeling. Graduate students in Johns Hopkins' Biomedical Engineering programs can select research advisors from throughout Johns Hopkins' Medical Institutions, Whiting School of Engineering, and Krieger School of Arts and Sciences.