Thursday, December 1, 2016

Lecture 14: David Holland (Margaret Johnson Lab)

Finding the right partner in a crowded world

Mammalian cells contain ∼21,000 genes encoding upwards to 100,000 protein types. 5-40% of cell volume is occupied by macromolecules, posing challenges for cell proteins to locate functional partners and increasing the risk of nonspecific (nonfunctional) interactions. Overexpressed proteins, in particular, will saturate functional partners, leaving leftovers for nonspecific binding instead. Eukaryotic cells have evolved various methods to help proteins function reliably, including compartmentalization, allostery, and structural and chemical properties of binding sites. Cells may optimize specificity as a function of their binding networks; first through concentration balance, next through network structure. Using the Gillespie algorithm, David Holland and Dr. Margaret Johnson from Johns Hopkins Biophysics simulated specific and nonspecific binding in 500 networks of 90-200 nodes with varying topological properties under equal, random, and stoichiometrically balanced protein concentrations. Binding affinities for all specific and nonspecific interactions were determined using a coarse-grained protein sequence model. The research team found out that the concentration balance significantly reduced the number of nonspecific interactions, as did local topological patterns (motifs) that allowed increased difference between specific and nonspecific binding affinities. 
Schematic illustrating (left) the binding of three proteins, where matching features indicate binding interfaces, (center) the corresponding interface-interaction network with orange and green indicating shared and non-shared binding interfaces, respectively, and (right) the protein-protein interaction network (from [1]).
To test if these motifs are selected for in real networks, they sampled possible interface-interaction networks (IINs) for the Clathrin-mediated endocytosis PPI network in yeast, using a Monte Carlo algorithm and a fitness function to select for features that allow increased specificity. The fitness function reproduced several features of the real IIN, including fragmentation, a scale-free degree distribution, the presence of square motifs and a low number of chain and triangle motifs. These features differed significantly from unbiased sampling. In conclusion, there is selective pressure on the evolution of real IINs to avoid nonfunctional interactions and that non-optimal features of the real IIN (chains, larger network components) likely serve a functional purpose.

Relevant Publications:
[1]Johnson, ME, & G Hummer (2013) “Evolutionary Pressure on the Topology of Protein Interface Interaction Networks.” J Phys Chem B 117:13098-106. 
[2]Johnson, ME, G Hummer (2013). “Interface-resolved network of protein-protein interactions.” PLoS Comput Biol 9(5):e1003065 
[3]Keil, C, E Verschueren, J Yang, & L Serrano (2013) “Integration of protein abundance and structure data reveals competition in the ErbB signaling network.” Science Signaling 6(306): ra109 [4]Johnson, ME, & G Hummer (2011) “Nonspecific binding limits the number of proteins in a cell and shapes their interaction networks.” PNAS 108(2):603-8. 
[5]Zhang, J, S Maslov, & EI Shakhnovich (2008) “Constraints imposed by non-functional protein-protein interactions on gene expression and proteome size.” Mol Sys Biol 4:210 
[6]Vavouri, T, JI Semple, R Garcia-Verdugo, & B Lehner (2009) “Intrinsic protein disorder and interaction promiscuity are widely associated with dosage sensitivity.” Cell 138: 198-208.

David Holland received his B.Sc. in Biomedical Engineering from the University of Virginia in 2011. After graduating, he joined the department of Biomedical Engineering at Johns Hopkins. He is currently earning his Ph.D. under Margaret Johnson where he studies the effects of protein abundance and protein-protein interaction network structure on protein mis-interactions. In his spare time, David practices taekwondo and is also teaching a course on network science. 

Tuesday, November 8, 2016

Lecture 13: Athena Chen (Margaret Johnson Lab)

Spatial Cell Modeling Methods 
Due to the complexity of cells, it is useful to use computational tools to understand and predict mechanisms of biological processes. Ordinary differential equations (ODEs) and partial differential equations (PDEs) have proven to be successful at modeling various large systems. However, to understand certain biological processes such as bacterial cell division, high spatial resolution is necessary to discern the underlying mechanisms and interactions. ODEs and PDEs, along with the Gillespie algorithm for stochastic modeling, do not provide spatial resolution at a single-particle level; they are all concentration-based methods. MCell, the Free Propagator Reweighting Algorithm (FPR), and Smoldyn are algorithms that provide single-particle resolution, but may be computationally expensive and may not necessarily yield accurate protein dynamics.

Change in concentration of molecule A in an irreversible 3D reaction A+A -> 0. In this parameter set, the initial distances between molecules causes an increased initial reaction rate.
In her recent work, Athena Chen and her mentors Dr. Margaret Johnson and Dr. Osman Yogurtcu in the Johns Hopkins Biophysics department, analyzed the strengths and limitations of ODEs, PDEs, Gillespie, MCell, Smoldyn, and FPR through establishing and performing a set of benchmark tests. Though all modeling methods extracted the correct equilibrium concentrations for most of the tested reactions, the resulting protein dynamics were not necessarily correct. Simulations at high rates and large densities showed that despite providing single-particle resolution, Smoldyn and MCell do not pick up single-particle effects where the distance between two molecules affects the probability of binding. Furthermore, the dynamics given by Smoldyn and MCell are dependent on the time step selected. On the other hand, FPR correctly identified single-particle effects and yielded dynamics independent of the selected timestep.
As an example of the effects of molecular geometry, diffusion, and stochasticity of protein dynamics, we examined a model for bacterial cell division. From oscillations in protein concentrations, the cell can identify the center of the cell to ensure identical offspring and division of genetic information. 
Written by Athena Chen
Relevant Articles:
1-Yogurtcu, Osman N., and Margaret E. Johnson. "Theory of bi-molecular association dynamics in 2D for accurate model and experimental parameterization of binding rates.The Journal of chemical physics 143.8 (2015): 084117.
2-Andrews, Steven S., et al. "Detailed simulations of cell biology with Smoldyn 2.1." PLoS Comput Biol 6.3 (2010): e1000705.
4-Kerr, Rex A., et al. "Fast Monte Carlo simulation methods for biological reaction-diffusion systems in solution and on surfaces." SIAM journal on scientific computing 30.6 (2008): 3126-3149.
Athena Chen is currently working on her Bachelors of Arts in Biophysics and Bachelors of Science in Applied Mathematics and Statistics from Johns Hopkins University. As part of Dr. Margaret Johnson’s lab in the department of Biophysics, she analyzes the accuracy of methods for modeling the dynamics of protein interactions. In her free time, she enjoys yoga and figure skating. 

Tuesday, October 25, 2016

Lecture 12: Andrei Kucharavy (Rong Li Lab)

Cellular Adaptation Under Stress
Diagram for a cell population adaptation.
Whether the products of human activity, or naturally occurring social, economic or biological, complex systems share the same properties. Composed of a large number of individual components, their components do not have a straightforward relation to their properties and often interact one with another in unexpected ways. Because of that, different instances of the same complex systems are built from slightly different components. Such differences give rise to heterogeneity within a population, which in turn raises significant difficulties for their study. From the biological perspective, such events have been formalized as Fisher’s geometric model, that has been formalized in the thirties of the last century and has been independently re-discovered in unrelated domains as algorithms for ergodic explorations of multi-dimensional spaces for an optimal value function point. 

Andrei Kucharavy and Dr. Rong Li from Johns Hopkins Medicine propose an enhancement of Fisher’s geometric model, allowing to explain a range of previously unexplained observations in biology. Mathematical analysis of their enhancement provides a set of rules applicable to the optimization of a large class of ergodic exploration algorithms.

Related Journal Articles:
1. H. A. Orr, The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6, 119–127 (2005) 
2. H. A. Orr, R. L. Unckless, The Population Genetics of Evolutionary Rescue. PLoS Genet. 10, e1004551 (2014). 
3. P. S. Pennings, Standing genetic variation and the evolution of drug resistance in HIV. PLoS Comput. Biol. 8 (2012).
Andrei Kucharavy received his Engineer’s Degree in Physics, Mathematics, Programming and Bioinformatics from Ecole Polytechnique, France in 2011. After graduation, he did masters in computational biology at Ecole Polytechnique Fédérale de Lausanne, Switzerland. Currently, Andrei is a Ph.D. student under the joint direction of Dr. Rong Li from Johns Hopkins and Dr. Gilles Fischer, exploring molecular mechanisms of aneuploidy-enabled stress adaptation and drug resistance it enables in yeast and cancer. He works on developing computational methods enabling systematic analysis of molecular mechanisms underlying complex traits. The interface of biological network analysis and evolution theory is his particular interest.

Tuesday, October 11, 2016

Lecture 11: Dr. Ana Damjanović

Simulating with the right pH.
Schematic representation of a constant-pH simulation
with the two-dimensional EDS-HREM method.
Solution pH is one of the most important environmental factors that affects the structure and dynamics of proteins. Almost all biologically relevant properties of proteins are affected by pH: stability, folding and assembly, interactions with ligands and other biological molecules, solubility, aggregation properties, and enzymatic activity. A change in pH may induce a change in the protonation state of ionizable groups, which in turn can cause structural changes in proteins. Structural changes triggered by protonation/deprotonation can be exploited for function, such as in the case of ATP synthase, bacteriorhodopsin, cytochrome c oxidase, or the photoactive yellow protein.

Superimposed 1 ns trajectories of snake cardiotoxin
from the (A) 2D and (B) 1D constant-pH
EDS-HREM simulations at pH = 2.
Ana Damjanovic from Johns Hopkins Biophysics and her colleagues from NIH present a new method for enhanced sampling for constant-pH simulations in explicit water based on a two-dimensional (2D) replica exchange scheme. The new method is a significant extension of a previously developed constant-pH simulation method, which is based on enveloping distribution sampling (EDS) coupled with a one-dimensional (1D) Hamiltonian exchange method (HREM). EDS constructs a hybrid Hamiltonian from multiple discrete end state Hamiltonians that, in this case, represent different protonation states of the system. The ruggedness and heights of the hybrid Hamiltonian’s energy barriers can be tuned by the smoothness parameter. Within the context of the 1D EDS-HREM method, exchanges are performed between replicas with different smoothness parameters, allowing frequent protonation-state transitions and sampling of conformations that are favored by the end-state Hamiltonians. In this work, the 1D method is extended to 2D with an additional dimension, external pH. Within the context of the 2D method (2D EDS-HREM), exchanges are performed on a lattice of Hamiltonians with different pH conditions and smoothness parameters. The research team demonstrates that both the 1D and 2D methods exactly reproduce the thermodynamic properties of the semi-grand canonical (SGC) ensemble of a system at a given pH. They have tested the new 2D method on aspartic acid, glutamic acid, lysine, a four-residue peptide (sequence KAAE), and snake cardiotoxin. In all cases, the 2D method converges faster and without loss of precision; the only limitation is a loss of flexibility in how CPU time is employed. The results for snake cardiotoxin demonstrate that the 2D method enhances protonation-state transitions, samples a wider conformational space with the same amount of computational resources, and converges significantly faster overall than the original 1D method.
Ana Damjanovic received her B.Sc. in Physics from Belgrade University (1995), and a Ph.D. in Physics from University of Illinois at Urbana-Champaign (2001). For her Ph.D. she worked with Prof. Klaus Schulten on QM description of energy transfer in light-harvesting complexes in various photosynthetic organisms. She did postdocs at UC Berkeley and Johns Hopkins University. At JHU she worked on understanding hydration and conformational changes in proteins through molecular dynamics simulations. She is presently an Associate Research Scientist and Lecturer in the Dept. of Biophysics at Johns Hopkins University. Her current research interests are in the area of development and applications of molecular dynamics simulations at constant pH.

Tuesday, September 27, 2016

Lecture 10: Collin Tokheim (Rachel Karchin Lab)

Hotspots in Cancer.
Missense mutations are perhaps the most difficult mutation type to interpret in human cancers. Truncating loss-of-function mutations and structural rearrangements generate major changes in the protein product of a gene, but a single missense mutation yields only a small change in protein chemistry. The impact of missense mutation on protein function, cellular behavior, cancer etiology, and progression may be negligible or profound, for reasons that are not yet well understood. Missense mutations are frequent in most cancer types, accounting for approximately 85% of the somatic mutations observed in solid human tumors, and the cancer genomics community has prioritized the task of identifying important missense mutations discovered in sequencing studies. Whole exome sequencing (WES) studies of cancer have created new opportunities to better understand the importance of missense mutations. This enormous collection of data now allows detection of patterns with power that was unheard of a few years ago.

Comparison of hotspot detection in the TSG FBXW7 in 1D and 3D [1].
In their recent work, Collin Tokheim from Johns Hopkins Biomedical Engineering and his colleagues used The Cancer Genome Atlas mutation data and identified 3D clusters of cancer mutations ("hotspot regions") at amino-acid-residue resolution in 91 genes, of which 56 are known cancer-associated genes. The hotspot regions identified by their method are smaller than a protein domain or protein– protein interface and in many cases can be linked precisely with functional features such as binding sites, active sites, and sites of experimentally characterized mutations. The hotspot regions are shown to be biologically relevant to cancer, and they discovered that there are characteristic differences between regions in the two types of driver genes, oncogenes and tumor suppressor genes (TSG). These differences include region size, mutational diversity, evolutionary conservation, and amino acid residue physiochemistry. For the first time, the research team quantifies why the great majority of well-known hotspot regions occur in oncogenes. Because hotspot regions in TSGs are larger, more heterogeneous than those in oncogenes, they are more difficult to detect using protein sequence alone and are likely to be underreported. The results indicate that protein structure–based 3D mutation clustering increases power to find hotspot regions, particularly in TSGs.

Good reads on the subject:
[1] Tokheim, Collin, et al. "Exome-scale discovery of hotspot mutation regions in human cancer using 3D protein structure." Cancer research (2016): canres-3190.
[2] Perdigão, Nelson, et al. "Unexpected features of the dark proteome." Proceedings of the National Academy of Sciences 112.52 (2015): 15898-15903.
[3] Kamburov, Atanas, et al. "Comprehensive assessment of cancer missense mutation clustering in protein structures." Proceedings of the National Academy of Sciences 112.40 (2015): E5486-E5495.
Collin Tokheim got his bachelor's degree in biomedical engineering at the University of Iowa. After a short stint working in an RNA genomics lab, he came to Hopkins to work on a Ph.D. in Biomedical Engineering. He currently works in Rachel Karchin's lab doing computational research applied to cancer genomics. Collin's current research focuses on how protein structures used on an exome-scale can inform which missense mutations are likely drivers of cancer.

Tuesday, September 13, 2016

Lecture 9: Max C. Klein (Elijah Roberts Lab)

An epigenetic landscape
(from Epigenetics Unraveled).
Analyzing rare events in biology. 
Changes in cellular phenotype can often be tightly correlated with changes in biochemistry. The macroscopic (phenotypical) and the microscopic (biochemical) descriptions of these phenotype switching events can be unified through the theoretical framework of epigenetic landscapes (EL). In approximate terms, the EL of a cellular system is a map that describes every possible system state (expressed in terms of a count of relevant DNA, proteins, etc.) and the probability of the system being in each of these states. An EL can be used to completely describe the dynamics of its associated system, allowing for a deep level of understanding and, potentially, control of the system. It is possible to take the list of chemical species and reactions involved in a cellular process and, using theory and modeling, generate the corresponding EL. Unfortunately, the computer time required to generate an EL using the standard stochastic simulation method, called Brute Force Sampling (BFS), makes it difficult to generate the EL of even a relatively simple multi-state system.

The Forward Flux Sampling (FFS) method, which belongs to a larger family of Enhanced Sampling methods, was previously developed by ten Wolde and coworkers to address this computational limitation. Although FFS can speed up EL simulations by multiple orders of magnitude, there is a commensurate increase in the complexity of simulation setup. Specifically, there are a number of novel free parameters that the simulation user must specify. Each of these parameters has a significant and indirect influence on the precision of an FFS simulation’s final results. It is, therefore, desirable to develop a version of the FFS algorithm in which the relationship of parameter choice to error is made explicit.

Max Klein and Dr. Elijah Roberts from Johns Hopkins Biophysics have designed and implemented a supercomputer-compatible version of the Forward Flux Enhanced Sampling method for stochastic simulation of biochemical networks. Their version greatly simplifies parameter choice and simulation setup relative to the base Forward Flux algorithm. A user of the program needs only to specify the desired level of precision error. The program will determine a set of parameters to use that will achieve this target error level while optimizing the simulation run time. The Forward Flux implementation has been tested and verified in terms of its ability to calculate switching rate constants and epigenetic landscapes. A preview build of the code is currently available here.

Related Articles:
[1] Dickson, Alex, and Aaron R. Dinner. "Enhanced sampling of nonequilibrium steady states." Annual review of physical chemistry 61 (2010): 441-459.
[2] Allen, Rosalind J., Daan Frenkel, and Pieter Rein ten Wolde. "Forward flux sampling-type schemes for simulating rare events: Efficiency analysis." The Journal of chemical physics 124.19 (2006): 194111.
[3] Becker, Nils B., and Pieter Rein ten Wolde. "Rare switching events in non-stationary systems." The Journal of chemical physics 136.17 (2012): 174119.
[4] Gardner, Timothy S., Charles R. Cantor, and James J. Collins. "Construction of a genetic toggle switch in Escherichia coli." Nature 403.6767 (2000): 339-342.
Max C. Klein received his B.A. in Physics, from Reed College, Oregon in 2013. After graduation, he joined the Program in Molecular Biophysics at Johns Hopkins University. Currently, with Dr. Elijah Roberts from the Biophysics Department, Max is developing new methods to simulate decision making in cells using a hybrid multi-CPU/multi-GPU computational architecture.

Tuesday, September 6, 2016

Lecture 8: Dr. Yasser Aboelkassem (Natalia Trayanova Lab)

The heart of the matter. 
Mesh representation of the
ventricles of the heart
Ventricular relaxation occurs as intracellular calcium drops to resting levels. Under low calcium conditions, contraction is inhibited by the troponin/tropomyosin complex. However, experimental evidence has long suggested that some degree of actin-myosin interaction is possible even in the absence of calcium. Under calcium-free conditions, as many as 5% of actin binding sites are occupied by myosin, according to some estimates made from solution studies of purified myofilament components. Despite abundant in vitro evidence for calcium-independent activation (CIA), its relevance to in vivo cardiac function is not clear. Striated muscle preparations can produce small amounts of actin-myosin-based force under low calcium conditions, especially near physiological temperatures. This suggests that residual actin-myosin cross bridges resist diastolic filling, adding to the resistance provided by other structures such as collagen and titin. However, distinguishing the contributions of these various factors is technically challenging, and cross bridge-based diastolic stiffness remains controversial.

Schematic diagram of model components and states [1].
In their recent study, Dr. Yasser Aboelkassem and his colleagues investigate CIA using computational analysis by adding a structurally motivated representation of this phenomenon to an existing myofilament model, which allowed predictions of CIA-dependent muscle behavior. The researchers found that a certain amount of CIA was essential for the model to reproduce reported effects of nonfunctional troponin C on myofilament force generation. Consequently, those data enabled estimation of ΔGCIA, the energy barrier for activating a thin filament regulatory unit in the absence of calcium. Using this estimate of ΔGCIA as a point of reference (∼7 kJ mol−1), they examined its impact on various aspects of muscle function through additional simulations. CIA decreases the Hill coefficient of steady-state force while increasing myofilament calcium sensitivity. At the same time, CIA has minimal effect on the rate of force redevelopment after slack/restretch. Simulations of twitch tension show that the presence of CIA increases peak tension while profoundly delaying relaxation. We tested the model’s ability to represent perturbations to the calcium regulatory mechanism by analyzing twitch records measured in transgenic mice expressing a cardiac troponin I mutation (R145G). The effects of the mutation on twitch dynamics were fully reproduced by a single parameter change, namely lowering ΔGCIA by 2.3 kJ mol−1 relative to its wild-type value. The analyses of Dr. Aboelkassem's and his team suggest that CIA is present in cardiac muscle under normal conditions and that its modulation by gene mutations or other factors can alter both systolic and diastolic function.
[1] Aboelkassem, Yasser, et al. "Contributions of Ca 2+-Independent Thin Filament Activation to Cardiac Muscle Function." Biophysical journal 109.10 (2015): 2101-2112.
Dr. Yasser Aboelkassem obtained his BSc. in Aerospace Engineering from Cairo University, Egypt and received his master's degree in Mechanical Engineering from a joint program between Concordia-McGill University, Montreal, Canada. After graduation, Dr. Aboelkassem moved to Virginia Tech where he obtained a Master in Applied Mathematics degree and a Ph.D. in Engineering Science and Mechanics in 2012. He did his first postdoctoral training working in cardiac mechanics at the department of Biomedical Engineering at Yale University. Currently, he is a postdoc research associate with Natalia Trayanova at the Institute of Computational Medicine, Johns Hopkins University.

Dr. Aboelkassem has published 16 first-author papers and co-authored 6 papers in peer-reviewed international journals. His current research focus is the multiscale modelling of cardiac thin filament activation.

Tuesday, August 16, 2016

Lecture 7: Chris Bohrer (Elijah Roberts & Jie Xiao Labs)

The Chromosomes Fight Against Disorder in E. coli
Transcription in Escherichia coli generates positive supercoiling in the DNA, which is relieved by the enzymatic activity of gyrase. Recently published experimental evidence suggests that transcription initiation and elongation are inhibited by the buildup of positive supercoiling. It has therefore been proposed that intermittent binding of gyrase plays a role in transcriptional bursting. Considering that transcription is one of the most fundamental cellular processes, it is desirable to be able to account for the buildup and release of positive supercoiling in models of transcription.
Positive Supercoiling (PCOIL) is
produced when mRNA is transcribed
(from Bohrer&Roberts,BMC,2016).

In their recent paper, Chris Bohrer and Elijah Roberts from Johns Hopkins Biophysics present a detailed biophysical model of gene expression that incorporates the effects of supercoiling due to transcription. By directly linking the amount of positive supercoiling to the rate of transcription, the model predicts that highly transcribed genes’ mRNA distributions should substantially deviate from Poisson distributions, with enhanced density at low mRNA copy numbers. Additionally, the model predicts a high degree of correlation between expression levels of genes inside the same supercoiling domain.

The model, incorporating the supercoiling state of the gene, makes specific predictions that differ from previous models of gene expression. Genes in the same supercoiling domain influence the expression level of neighboring genes. Such structurally dependent regulation predicts correlations between genes in the same supercoiling domain. The topology of the chromosome therefore creates a higher level of gene regulation, which has broad implications for understanding the evolution and organization of bacterial genomes.
Chris graduated from The Kent State University with a degree in physics and education. He is interested in studying the dynamics of gene regulatory circuits, and so decided to join both Dr. Elijah Robert’s lab as well as Dr. Jie Xiao’s lab at Johns Hopkins University Biophysics, in order to be proficient in both theory and experiments. Chris plays the guitar, runs, and makes sushi when not in the lab.

Tuesday, August 2, 2016

Lecture 6: Henry Lessen (Karen Fleming Lab)

Surprising results from a recent Beta-barrel membrane protein elasticity

OMPs have a common beta-barrel architecture. Image
from: Plummer, A. M., & Fleming, K. G. (2016). From
Chaperones to the Membrane with a BAM!. TiBS.
Outer membrane proteins (OMPs) play a central role in the integrity of the outer membrane of Gram-negative bacteria. The outermost membrane of Gram-negative bacteria is the ultimate protective barrier of the cell, serving as the first line of defense that guards against extracellular threats. Composed of both lipids and thousands of OMPs, biogenesis of outer membrane (OM) components and consequent OM integrity is essential for cell viability. Targeting these processes is a promising route for directed drug design against bacterial pathogens. Understanding of the OMP assembly machinery in bacteria has grown immensely owing to recent discoveries using several orthogonal techniques that include the publications of the crystal structures of key proteins recently.

OMPs have a common (beta-barrel) architecture, but they come in different sizes and functions (e.g. structural, adhesive, enzymatic and transport). OMPs are devoid of traditional cellular energy sources which point to the fact that the physical properties of the system are important. Lateral pressure applied onto the OMPs through the membrane is a key physics to study. Based on these observations, Henrey Lessen and his colleagues, using molecular dynamics simulations and an in-house developed biophysics model, obtain the elastic modulus and time-dependent forces acting on the barrel structure.
Henry Lessen is from Alexandria, Louisiana. He completed his Bachelors of Science in Microbiology at Texas A&M University in College Station, TX where his undergraduate research was focused on improving the alkaline tolerance of cyanide degrading enzymes from yeast for possible use in bioremediation of metal-mining waste.

Currently, Henry is in Karen Fleming's lab in the Jenkins Department of Biophysics at Johns Hopkins University. His thesis research involves using experimental and computational methods to study the energetics of transmembrane backbone hydrogen bonds in the outer membrane proteins of gram-negative bacteria.

Tuesday, July 19, 2016

Lecture 5: Dr. Julia Köhler Leman (Jeff Gray Lab)

Computational membrane protein structure prediction

Membrane proteins are critical functional molecules in the human body, constituting more than 30% of open reading frames in the human genome. As we also learned in Lecture 4, the determination of membrane protein (MP) structures has always trailed that of soluble proteins due to difficulties in their overexpression, reconstitution into membrane mimetics, and subsequent structure determination. The percentage of MP structures in the protein databank (PDB) has been at a constant 1–2% for the last decade. In contrast, over half of all drugs targetMPs, only highlighting how little we understand about drug-specific effects in the human body. To reduce this gap, researchers have attempted to predict structural features of MPs even before the first structure was experimentally elucidated.

Computational tools are instrumental to membrane protein structure prediction, consequently increasing our understanding of membrane protein function and their role in disease. In their recent article, Dr. Julia Kohler Leman and her colleagues describe a general framework facilitating membrane protein modeling and design that combines the scientific principles for membrane protein modeling with the flexible software architecture of Rosetta3. This new framework, called RosettaMP, provides a general membrane representation that interfaces with scoring, conformational sampling, and mutation routines that can be easily combined to create new protocols. To demonstrate the capabilities of this implementation, the researchers developed four proof-of-concept applications for (1) prediction of free energy changes upon mutation; (2) high-resolution structural refinement; (3) protein-protein docking; and (4) assembly of symmetric protein complexes, all in the membrane environment. The preliminary results show that these algorithms can produce meaningful scores and structures. The data also suggest needed improvements to both sampling routines and score functions. Importantly, the applications collectively demonstrate the potential of combining the flexible nature of RosettaMP with the power of Rosetta algorithms to facilitate membrane protein modeling and design.
Dr. Julia Koehler Leman received her Bachelors and Masters of Science degrees in physics from the University of Leipzig, Germany. Her doctoral research revolved around membrane protein structure determination using NMR spectroscopy and computational techniques and she received her Ph.D. in Chemical and Physical Biology in 2012 while working with Drs. Jens Meiler and Charles Sanders at Vanderbilt University. After graduation, she worked as a postdoctoral fellow under the supervision of Dr. Jeff Gray at Hopkins on the projects that involve development and application of Rosetta software, focusing on creating and improving applications for structure prediction, docking, and design. Very recently, she started working as a research fellow at Simons Foundation. 

Dr. Koehler Leman has published 7 first-author papers and co-authored 6 papers in peer-reviewed international journals. Her current research interests lie in computational methods development in modeling membrane proteins with and without restraints from NMR spectroscopy.

Tuesday, July 5, 2016

Lecture 4: Dr. Sirish Kaushik Lakkaraju (Alex MacKerell Lab at UMB)

In search for a better drug against asthma (... and many others)
The seven-transmembrane α-helix structure
of a G protein–coupled receptor (wikipedia).

Occluded ligand binding pockets (LBP) in proteins with minimal or no accessibility to the surrounding environment represent a significant, yet challenging opportunity for structure-based and computer-aided drug design approaches. LBPs of more than half of all clinical drug targets, including the G-protein coupled receptors (GPCR) and nuclear receptors (NR) are either partially or fully occluded. As the efficacies of ligands of both GPCRs and NRs are known to be coupled to small conformational changes in their binding sites, accurate modeling of these sites is critical for future development of therapeutic agents.

To determine free energy maps of functional groups of LBPs, Dr. Kaushik Lakkaraju and his colleagues from the University of Maryland Baltimore combined a Grand-Canonical Monte Carlo/Molecular Dynamics (GCMC/MD) strategy [1] with the Site Identification by Ligand Competitive Saturation (SILCS) methodology. In this method, the inclusion of protein flexibility helps identify regions of the binding pockets not accessible in crystal conformations and allows for better quantitative estimates of relative ligand binding affinities in all the proteins tested. The research group’s approach successfully recapitulates locations of functional groups across diverse classes of ligands in the LBPs of many important NR and GPCR proteins. In their study, the differences in functional group requirements of the active and inactive states of the β2AR LBP were used in virtual drug screening to identify high-efficacy agonists targeting β2AR in Airway Smooth Muscle (ASM) cells which are very important drug targets in patients with severe asthma. The experimental tests showed that 7 of the 15 selected ligands effect ASM relaxation, representing a very high hit rate (46%) [2].

FragMaps overlaid on the LBP of AR [2]
The conformational dynamics of a macromolecule can be modulated by a number of factors, including changes in the environment, ligand binding, and interactions with other macromolecules, among others. Therefore, there is always a need for faster extraction and comparison of functionally relevant conformational dynamics information from extended molecular simulations of large and complex macromolecular systems. Dr. Lakkaraju and his colleagues also have developed DIRECT-ID, a method that quantifies the differences in macromolecular conformational dynamics and automatically extracts the structural features responsible for these changes. Given a set of molecular dynamics (MD) simulations of a macromolecule, the method calculates the norms of the differences in covariance matrices for each pair of trajectories. A matrix of these norms thus quantifies the differences in conformational dynamics across the set of simulations. As a demonstration of its applicability to bio-macromolecular systems, DIRECT-ID was used to identify relevant ligand-modulated structural variations in the β2 -adrenergic (β2 AR) G-protein coupled receptor. Micro-second MD simulations of the β2 AR in an explicit lipid bilayer were run in the apo-state and complexed with the ligands: BI-167107 (agonist), epinephrine (agonist), salbutamol (long-acting partial agonist), or carazolol (inverse agonist). Each ligand was found to modulate the conformational dynamics of β2 AR differently and DIRECT-ID analysis of the inverse-agonist vs. agonist-modulated β2 AR identified residues known through previous studies to selectively propagate deactivation/activation information, along with some previously unidentified ligand-specific microswitches across the GPCR [3]. DIRECT-ID is accessible from here.

[1] Lakkaraju, S. K.; Raman, E. P.; Yu, W.; MacKerell, A. D.Sampling of Organic Solutes in Aqueous and Heterogeneous Environments using Oscillating Excess Chemical Potentials in Grand Canonical-Like Monte Carlo-Molecular Dynamics Simulations J. Chem. Theory Comput. 2014,10, 2281– 2290
[2] Lakkaraju, Sirish Kaushik, et al. "Mapping Functional Group Free Energy Patterns at Protein Occluded Sites: Nuclear Receptors and G-Protein Coupled Receptors." Journal of chemical information and modeling 55.3 (2015): 700-708.
[3] Lakkaraju, Sirish Kaushik, et al. "DIRECT‐ID: An automated method to identify and quantify conformational variations—application to β2‐adrenergic GPCR." Journal of computational chemistry 37.4 (2016): 416-425.

Dr. Sirish Kaushik Lakkaraju received his Bachelor of Engineering degree in electronics and instrumentation from the University of Madras at Chennai, India in 2003. He received his Master of Science degree in biomedical engineering from Texas A&M University in 2008 and his Ph.D. in December 2011 with Dr. Wonmuk Hwang. After graduation, he worked as a postdoctoral fellow under the supervision of Dr. Alex MacKerell at University of Maryland, Baltimore on the projects that involve method development and in-silico drug design. Recently, he started his second postdoc at Pfizer. 

Dr. Lakkaraju has published 8 first-author papers and co-authored 2 papers in peer-reviewed international journals. His current research interests lie in studying the role of dynamics of biological macromolecules and macromolecular assemblies on their function and also computational drug design.

Tuesday, June 21, 2016

Lecture 3: Dr. Jin Seob Kim

How do Keratin Intermediate Filaments self-assemble?

   Skin epidermal layers [2]
Intermediate filaments (IFs) provide structural and mechanical support vitally important to maintenance of cell and tissue integrity under stress. In vitro studies established that IFs must be crosslinked into a network in order to generate the elasticity and mechanical properties consistent with their mechanical support role in living cells. Accordingly, keratin IFs are typically organized into large bundles in epithelial surface. Skin epithelial cells express type I and type II IF genes, whose protein products copolymerize to form 10-nm-wide IFs in their cytoplasm. Experimentally, various types of keratin IFs are able to self-organize into crosslinked networks at subphysiological concentrations. For IFs comprised of keratin proteins 5 and 14 (K5, K14), found in basal keratinocytes of epidermis, bundling can be self-driven through interactions between K14's carboxy-terminal tail domain and two regions in the central α-helical rod domain of K5. 
Mechanisms of keratin filament bundling in basal keratinocytes of epidermis.
Ultrastructural examination shows that keratin filaments are abundant and show a
loosely bundled organization in basal layer keratinocytes of epidermis [3].
A number of fundamental questions remain about the pathway through which the keratin IF surface can promote the crosslinked network formation. In their work, Jin Seob Kim, Chang-Hun Lee and Pierre Coulombe from Johns Hopkins exploit theoretical principles and computational modeling to investigate how IF crosslinking happens. In their simple model, keratin IFs are treated as rigid rods to apply Brownian dynamics simulation. The authors' findings suggest that long-range interactions between IFs are required to initiate the formation of bundle-like configurations, while tail domain-mediated binding events act to stabilize them. The simple model explains the differences observed in the mechanical properties of wild-type versus disease-causing, defective IF networks. This effort extends the notion that the structural support function of keratin IFs necessitates a combination of intrinsic and extrinsic determinants, and makes specific predictions about the mechanisms involved in the formation of crosslinked keratin networks in vivo.
Theoretical time evolution of bundle formation [1]. 

Jin Seob's Article:
[1] Kim, Jin Seob, Chang-Hun Lee, and Pierre A. Coulombe. "Modeling the self-organization property of keratin intermediate filaments." Biophysical journal 99.9 (2010): 2748-2756.

[2] Matsusaki, Michiya, et al. "Development of full‐thickness human skin equivalents with blood and lymph‐like capillary networks by cell coating technology." Journal of Biomedical Materials Research Part A 103.10 (2015): 3386-3396.
[3] Lee, Chang-Hun, and Pierre A. Coulombe. "Self-organization of keratin intermediate filaments into cross-linked networks." The Journal of cell biology 186.3 (2009): 409-421.
Dr. Jin Seob Kim received a B.A. degree and M.S. degree in mechanical engineering from Seoul National University. After his master’s degree, he worked as a research engineer at SAMSUNG Electronics. In 2006, he received his Ph.D. degree in mechanical engineering at the Johns Hopkins University under the supervision of Dr. Gregory S. Chirikjian. After graduation, he worked as a postdoctoral fellow under the supervision of Drs. Sean Sun and Pierre Coulombe in the projects on computational cell morphology, cytoskeletal filaments, and skin homeostasis. Currently, he is an assistant research professor in the department of mechanical engineering at the Johns Hopkins University.

Dr. Kim’s research interests lie in mathematical modeling and simulation in the broad areas ranging from biology (molecular to cellular biology, systems biology and mechanobiology) and engineering (robotics and dynamics). 

Dr. Kim has published highly cited 9 first author papers and co-authored 4 papers in peer reviewed international journals. Also he has published 4 peer-reviewed papers in highly cited international conferences on robotics and computational biology.

Tuesday, June 7, 2016

Lecture 2: Nash Rochman (Sean Sun Lab)

To Grow is Not Enough

E. coli cell cycle duration distributions measured
at constant nutrient conditions. [1]
The mantra, “Survival of the Fittest,” coined by Spencer and popularized by Darwin himself, pervades every corner of biology. Fitness is usually defined to be the “birth-rate” or the rate at which new individuals are added to the population. Cooperative and multicellular systems may require a more complicated definition; but often even these phenomena are shown to derive from the maximization of total sustainable single cell number. In the case of non-cooperative, single cell species (e.g. bacteria at low cell density), fitness as birth-rate is accepted. For such a population during exponential growth, the number of cells in an ensemble can be well described as a function of time if we know the initial number N0, and the cell cycle duration τ, yielding N(t) = N0 exp(ln(2)t/τ). In this way the constant r = ln(2)/τ, often labeled the“growth-rate”, is used to measure fitness - the larger r and the faster an organism grows, the fitter it is.

Quantitative single cell measurements have shown that cell cycle duration (CCD, the time between cell divisions) for diverse cell types is a noisy variable. The underlying distribution of CCD is mean scalable with a universal shape for many cell types in a variety of environments. In their recent article, Nash Rochman and Sean Sun from Johns Hopkins mechanical engineering department have developed a phenomenological model for the regulation of cellular division time distributions determining both bulk growth rate and ensemble fluctuations. In this model, they propose a cellular ‘‘fitness’’ function which incorporates not only growth rate, which is maximized when fluctuations are minimized, but also ensemble response time to environmental stimulus which decreases for increasing fluctuations. Experimental single cell division data is collected on a population of isogenic cells subjected to varying environmental stimuli and compared to the model. The authors then have shown through both experiment and theory that increasing the amount of noise in the regulation of the cell cycle negatively impacts the growth rate, but positively correlates with improved cellular response to fluctuating environments. These results suggest that even non-cooperative cells in exponential growth phase do not optimize fitness through growth rate alone, but also optimize adaptability to changing conditions. In a manner similar to genetic evolution, increasing the noise in biochemical processes correlates with improved response of the system to environmental changes.
Nash's theory compared with eight step environmental change experiments. The experimental distributions are displayed using colors with highest probability in red and lowest probability in blue. The black lines are the model predictions for the average. [1]
Nash’s Article:
[1]: Rochman, Nash, Fangwei Si, and Sean X. Sun. "To Grow is Not Enough: Impact of Noise on Cell Environmental Response and Fitness." arXiv preprint arXiv:1603.01579 (2016).

Good reads on the subject:
P1: Hashimoto, Mikihiro, et al. "Noise-driven growth rate gain in clonal cellular populations." Proceedings of the National Academy of Sciences (2016): 201519412.
P3: Stukalin, Evgeny B., et al. "Age-dependent stochastic models for understanding population fluctuations in continuously cultured cells." Journal of The Royal Society Interface 10.85 (2013): 20130325.

Nash Rochman studied chemical physics and mathematics in college (started at Bard College at Simon’s Rock and transferred to Brown) taking a particular interest in evolutionary dynamics which brought him to the ChemBE department here at Hopkins for his PhD. With his advisor Sean Sun (MechE), he has become engaged in a variety of problems motivated by exciting analytical predictions that also provide the potential for convincing experimental verification. When not in the office/lab, he likes to play and compose music – playing mostly jazz (trumpet) and writing mostly concert music.