Recently, studying the structure of proteins using angular representations has attracted much attention. To address the challenge of efficient modeling, we take into consideration the continuous conformational space of protein structures and model to discover the commonalities between different Ramachandran plots. It is not usual to observe entirely different biological functionality for certain groups of proteins that have common overall structure, because of local differences in that structure. Therefore, protein structure classification, which reveals an evolutionary relationship between the biochemical compounds, has broad implications and applications in understanding the functionality of the proteins for scientists. This general framework also provides a comprehensive machinery for clustering, model assessment,and data modeling for groups of protein backbone angles. Joint estimation of the protein sets lets us use commonalities within the group for estimating the density of small sample size Ramachandran plots and also allows for recognition of outliers.
In Maadooliat et. al (2015), we developed a comprehensive statistical model to characterize the protein as a circular manifold and describe the protein angles with a smaller number of dimensions. The proposed method takes into account the circular nature of the angular data using a penalized spline which is statistically more relaxed than a parametric angular distribution using a sphere or torus.
To assess the proposed model, we implemented estimations of Ramachandran neighbor-dependent of amino acids in Rosetta software and showed that the results of loop modeling in a benchmark set are more consistent than competing techniques.
Leave a comment
Make sure you enter all the required information, indicated by an asterisk (*). HTML code is not allowed.