Understanding the relations between transcript features and gene expression
שחם גלעד
תלמיד המחלקה להנדסה ביו רפואית לתואר שני ירצה בנושא:
Understanding the relations between transcript features and gene expression
Gene expression is the process in which information encoded in the DNA is used to synthesize new proteins. Understanding the way gene expression is encoded in transcripts should not only contribute to disciplines, such as functional genomics and molecular evolution, but also to biotechnology and human health. Previous studies in the field mainly aimed at predicting protein levels of genes based on their transcript features by assuming that the effect of each transcript feature on gene expression is monotonic.
In this work, we aim to understand, for the first time, if indeed the relations between transcript features and measurements related to the different stages of gene expression are monotonic. To this end, we analyze 5,432 transcript features and gene expression measurements (e.g. mRNA levels, ribosomal densities, protein levels) of
4,367 S. cerevisiae genes. We use the Maximal Information Coefficient (MIC) in order to identify potential relations that are not necessarily linear or monotonic.
Our analyses demonstrate that the relation between most transcript features and the examined gene expression measurements is monotonic. In addition, in the cases of deviation from monotonicity the relation/deviation is very weak. These results should help in guiding the development of computational gene expression modeling and engineering, and improve the understanding of this process. Furthermore, the relatively simple relations between a transcript’s nucleotide composition and its expression should contribute towards better understanding of transcript evolution at the molecular level.
