Revisiting the Central Dogma: the distinct roles of genome, methylation, transcription, and translation on protein expression in Arabidopsis thaliana
Authors: Zhong, Z., Bailey, M., Kim, Y.-I., Pesaran-Afsharyan, N., Parker, B., Arathoon, L., Li, X., Rundle, C. A., Behrens, A., Nedialkova, D. D., Slavov, G., Hassani-Pak, K., Lilley, K. S., Theodoulou, F. L., Mott, R.
The study combined long‑read whole‑genome assembly with multi‑omics profiling (DNA methylation, mRNA, ribosome‑associated transcripts, tRNA abundance, and protein levels) in two Arabidopsis thaliana accessions to evaluate how genomic information propagates through the Central Dogma. Codon usage in gene sequences emerged as the strongest predictor of both mRNA and protein abundance, while methylation, tRNA levels, and ribosome‑associated transcripts contributed little additional information under stable conditions.
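As a rough illustration of what "codon usage" means as a per-gene feature, the sketch below computes relative codon frequencies from a coding sequence. It is not the authors' pipeline: the function name `codon_frequencies` and the toy sequence are hypothetical, and the summary does not say how codon usage was actually encoded or modeled.

```python
from collections import Counter


def codon_frequencies(cds: str) -> dict[str, float]:
    """Return the relative frequency of each codon in a coding sequence.

    Assumption: `cds` is in frame, starting at the first base of a codon.
    Trailing bases that do not fill a codon, and codons containing
    ambiguous bases (anything other than A, C, G, T), are skipped.
    """
    seq = cds.upper().replace("U", "T")  # accept RNA as well as DNA input
    usable = len(seq) - len(seq) % 3     # drop an incomplete trailing codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    codons = [c for c in codons if set(c) <= set("ACGT")]
    counts = Counter(codons)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {codon: n / total for codon, n in counts.items()}


# Toy example (made-up sequence): ATG GCT GCT TAA
freqs = codon_frequencies("ATGGCTGCTTAA")
print(freqs)
```

A vector of such frequencies, one per gene, is the kind of sequence-derived feature that could be compared against measured mRNA or protein abundance.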
A separate study performed a comprehensive computational analysis of the Arabidopsis thaliana proteome, classifying 48,359 proteins by melting temperature (Tm) and melting temperature index (TI) and linking thermal stability to amino acid composition, molecular mass, and codon usage. Machine‑learning and evolutionary analyses revealed that higher molecular mass and specific codon pairs correlate with higher Tm, and that gene duplication has driven the evolution of high‑Tm proteins, suggesting a genomic basis for stress resilience.
Another study examined transposable element (TE) silencing in the duckweed Spirodela polyrhiza, which exhibits unusually low DNA methylation, scarce 24‑nt siRNAs, and missing RdDM components. Degenerated TEs lack DNA methylation and H3K9me2 but retain the heterochromatin marks H3K9me1 and H3K27me1, whereas the few intact TEs show high DNA methylation and H3K9me2. These patterns indicate a shift in RdDM focus toward potentially active TEs and suggest that heterochromatin can be maintained independently of DNA methylation in flowering plants.