Revisiting the Central Dogma: the distinct roles of genome, methylation, transcription, and translation on protein expression in Arabidopsis thaliana
Authors: Zhong, Z., Bailey, M., Kim, Y.-I., Pesaran-Afsharyan, N., Parker, B., Arathoon, L., Li, X., Rundle, C. A., Behrens, A., Nedialkova, D. D., Slavov, G., Hassani-Pak, K., Lilley, K. S., Theodoulou, F. L., Mott, R.
The study combined long‑read whole‑genome assembly, multi‑omics profiling (DNA methylation, mRNA, ribosome‑associated transcripts, tRNA abundance, and protein levels) in two Arabidopsis thaliana accessions to evaluate how genomic information propagates through the Central Dogma. Codon usage in gene sequences emerged as the strongest predictor of both mRNA and protein abundance, while methylation, tRNA levels, and ribosome‑associated transcripts contributed little additional information under stable conditions.
Using a barley pangenome of 76 genotypes and a pan‑transcriptome subset of 20, the study characterizes the diversity and evolutionary dynamics of CCT motif genes, uncovering novel frameshift variants and clade‑specific domain expansions. Phylogenetic and tissue‑specific expression analyses reveal functional divergence among paralogs, and the unexpected retention of the VRN2 repressor in spring barley suggests additional regulatory mechanisms beyond vernalization.
Phylogenetic analysis reveals that non‑seed plants, exemplified by the liverwort Marchantia polymorpha, possess a streamlined repertoire of cyclin and CDK genes, with only three cyclins active in a phase‑specific manner during vegetative development. Single‑cell RNA‑seq and fluorescent reporter assays, combined with functional overexpression studies, demonstrate the distinct, non‑redundant roles of MpCYCD;1, MpCYCA, and MpCYCB;1 in G1 entry, S‑phase progression, and G2/M transition, respectively.
The study performed a comprehensive computational analysis of the Arabidopsis thaliana proteome, classifying 48,359 proteins by melting temperature (Tm) and melting temperature index (TI) and linking thermal stability to amino acid composition, molecular mass, and codon usage. Machine‑learning and evolutionary analyses revealed that higher molecular mass and specific codon pairs correlate with higher Tm, and that gene duplication has driven the evolution of high‑Tm proteins, suggesting a genomic basis for stress resilience.
The study shows that silencing of NOR2 rRNA genes in Arabidopsis thaliana depends primarily on CHH-context cytosine methylation, particularly mediated by CMT2 and the chromatin remodeler DDM1, rather than CG or CHG methylation. Comparative promoter analysis revealed a prevalence of CHH sites in plant rDNA promoters, explaining why CHH methylation mutants disrupt NOR2 silencing more strongly, while NOR2 loci are hyper‑methylated and more condensed than NOR4.
The study characterizes all seven malic enzyme genes in tomato, analyzing their tissue-specific expression, temperature and ethylene responsiveness, and linking specific isoforms to metabolic processes such as starch and lipid biosynthesis during fruit development. Phylogenetic, synteny, recombinant protein biochemical assays, and promoter analyses were used to compare tomato enzymes with Arabidopsis counterparts, revealing complex evolutionary dynamics that decouple phylogeny from functional orthology.
The study sampled 94 individuals from eight Atlantic Forest populations to assess morphological and genetic variation among Inga subnuda subspecies and the related Inga vera subsp. affinis. Using plastid trnD‑trnT spacer and nuclear ITS1/2 sequences, phylogenetic analyses revealed distinct structuring of I. subnuda subsp. subnuda and a cohesive group comprising I. subnuda subsp. luschnathiana and I. vera subsp. affinis, indicating retention of ancestral polymorphism from recent diversification and prompting a taxonomic revision of subsp. luschnathiana.