-
The PLSDB 2025 update: enhanced annotations and improved functionality for comprehensive plasmid research Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Leidy-Alejandra G Molano, Pascal Hirsch, Matthias Hannig, Rolf Müller, Andreas Keller
Plasmids are extrachromosomal DNA molecules in bacteria and archaea, playing critical roles in horizontal gene transfer, antibiotic resistance, and pathogenicity. Since its first release in 2018, our database on plasmids, PLSDB, has significantly grown and enhanced its content and scope. From 34 513 records contained in the 2021 version, PLSDB now hosts 72 360 entries. Designed to provide life scientists
-
CATH v4.4: major expansion of CATH by experimental and predicted structural data Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Vaishali P Waman, Nicola Bordin, Andy Lau, Shaun Kandathil, Jude Wells, David Miller, Sameer Velankar, David T Jones, Ian Sillitoe, Christine Orengo
CATH (https://www.cathdb.info) is a structural classification database that assigns domains to the structures in the Protein Data Bank (PDB) and AlphaFold Protein Structure Database (AFDB) and adds layers of biological information, including homology and functional annotation. This article covers developments in the CATH classification since 2021. We report the significant expansion of structural information
-
[SNG2], a prion form of Cut4/Apc1, confers non-Mendelian inheritance of heterochromatin silencing defect in fission yeast Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Suman Sharma, Suchita Srivastava, Rudra Narayan Dubey, Poonam Mishra, Jagmohan Singh
Prions represent epigenetic regulator proteins that can self-propagate their structure and confer their misfolded structure and function on normally folded proteins. Like the mammalian prion PrPSc, prions also occur in fungi. While a few prions, like Swi1, affect gene expression, none are shown to affect heterochromatin structure and function. In fission yeast and metazoans, histone methyltransferase
-
Harmonizome 3.0: integrated knowledge about genes and proteins from diverse multi-omics resources Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Ido Diamant, Daniel J B Clarke, John Erol Evangelista, Nathania Lingam, Avi Ma’ayan
By processing and abstracting diverse omics datasets into associations between genes and their attributes, the Harmonizome database enables researchers to explore and integrate knowledge about human genes from many central omics resources. Here, we introduce Harmonizome 3.0, a significant upgrade to the original Harmonizome database. The upgrade adds 26 datasets that contribute nearly 12 million associations
-
L1-ORF1p nucleoprotein can rapidly assume distinct conformations and simultaneously bind more than one nucleic acid Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Ben A Cashen, M Nabuan Naufer, Michael Morse, Micah J McCauley, Ioulia Rouzina, Charles E Jones, Anthony V Furano, Mark C Williams
LINE-1 (L1) is a parasitic retrotransposable DNA element, active in primates for the last 80–120 Myr. L1 has generated nearly one-third of the human genome by copying its transcripts, and those of other genetic elements (e.g. Alu and SVA), into genomic DNA by target site-primed reverse transcription (TPRT) and remains active in modern humans. L1 encodes two proteins that bind their encoding transcript
-
Glucose binds and activates NSUN2 to promote translation and epidermal differentiation Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Weili Miao, Douglas F Porter, Ya Li, Lindsey M Meservey, Yen-Yu Yang, Chengjie Ma, Ian D Ferguson, Vivian B Tien, Timothy M Jack, Luca Ducoli, Vanessa Lopez-Pajares, Shiying Tao, Paul B Savage, Yinsheng Wang, Paul A Khavari
Elevations in intracellular glucose concentrations are essential for epithelial cell differentiation by mechanisms that are not fully understood. Glucose has recently been found to directly bind several proteins to alter their functions to enhance differentiation. Among the newly identified glucose-binding proteins is NSUN2, an RNA-binding protein that we identified as indispensable for epidermal differentiation
-
PWAS Hub: exploring gene-based associations of complex diseases with sex dependency Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Roei Zucker, Guy Kelman, Michal Linial
The Proteome-Wide Association Study (PWAS) is a protein-based genetic association approach designed to complement traditional variant-based methods like GWAS. PWAS operates in two stages: first, machine learning models predict the impact of genetic variants on protein-coding genes, generating effect scores. These scores are then aggregated into a gene-damaging score for each individual. This score
-
InterPro: the protein sequence classification resource in 2025 Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Matthias Blum, Antonina Andreeva, Laise Cavalcanti Florentino, Sara Rocio Chuguransky, Tiago Grego, Emma Hobbs, Beatriz Lazaro Pinto, Ailsa Orr, Typhaine Paysan-Lafosse, Irina Ponamareva, Gustavo A Salazar, Nicola Bordin, Peer Bork, Alan Bridge, Lucy Colwell, Julian Gough, Daniel H Haft, Ivica Letunic, Felipe Llinares-López, Aron Marchler-Bauer, Laetitia Meng-Papaxanthos, Huaiyu Mi, Darren A Natale
InterPro (https://www.ebi.ac.uk/interpro) is a freely accessible resource for the classification of protein sequences into families. It integrates predictive models, known as signatures, from multiple member databases to classify sequences into families and predict the presence of domains and significant sites. The InterPro database provides annotations for over 200 million sequences, ensuring extensive
-
Crosstalk between paralogs and isoforms influences p63-dependent regulatory element activity Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Gabriele Baniulyte, Abby A McCann, Dana L Woodstock, Morgan A Sammons
The p53 family of transcription factors (p53, p63 and p73) regulate diverse organismal processes including tumor suppression, maintenance of genome integrity and the development of skin and limbs. Crosstalk between transcription factors with highly similar DNA binding profiles, like those in the p53 family, can dramatically alter gene regulation. While p53 is primarily associated with transcriptional
-
ECOD: integrating classifications of protein domains from experimental and predicted structures Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 R Dustin Schaeffer, Kirill E Medvedev, Antonina Andreeva, Sara Rocio Chuguransky, Beatriz Lazaro Pinto, Jing Zhang, Qian Cong, Alex Bateman, Nick V Grishin
The evolutionary classification of protein domains (ECOD) classifies protein domains using a combination of sequence and structural data (http://prodata.swmed.edu/ecod). Here we present the culmination of our previous efforts at classifying domains from predicted structures, principally from the AlphaFold Database (AFDB), by integrating these domains with our existing classification of PDB structures
-
PGxDB: an interactive web-platform for pharmacogenomics research Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Trinh Trung Duong Nguyen, Ziaurrehman Tanoli, Saad Hassan, Umut Onur Özcan, Jimmy Caroli, Albert J Kooistra, David E Gloriam, Alexander S Hauser
Pharmacogenomics, the study of how an individual's genetic makeup influences their response to medications, is a rapidly evolving field with significant implications for personalized medicine. As researchers and healthcare professionals face challenges in exploring the intricate relationships between genetic profiles and therapeutic outcomes, the demand for effective and user-friendly tools to access
-
Protein moonlighting by a target gene dominates phenotypic divergence of the Sef1 transcriptional regulatory network in yeasts Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-20 Po-Chen Hsu, Tzu-Chiao Lu, Po-Hsiang Hung, Jun-Yi Leu
Transcriptional rewiring generates phenotypic novelty, acting as an important mechanism contributing to evolutionary development, speciation, and adaptation in all organisms. The phenotypic outcomes (functions) of transcription factor (TF) activity are determined by the combined effects of all target genes in the TF’s regulatory network. Plastic rewiring of target genes accumulates during species divergence
-
scCancerExplorer: a comprehensive database for interactively exploring single-cell multi-omics data of human pan-cancer Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Changzhi Huang, Zekai Liu, Yunlei Guo, Wanchu Wang, Zhen Yuan, Yusheng Guan, Deng Pan, Zhibin Hu, Linhua Sun, Zan Fu, Shuhui Bian
Genomic, epigenomic and transcriptomic alterations are hallmarks of cancer cells, and are closely connected. Especially, epigenetic regulation plays a critical role in tumorigenesis and progression. The growing single-cell epigenome data in cancer research provide new opportunities for data mining from a more comprehensive perspective. However, there is still a lack of databases designed for interactively
-
The European Nucleotide Archive in 2024 Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Colman O’Cathail, Alisha Ahamed, Josephine Burgin, Carla Cummins, Rajkumar Devaraj, Khadim Gueye, Dipayan Gupta, Vikas Gupta, Muhammad Haseeb, Maira Ihsan, Eugene Ivanov, Suran Jayathilaka, Vishnukumar Kadhirvelu, Manish Kumar, Ankur Lathi, Rasko Leinonen, Jasmine McKinnon, Lili Meszaros, Joana Pauperio, Stephane Pesant, Nadim Rahman, Gabriele Rinck, Sandeep Selvakumar, Swati Suman, Yanisa Sunthornyotin
The European Nucleotide Archive (ENA, https://www.ebi.ac.uk/ena), maintained at the European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) provides freely accessible services, both for deposition of, and access to, open nucleotide sequencing data. Open scientific data are of paramount importance to the scientific community and contribute daily to the acceleration of scientific
-
GenBank 2025 update Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Eric W Sayers, Mark Cavanaugh, Linda Frisse, Kim D Pruitt, Valerie A Schneider, Beverly A Underwood, Linda Yankie, Ilene Karsch-Mizrachi
GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public data repository that contains 34 trillion base pairs from over 4.7 billion nucleotide sequences for 581 000 formally described species. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. We summarize the content of the database in 2025 and recent updates such
-
Full-length direct RNA sequencing reveals extensive remodeling of RNA expression, processing and modification in aging Caenorhabditis elegans Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Erin C Schiksnis, Ian A Nicastro, Amy E Pasquinelli
Organismal aging is marked by decline in cellular function and anatomy, ultimately resulting in death. To inform our understanding of the mechanisms underlying this degeneration, we performed standard RNA sequencing (RNA-seq) and Oxford Nanopore Technologies direct RNA-seq over an adult time course in Caenorhabditis elegans. Long reads allowed for identification of hundreds of novel isoforms and age-associated
-
A deep mutational scanning platform to characterize the fitness landscape of anti-CRISPR proteins Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Tobias Stadelmann, Daniel Heid, Michael Jendrusch, Jan Mathony, Sabine Aschenbrenner, Stéphane Rosset, Bruno E Correia, Dominik Niopek
Deep mutational scanning is a powerful method for exploring the mutational fitness landscape of proteins. Its adaptation to anti-CRISPR proteins, which are natural CRISPR-Cas inhibitors and key players in the co-evolution of microbes and phages, facilitates their characterization and optimization. Here, we developed a robust anti-CRISPR deep mutational scanning pipeline in Escherichia coli that combines
-
PubChem 2025 update Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Sunghwan Kim, Jie Chen, Tiejun Cheng, Asta Gindulyte, Jia He, Siqian He, Qingliang Li, Benjamin A Shoemaker, Paul A Thiessen, Bo Yu, Leonid Zaslavsky, Jian Zhang, Evan E Bolton
PubChem (https://pubchem.ncbi.nlm.nih.gov) is a large and highly-integrated public chemical database resource at NIH. In the past two years, significant updates were made to PubChem. With additions from over 130 new sources, PubChem contains >1000 data sources, 119 million compounds, 322 million substances and 295 million bioactivities. New interfaces, such as the consolidated literature panel and
-
Intrinsically disordered RNA-binding motifs cooperate to catalyze RNA folding and drive phase separation Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Annika Niedner-Boblenz, Thomas Monecke, Janosch Hennig, Melina Klostermann, Mario Hofweber, Elena Davydova, André P Gerber, Irina Anosova, Wieland Mayer, Marisa Müller, Roland Gerhard Heym, Robert Janowski, Jean-Christophe Paillart, Dorothee Dormann, Kathi Zarnack, Michael Sattler, Dierk Niessing
RNA-binding proteins are essential for gene regulation and the spatial organization of cells. Here, we report that the yeast ribosome biogenesis factor Loc1p is an intrinsically disordered RNA-binding protein with eight repeating positively charged, unstructured nucleic acid binding (PUN) motifs. While a single of these previously undefined motifs stabilizes folded RNAs, multiple copies strongly cooperate
-
RADIP technology comprehensively identifies H3K27me3-associated RNA–chromatin interactions Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Xufeng Shu, Masaki Kato, Satoshi Takizawa, Yutaka Suzuki, Piero Carninci
Many RNAs associate with chromatin, either directly or indirectly. Several technologies for mapping regions where RNAs interact across the genome have been developed to investigate the function of these RNAs. Obtaining information on the proteins involved in these RNA–chromatin interactions is critical for further analysis. Here, we developed RADIP [RNA and DNA interacting complexes ligated and sequenced
-
Plant Metabolic Network 16: expansion of underrepresented plant groups and experimentally supported enzyme data Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Charles Hawkins, Bo Xue, Yasmin Farida, Gabrielle Wyatt, Philipp Zerbe, Seung Y Rhee
The Plant Metabolic Network (PMN) is a free online database of plant metabolism available at https://plantcyc.org. The latest release, PMN 16, provides metabolic databases representing >1200 metabolic pathways, 1.3 million enzymes, >8000 metabolites, >10 000 reactions and >15 000 citations for 155 plant and green algal genomes, as well as a pan-plant reference database called PlantCyc. This release
-
The rice genome annotation project: an updated database for mining the rice genome Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 John P Hamilton, Chenxin Li, C Robin Buell
Rice (Oryza sativa L.) is a major cereal crop that provides calories across the world. With a small genome, rice has been used extensively as a model for genetic and genomic studies in the Poaceae. Since the release of the first rice genome sequence in 2002, an improved reference genome assembly, multiple whole genome assemblies, extensive gene expression profiles, and resequencing data from over 3000
-
The STRING database in 2025: protein networks with directionality of regulation Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Damian Szklarczyk, Katerina Nastou, Mikaela Koutrouli, Rebecca Kirsch, Farrokh Mehryary, Radja Hachilif, Dewei Hu, Matteo E Peluso, Qingyao Huang, Tao Fang, Nadezhda T Doncheva, Sampo Pyysalo, Peer Bork, Lars J Jensen, Christian von Mering
Proteins cooperate, regulate and bind each other to achieve their functions. Understanding the complex network of their interactions is essential for a systems-level description of cellular processes. The STRING database compiles, scores and integrates protein–protein association information drawn from experimental assays, computational predictions and prior knowledge. Its goal is to create comprehensive
-
CAUSALdb2: an updated database for causal variants of complex traits Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Jianhua Wang, Liao Ouyang, Tianyi You, Nianling Yang, Xinran Xu, Wenwen Zhang, Hongxi Yang, Xianfu Yi, Dandan Huang, Wenhao Zhou, Mulin Jun Li
Unraveling the causal variants from genome wide association studies (GWASs) is pivotal for understanding genetic underpinnings of complex traits and diseases. Despite continuous efforts, tools to refine and prioritize GWAS signals need enhancement to address the direct causal implications of genetic variations. To overcome challenges related to statistical fine-mapping in identifying causal variants
-
Deep learning insights into distinct patterns of polygenic adaptation across human populations Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Devashish Tripathi, Chandrika Bhattacharyya, Analabha Basu
Response to spatiotemporal variation in selection gradients resulted in signatures of polygenic adaptation in human genomes. We introduce RAISING, a two-stage deep learning framework that optimizes neural network architecture through hyperparameter tuning before performing feature selection and prediction tasks. We tested RAISING on published and newly designed simulations that incorporate the complex
-
OncoSplicing 3.0: an updated database for identifying RBPs regulating alternative splicing events in cancers Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Yangjun Zhang, Kai Liu, Zhenzhen Xu, Bo Li, Xiaoliang Wu, Ruixin Fan, Xiangyang Yao, Huahui Wu, Chen Duan, Yan Gong, Ke Chen, Jin Zeng, Lu Li, Hua Xu
Alternative splicing (AS) is a crucial mechanism to regulate gene expression and protein complexity. RNA-binding proteins (RBPs) play an important role in regulating abnormal alternative splicing in cancers. However, few resources are available to identify specific RBPs responsible for regulating individual AS event. We have developed the OncoSplicing database for integrative analysis of clinically
-
NASA open science data repository: open science for life in space Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Samrawit G Gebre, Ryan T Scott, Amanda M Saravia-Butler, Danielle K Lopez, Lauren M Sanders, Sylvain V Costes
Space biology and health data are critical for the success of deep space missions and sustainable human presence off-world. At the core of effectively managing biomedical risks is the commitment to open science principles, which ensure that data are findable, accessible, interoperable, reusable, reproducible and maximally open. The 2021 integration of the Ames Life Sciences Data Archive with GeneLab
-
HERB 2.0: an updated database integrating clinical and experimental evidence for traditional Chinese medicine Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Kai Gao, Liu Liu, Shuangshuang Lei, Zhinong Li, Peipei Huo, Zhihao Wang, Lei Dong, Wenxin Deng, Dechao Bu, Xiaoxi Zeng, Chun Li, Yi Zhao, Wei Zhang, Wei Wang, Yang Wu
Clinical trials and meta-analyses are considered high-level medical evidence with solid credibility. However, such clinical evidence for traditional Chinese medicine (TCM) is scattered, requiring a unified entrance to navigate all available evaluations on TCM therapies under modern standards. Besides, novel experimental evidence has continuously accumulated for TCM since the publication of HERB 1.0
-
The Chemical Probes Portal – 2024: update on this public resource to support best-practice selection and use of small molecules in biomedical research Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Domenico Sanfelice, Albert A Antolin, Alisa Crisp, Yi Chen, Benjamin Bellenie, Paul E Brennan, Aled Edwards, Susanne Müller, Bissan Al-Lazikani, Paul Workman
The Chemical Probes Portal (www.chemicalprobes.org) is a free, public resource, based on expert-reviews, that supports the assessment, selection and use of small-molecule compounds that qualify as chemical probes. These high-quality reagents are essential for exploring the function of individual proteins in complex biological systems, such as cells and organisms, and for validating proteins as potential
-
Efficient suppression of premature termination codons with alanine by engineered chimeric pyrrolysine tRNAs Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Aya Awawdeh, Alejandro Tapia, Sarah A Alshawi, Olabode Dawodu, Sarah A Gaier, Caitlin Specht, Jean-Denis Beaudoin, Jeffery M Tharp, Oscar Vargas-Rodriguez
Mutations that introduce premature termination codons (PTCs) within protein-coding genes are associated with incurable and severe genetic diseases. Many PTC-associated disorders are life-threatening and have no approved medical treatment options. Suppressor transfer RNAs (sup-tRNAs) with the capacity to translate PTCs represent a promising therapeutic strategy to treat these conditions; however, developing
-
Functional and molecular insights into the role of Sae2 C-terminus in the activation of MRX endonuclease Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Chiara Vittoria Colombo, Erika Casari, Marco Gnugnoli, Flavio Corallo, Renata Tisi, Maria Pia Longhese
The yeast Sae2 protein, known as CtIP in mammals, once phosphorylated at Ser267, stimulates the endonuclease activity of the Mre11-Rad50-Xrs2 (MRX) complex to cleave DNA ends that possess hairpin structures or protein blocks, such as the Spo11 transesterase or trapped topoisomerases. Stimulation of the Mre11 endonuclease by Sae2 depends on a Rad50–Sae2 interaction, but the mechanism by which this is
-
GPCRdb in 2025: adding odorant receptors, data mapper, structure similarity search and models of physiological ligand complexes Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Luis P Taracena Herrera, Søren N Andreassen, Jimmy Caroli, Ismael Rodríguez-Espigares, Ali A Kermani, György M Keserű, Albert J Kooistra, Gáspár Pándy-Szekeres, David E Gloriam
G protein-coupled receptors (GPCRs) are membrane-spanning transducers mediating the actions of numerous physiological ligands and drugs. The GPCR database GPCRdb supports a large global research community with reference data, analysis, visualization, experiment design and dissemination. Here, we describe our sixth major GPCRdb release starting with an overview of all resources for receptors and ligands
-
MatrixDB 2024: an increased coverage of extracellular matrix interactions, a new Network Explorer and a new web interface Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Kasun W Samarasinghe, Max Kotlyar, Sylvain D Vallet, Catherine Hayes, Alexandra Naba, Igor Jurisica, Frédérique Lisacek, Sylvie Ricard-Blum
MatrixDB, a member of the International Molecular Exchange consortium (IMEx), is a curated interaction database focused on interactions established by extracellular matrix (ECM) constituents including proteins, proteoglycans, glycosaminoglycans and ECM bioactive fragments. The architecture of MatrixDB was upgraded to ease interaction data export, allow versioning and programmatic access and ensure
-
CAF-1 promotes efficient PrimPol recruitment to nascent DNA for single-stranded DNA gap formation Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Joshua Straka, Jude B Khatib, Lindsey Pale, Claudia M Nicolae, George-Lucian Moldovan
Suppression of single-stranded DNA (ssDNA) gap accumulation at replication forks has emerged as a potential determinant of chemosensitivity in homologous recombination (HR)-deficient tumors, as ssDNA gaps are transformed into cytotoxic double-stranded DNA breaks. We have previously shown that the histone chaperone CAF-1’s nucleosome deposition function is vital to preventing degradation of stalled
-
Complex portal 2025: predicted human complexes and enhanced visualisation tools for the comparison of orthologous and paralogous complexes Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Sucharitha Balu, Susie Huget, Juan Jose Medina Reyes, Eliot Ragueneau, Kalpana Panneerselvam, Samantha N Fischer, Erin R Claussen, Savvas Kourtis, Colin W Combe, Birgit H M Meldal, Livia Perfetto, Juri Rappsilber, Georg Kustatscher, Kevin Drew, Sandra Orchard, Henning Hermjakob
The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated reference database for molecular complexes. It is a unifying web resource linking aggregated data on composition, topology and the function of macromolecular complexes from 28 species. In addition to significantly extending the number of manually curated complexes, we have massively extended the coverage of the human complexome
-
Selective recognition of RNA G-quadruplex in vitro and in cells by L-aptamer–D-oligonucleotide conjugate Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Haizhou Zhao, Hill Lam Lau, Kun Zhang, Chun Kit Kwok
RNA Guanine-quadruplexes (rG4s) are important nucleic acid structures that govern vital biological processes. Although numerous tools have been developed to target rG4s, few specific tools are capable of discerning individual rG4 of interest. Herein, we design and synthesize the first L-aptamer–antisense oligonucleotide (ASO) conjugate, L-Apt.4–1c-ASO15nt(APP), with a focus on recognizing the amyloid
-
Profiling of i-motif-binding proteins reveals functional roles of nucleolin in regulation of high-order DNA structures. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-19 Yuki Ban,Yuka Ando,Yuma Terai,Risa Matsumura,Keita Nakane,Shigenori Iwai,Shinichi Sato,Junpei Yamamoto
Non-canonical DNA structures, such as the G-quadruplex (G4) and i-motif (iM), are formed at guanine- and cytosine-rich sequences, respectively, in living cells and involved in regulating various biological processes during the cell cycle. Therefore, the formation and resolution of these non-canonical structures must be dynamically regulated by physiological conditions or factors that can bind G4 and
-
Correction to 'Regulation of the androgen receptor by SET9-mediated methylation'. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-18
-
Correction to 'Click display: a rapid and efficient in vitro protein display method for directed evolution'. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-18
-
Correction to 'Genome manipulation by guide-directed Argonaute cleavage'. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-18
-
AcrIIIA1 is a protein–RNA anti-CRISPR complex that targets core Cas and accessory nucleases Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-18 Lucy Chou-Zheng, Olivia Howell, Tori A Boyle, Motaher Hossain, Forrest C Walker, Emma K Sheriff, Barbaros Aslan, Asma Hatoum-Aslan
Clustered regularly-interspaced short palindromic repeats (CRISPRs) and CRISPR-associated (Cas) proteins protect bacteria and archaea from their viruses, and anti-CRISPRs (Acrs) are small virus-encoded proteins that inhibit CRISPR-Cas immunity. Over 80 families of Acrs have been described to date; however, only three of these subvert Type III CRISPR-Cas immunity. Type III systems employ a complex network
-
SPathDB: a comprehensive database of spatial pathway activity atlas Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-15 Feng Li, Xinyu Song, Wenli Fan, Liying Pei, Jiaqi Liu, Rui Zhao, Yifang Zhang, Mengyue Li, Kaiyue Song, Yu Sun, Chunlong Zhang, Yunpeng Zhang, Yanjun Xu
Spatial transcriptomics sequencing technology deepens our understanding of the diversity of cell behaviors, fates and states within complex tissue, which is often determined by the fine-tuning of regulatory network functional activities. Therefore, characterizing the functional activity within tissue space is helpful for revealing the functional features that drive spatial heterogeneity, and understanding
-
RASP v2.0: an updated atlas for RNA structure probing data Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Kunting Mu, Yuhan Fei, Yiran Xu, Qiangfeng Cliff Zhang
RNA molecules function in numerous biological processes by folding into intricate structures. Here we present RASP v2.0, an updated database for RNA structure probing data featuring a substantially expanded collection of datasets along with enhanced online structural analysis functionalities. Compared to the previous version, RASP v2.0 includes the following improvements: (i) the number of RNA structure
-
The secondary metabolism collaboratory: a database and web discussion portal for secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Daniel W Udwary,Drew T Doering,Bryce Foster,Tatyana Smirnova,Satria A Kautsar,Nigel J Mouncey
Secondary metabolites are small molecules produced by all corners of life, often with specialized bioactive functions with clinical and environmental relevance. Secondary metabolite biosynthetic gene clusters (BGCs) can often be identified within DNA sequences by various sequence similarity tools, but determining the exact functions of genes in the pathway and predicting their chemical products can
-
PharmFreq: a comprehensive atlas of ethnogeographic allelic variation in clinically important pharmacogenes. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Roman Tremmel,Yitian Zhou,Mahamadou D Camara,Sofiene Laarif,Erik Eliasson,Volker M Lauschke
Genetic polymorphisms in drug metabolizing enzymes, drug transporters as well as in genes encoding the human major histocompatibility complex contribute to inter-individual differences in drug efficacy and safety. The extent, pattern and complexity of such pharmacogenetic variation differ drastically across human populations. Here, we present PharmFreq, a global repository of pharmacogenetic frequency
-
SETDB1 activity is globally directed by H3K14 acetylation via its Triple Tudor Domain Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Thyagarajan T Chandrasekaran, Michel Choudalakis, Alexander Bröhm, Sara Weirich, Alexandra G Kouroukli, Ole Ammerpohl, Philipp Rathert, Pavel Bashtrykov, Albert Jeltsch
SETDB1 (SET domain bifurcated histone lysine methyltransferase 1) is a major protein lysine methyltransferase trimethylating lysine 9 on histone H3 (H3K9) which is involved in heterochromatin formation and silencing of repeat elements (REs). It contains a unique Triple Tudor Domain (3TD), which specifically binds the dual modification of H3K14ac in the presence of H3K9me1/2/3. Here, we explored the
-
GTO: a comprehensive gene therapy omnibus Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Xuehang Meng, Yujia Du, Chang Liu, Zhaoyu Zhai, Jianbo Pan
Gene therapy, which involves the delivery of genetic material into cells to correct an underlying genetic problem, has emerged as a promising approach for treating various conditions. To promote research in this rapidly evolving field, we developed the Gene Therapy Omnibus (GTO) (http://www.inbirg.com/gto/), a comprehensive resource containing detailed clinical trial data and molecular information
-
dbAMP 3.0: updated resource of antimicrobial activity and structural annotation of peptides in the post-pandemic era Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Lantian Yao, Jiahui Guan, Peilin Xie, Chia-Ru Chung, Zhihao Zhao, Danhong Dong, Yilin Guo, Wenyang Zhang, Junyang Deng, Yuxuan Pang, Yulan Liu, Yunlu Peng, Jorng-Tzong Horng, Ying-Chih Chiang, Tzong-Yi Lee
Antimicrobial resistance is one of the most urgent global health threats, especially in the post-pandemic era. Antimicrobial peptides (AMPs) offer a promising alternative to traditional antibiotics, driving growing interest in recent years. dbAMP is a comprehensive database offering extensive annotations on AMPs, including sequence information, functional activity data, physicochemical properties and
-
miRNATissueAtlas 2025: an update to the uniformly processed and annotated human and mouse non-coding RNA tissue atlas Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Shusruto Rishik, Pascal Hirsch, Friederike Grandke, Tobias Fehlmann, Andreas Keller
MiRNAs represent a non-coding RNA class that regulate gene expression and pathways. While miRNAs are evolutionary conserved most data stems from Homo sapiens and Mus musculus. As miRNA expression is highly tissue specific, we developed miRNATissueAtlas to comprehensively explore this landscape in H. sapiens. We expanded the H. sapiens tissue repertoire and included M. musculus. In past years, the number
-
The Pfam protein families database: embracing AI/ML Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Typhaine Paysan-Lafosse, Antonina Andreeva, Matthias Blum, Sara Rocio Chuguransky, Tiago Grego, Beatriz Lazaro Pinto, Gustavo A Salazar, Maxwell L Bileschi, Felipe Llinares-López, Laetitia Meng-Papaxanthos, Lucy J Colwell, Nick V Grishin, R Dustin Schaeffer, Damiano Clementel, Silvio C E Tosatto, Erik Sonhammer, Valerie Wood, Alex Bateman
The Pfam protein families database is a comprehensive collection of protein domains and families used for genome annotation and protein structure and function analysis (https://www.ebi.ac.uk/interpro/). This update describes major developments in Pfam since 2020, including decommissioning the Pfam website and integration with InterPro, harmonization with the ECOD structural classification, and expanded
-
RiboSeq.Org: an integrated suite of resources for ribosome profiling data analysis and visualization Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-14 Jack A S Tierney, Michał I Świrski, Håkon Tjeldnes, Anmol M Kiran, Gionmattia Carancini, Stephen J Kiniry, Audrey M Michel, Joanna Kufel, Eivind Valen, Pavel V Baranov
Ribosome profiling (Ribo-Seq) has revolutionised our understanding of translation, but the increasing complexity and volume of Ribo-Seq data present challenges for its reuse. Here, we formally introduce RiboSeq.Org, an integrated suite of resources designed to facilitate Ribo-Seq data analysis and visualisation within a web browser. RiboSeq.Org comprises several interconnected tools: GWIPS-viz for
-
The international nucleotide sequence database collaboration (INSDC): enhancing global participation. Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Ilene Karsch-Mizrachi,Masanori Arita,Tony Burdett,Guy Cochrane,Yasukazu Nakamura,Kim D Pruitt,Valerie A Schneider,On Behalf Of The International Nucleotide Sequence Database Collaboration
The members of the International Nucleotide Sequence Database Collaboration (INSDC; https://insdc.org) have built systems to collect, archive and disseminate sequence data for more than four decades. The three collaborating organizations, the National Library of Medicine, National Center for Biotechnology Information (NLM-NCBI) in the United States, Research Organization of Information and Systems
-
Rediscovering publicly available single-cell data with the DISCO platform Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Mengwei Li, Kok Siong Ang, Brian Teo, Uddamvathanak Rom, Minh N Nguyen, Sebastian Maurer-Stroh, Jinmiao Chen
Single-cell RNA sequencing (scRNA-seq) has emerged as the key technique for studying transcriptomics at the single-cell level. In our previous work, we presented the DISCO database (https://www.immunesinglecell.org/) that integrates publicly available human scRNA-seq data. We now introduce an enhanced version of DISCO, which has expanded fourfold to include >100 million cells from >17 thousand samples
-
MDRepo—an open data warehouse for community-contributed molecular dynamics simulations of proteins Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Amitava Roy, Ethan Ward, Illyoung Choi, Michele Cosi, Tony Edgin, Travis S Hughes, Md Shafayet Islam, Asif M Khan, Aakash Kolekar, Mariah Rayl, Isaac Robinson, Paul Sarando, Edwin Skidmore, Tyson L Swetnam, Mariah Wall, Zhuoyun Xu, Michelle L Yung, Nirav Merchant, Travis J Wheeler
Molecular Dynamics (MD) simulation of biomolecules provides important insights into conformational changes and dynamic behavior, revealing critical information about folding and interactions with other molecules. The collection of simulations stored in computers across the world holds immense potential to serve as training data for future Machine Learning models that will transform the prediction of
-
canSAR 2024—an update to the public drug discovery knowledgebase Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Phillip W Gingrich, Rezvan Chitsazi, Ansuman Biswas, Chunjie Jiang, Li Zhao, Joseph E Tym, Kevin M Brammer, Jun Li, Zhigang Shu, David S Maxwell, Jeffrey A Tacy, Ioan L Mica, Michael Darkoh, Patrizio di Micco, Kaitlyn P Russell, Paul Workman, Bissan Al-Lazikani
canSAR (https://cansar.ai) continues to serve as the largest publicly available platform for cancer-focused drug discovery and translational research. It integrates multidisciplinary data from disparate and otherwise siloed public data sources as well as data curated uniquely for canSAR. In addition, canSAR deploys a suite of curation and standardization tools together with AI algorithms to generate
-
OncoSexome: the landscape of sex-based differences in oncologic diseases Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Xinyi Shen, Yintao Zhang, Jiamin Li, Ying Zhou, Samuel D Butensky, Yechi Zhang, Zongwei Cai, Andrew T DeWan, Sajid A Khan, Hong Yan, Caroline H Johnson, Feng Zhu
The NIH policy on sex as biological variable (SABV) emphasized the importance of sex-based differences in precision oncology. Over 50% of clinically actionable oncology genes are sex-biased, indicating differences in drug efficacy. Research has identified sex differences in non-reproductive cancers, highlighting the need for comprehensive sex-based cancer data. We therefore developed OncoSexome, a
-
BitterDB: 2024 update on bitter ligands and taste receptors Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Evgenii Ziaikin, Moran David, Sofya Uspenskaya, Masha Y Niv
BitterDB (http://bitterdb.agri.huji.ac.il) was introduced in 2012 as a central resource for information on bitter-tasting molecules and their receptors, and was updated in 2019. The information in BitterDB is used for tasks such as exploring the bitter chemical space, choosing suitable ligands for experimental studies, analyzing receptors’ selectivity and promiscuity, and developing machine learning
-
MarkerDB 2.0: a comprehensive molecular biomarker database for 2025 Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Hayley Jackson, Eponine Oler, Claudia Torres-Calzada, Ray Kruger, Amandeep Singh Hira, Yamilé López-Hernández, Devanshi Pandit, Jiaxuan Wang, Kellie Yang, Omolola Fatokun, Mark Berjanskii, Scott MacKay, Tanvir Sajed, Scott Han, Robyn Woudstra, Gina Sykes, Jenna Poelzer, Aadhavya Sivakumaran, Vasuk Gautam, Gane Wong, David S Wishart
MarkerDB (https://markerdb.ca) has become a leading resource for comprehensive information on molecular biomarkers. Over the past 3 years, the database has evolved significantly, reflecting the dynamic landscape of biomarker research and increasing demands from its user community. This year’s update, which is called MarkerDB 2.0, introduces key improvements to enhance the database’s usability, consistency
-
OrthoDB and BUSCO update: annotation of orthologs with wider sampling of genomes Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Fredrik Tegenfeldt, Dmitry Kuznetsov, Mosè Manni, Matthew Berkeley, Evgeny M Zdobnov, Evgenia V Kriventseva
OrthoDB (https://www.orthodb.org) offers evolutionary and functional annotations of orthologous genes in the widest sampling of eukaryotes, prokaryotes, and viruses, extending experimental gene function knowledge to newly sequenced genomes. We collect gene annotations, delineate hierarchical gene orthology and annotate the orthologous groups (OGs) with functional and evolutionary traits. OrthoDB is
-
The conformation of FOXM1 homodimers in vivo is crucial for regulating transcriptional activities Nucleic Acids Res. (IF 16.6) Pub Date : 2024-11-13 Chia-Chan Hsu, Xiang Yao, Shang-Yao Chen, Tsui-Chun Tsuo, I-Ching Wang
Conformational changes in a transcription factor can significantly affect its transcriptional activity. The activated form of the FOXM1 transcription factor regulates the transcriptional network of genes essential for cell cycle progression and carcinogenesis. However, the mechanism and impact of FOXM1 conformational change on its transcriptional activity in vivo throughout the cell cycle progression