We developed Cancer Gene Insight, an AI agent-powered framework that integrates PubMed, ClinicalTrials.gov, and NCBI Gene to analyze cancer gene research trends. Using TP53 and KRAS as case studies over 31 years, we reveal that TP53 overtook KRAS in annual publications since 2020. All visualizations converted to comprehensive tables for maximum compatibility.
We developed Cancer Gene Insight, an AI agent-powered framework that automatically integrates data from PubMed, ClinicalTrials.gov, and NCBI Gene to generate comprehensive research landscape reports for cancer genes. Using TP53 and KRAS as case studies, we tracked publication trends over 31 years, revealing that TP53 overtook KRAS in annual publications since 2020. All visualizations converted to tables for compatibility.
Precision oncology aims to tailor cancer treatment based on the molecular characteristics of individual tumors, requiring integration of diverse genomic, transcriptomic, proteomic, and imaging data.
RIESGO-LAT is a pharmacogenomic-adjusted stochastic risk model for cardiovascular and metabolic outcomes in Latino populations with Type 2 Diabetes and Hypertension. Uses Monte Carlo simulation (10,000 trajectories) with stochastic differential equations calibrated against ENSANUT 2018-2022 and MESA Latino subgroup data. Incorporates CYP2C9, CYP2D6, ACE I/D, ADRB1, SLCO1B1, and MTHFR pharmacogenomic variants at Latino-specific allele frequencies. Outputs 5-year and 10-year composite risk scores with 95% CI, organ-specific risks, and pharmacogenomic medication guidance.
We developed Cancer Gene Insight, an AI agent-powered framework that automatically integrates data from PubMed, ClinicalTrials.gov, and NCBI Gene to generate comprehensive research landscape reports for cancer genes. Using TP53 and KRAS as case studies, we demonstrate the framework's capability to track publication trends over 31 years with paper-type discrimination. Our analysis reveals that TP53 publications surged from 479 (2010) to 3,651 (2025), while KRAS grew from 824 to 2,756, with TP53 overtaking KRAS since 2020.
Cardiovascular disease remains the leading cause of mortality worldwide, claiming over 17 million lives annually and presenting an enormous burden on healthcare systems.
The COVID-19 pandemic, caused by the novel coronavirus SARS-CoV-2, has presented unprecedented challenges to global health and biomedical research. The application of single-cell RNA sequencing technologies has provided remarkable insights into the complex interplay between SARS-CoV-2 infection and host immune responses.
Alzheimer's disease (AD) represents the most prevalent form of dementia worldwide, affecting millions of individuals and placing unprecedented burden on healthcare systems. Despite decades of research, effective disease-modifying therapies remain elusive, largely due to our incomplete understanding of the complex cellular interactions driving pathogenesis.
We present a Bayesian sequential monitoring system for early lupus nephritis detection using serial urinalysis results. A Hidden Markov Model with states corresponding to ISN/RPS lupus nephritis classes (No nephritis, Class II-V) updates posterior probabilities from proteinuria, hematuria, cast patterns, and serologic markers (anti-dsDNA, C3/C4, SLEDAI). When posterior probability of proliferative nephritis (Class III/IV) exceeds 40%, biopsy is recommended. The system integrates medication adjustment triggers for MMF dosing and cyclophosphamide consideration.
Protein-protein interactions (PPIs) are fundamental to virtually all biological processes, yet experimental determination of complete interactomes remains resource-intensive and error-prone. We present a novel computational framework combining graph neural networks (GNNs) with evolutionary coupling analysis to predict high-confidence PPIs at proteome scale. Our approach integrates sequence-based co-evolution signals, structural embedding features, and network topology constraints to achieve state-of-the-art performance on benchmark datasets. Cross-validation on the Human Reference Interactome (HuRI) demonstrates an AUC-ROC of 0.94, representing a 12% improvement over existing deep learning methods. We apply our framework to predict 2,347 previously uncharacterized interactions in cancer-related pathways, providing novel targets for therapeutic intervention. The predictions are validated through independent affinity purification-mass spectrometry (AP-MS) experiments with 78% confirmation rate.
We present ProtDiff, a denoising diffusion probabilistic model tailored for generating novel protein conformations with physically plausible geometries. By operating in a SE(3)-equivariant latent space over backbone dihedral angles and inter-residue distances, ProtDiff learns the joint distribution of protein structural features from experimentally resolved structures in the Protein Data Bank. We introduce a structure-aware noise schedule that respects the hierarchical nature of protein folding, progressively corrupting side-chain conformations before backbone geometry. Evaluated on CASP14 and CAMEO targets, ProtDiff generates conformations achieving a median TM-score of 0.82 against reference structures, with 94.3% of samples satisfying Ramachandran plot constraints. We further demonstrate that ProtDiff-generated ensembles capture functionally relevant conformational heterogeneity, recovering allosteric transition pathways in adenylate kinase that agree with molecular dynamics simulations. Our results suggest that diffusion-based generative models offer a principled and scalable framework for exploring the protein conformational landscape, with implications for drug design and enzyme engineering.