Six Months of Prompt Engineering: Building Scientific Altitudinal, Topographical, and Geological Visualizations for Spiders

  Six Months of Prompt Engineering: Building Scientific Altitudinal, Topographical, and Geological Visualizations for Spiders By: Luis A. Roque,  Arácnido Taxonomy Six months ago, I set out on what seemed like a relatively straightforward goal: create better visual representations of where spiders live. What I quickly discovered was that producing scientifically meaningful ecological visualizations requires far more than simply asking artificial intelligence to draw a landscape. It requires learning how to communicate ecology, geology, geography, climate, and biodiversity in a language that AI can understand. Over the past six months, I have spent hundreds of hours developing, testing, refining, and rewriting prompts designed to generate publication-quality altitudinal, topographical, geological, and habitat-based visualizations for spiders, particularly tarantulas and other species whose distributions are closely tied to specific environmental conditions. What began as a curi...

ProVenTL: a transfer-learning framework for predicting peptide–protein interactions derived from snake venom for cancer therapeutics

 


ProVenTL: a transfer-learning framework for predicting peptide–protein interactions derived from snake venom for cancer therapeutics

Abstract

Accurate prediction of peptide–protein interactions (PepPI) is crucial for advancing peptide-based anticancer drug design. In this study, we introduce ProVenTL, a computer-aided molecular design framework that leverages transfer learning and protein language model embeddings to enhance PepPI prediction accuracy and interpretability. Two complementary strategies were explored: (i) fine-tuning a CAMP model pretrained on large-scale PepPI data from the Protein Data Bank (PDB) using a curated dataset of Calloselasma rhodostoma venom peptides and cancer-related proteins, and (ii) integrating ProtT5 embeddings with stacked autoencoder–deep neural networks (SAE–DNN) and TabNet classifiers. Models were comprehensively benchmarked against baseline configurations and representative deep-learning approaches using standard classification metrics, while biological relevance was evaluated through functional enrichment and pathway analysis of top-ranked predictions. Compared with baseline configurations and conventional deep-learning approaches, the ProtT5-based SAE–DNN model achieved the best performance (accuracy = 0.78; ROC–AUC = 0.86), demonstrating improved generalization capability on a small, domain-specific venom peptide dataset. The model identified key targets such as TRBC2, CD274, HIF1AN, PCSK9, and PLAU, which are associated with pathways involved in immune suppression, hypoxia regulation, lipid metabolism, and metastasis. This study highlights the utility of transfer learning and protein language models for PepPI prediction in data-limited scenarios and establishes a computational framework for prioritizing snake-venom-derived peptides for anticancer drug discovery and future experimental validation.

Adhiva, J., Pradana, H.A., Kusuma, W.A. et al. ProVenTL: a transfer-learning framework for predicting peptide–protein interactions derived from snake venom for cancer therapeutics. J Comput Aided Mol Des 40, 90 (2026). https://doi.org/10.1007/s10822-026-00801-w