Tool catalog
Every entry below is a working Cortexa tool the research agent can call right now. Browse schemas and example calls; we hide anything that's not currently operational.
Showing all 2,204 tools
- Available
arXiv
Search arXiv for preprints across CS/ML, physics, math, quant-bio, and more. Use for methodology papers, bleeding-edge algorithms, and pre-peer-review results.
search_arxivView - Available
bioRxiv/medRxiv
bioRxiv + medRxiv preprints. Use for the most recent biology and clinical research (pre-peer-review).
search_biorxivView - Available
BLS
US Bureau of Labor Statistics: authoritative US labor & price series — CPI inflation, unemployment rate, nonfarm payrolls, average hourly earnings, PPI, and labor-force participation. Use for the official US jobs/inflation release figures. Returns the latest datapoint with its release period.
search_blsView - Available
Clinical Trials
Search ClinicalTrials.gov. Use for trial phase, status, enrollment, primary endpoints, and sponsor. Returns NCT records.
search_clinical_trialsView - Available
ClinVar
Search ClinVar for clinical significance of genetic variants. Use for variant pathogenicity, allele frequency, and condition associations.
search_clinvarView - Available
CoinGecko
CoinGecko: live cryptocurrency market data — price, market cap, 24h volume, rank, supply, and all-time high across 10K+ coins. FIRST CHOICE for any crypto/digital-asset price or market question. Search by coin name or symbol (e.g. 'bitcoin', 'ETH', 'solana').
search_coingeckoView - Available
Congress.gov
Congress.gov: bills, resolutions, members, roll-call votes, and committee reports from the Library of Congress. Use to track the status of legislation, find sponsors, or look up Congressional voting records.
search_congressView - Available
CourtListener
CourtListener (Free Law Project): US federal + state case law (9M+ opinions, including the historical Caselaw Access Project corpus) and RECAP federal dockets. FIRST CHOICE for any question about a court opinion, judicial decision, or active litigation, AND for historical precedent (Standard Oil, Brown v. Board, Sherman Act cases). Uses semantic + hybrid search by default — write the query in natural language ('cases where shareholders sued directors for ignoring climate risk') and put any term that MUST appear literally in "double quotes" (e.g. '"Section 230" immunity for AI-generated content'). Returns case name, court, citation, date filed, judge, and excerpt.
search_courtlistenerView - Available
Crossref
Crossref: canonical DOI registry with bibliographic metadata (journal, ISSN, licence, publisher). Use when the user gives you a DOI to resolve, or when you need authoritative journal/book metadata. Also useful as a backup when OpenAlex is rate-limiting.
search_crossrefView - Available
Crypto On-Chain (BigQuery)
Crypto on-chain data (Bitcoin + Ethereum via BigQuery public mirrors): the raw-ledger complement to CoinGecko market data. Returns recent network statistics (latest block, 24h tx count, average gas fee), or — given an Ethereum address (0x… 40 hex) or transaction hash (0x… 64 hex) — that entity's recent on-chain activity. Use CoinGecko for price/market-cap; use this for on-chain activity.
search_crypto_onchainView - Available
DailyMed
DailyMed NIH prescribing information. Use for the official FDA-cleared package insert text.
search_dailymedView - Available
ECB Data Portal
ECB Data Portal: euro-area headline indicators — reference exchange rates (EUR/USD, EUR/GBP, EUR/JPY), ECB policy rates (main refinancing, deposit facility), and HICP inflation. Use for euro-area monetary policy, FX, and inflation questions. Returns the latest official value with its period.
search_ecbView - Available
eCFR
Electronic Code of Federal Regulations (eCFR): authoritative current text of all 50 CFR titles. Use for any question about federal regulations — securities (17 CFR), drugs and food (21 CFR), environment (40 CFR), labor (29 CFR), tax (26 CFR), etc. Returns the regulation hierarchy and text excerpt.
search_ecfrView - Available
SEC EDGAR
SEC EDGAR full-text filings search. Use for company filings — 10-K (annual), 10-Q (quarterly), 8-K (material events), S-1 (IPO), proxy statements, Form 4 (insider trades). FIRST CHOICE for questions about a public company's risk factors, financials, governance, or insider activity.
search_edgarView - Available
FDA Drug Labels
Search FDA-approved drug labels. Use for indications, contraindications, dosing ranges, adverse reactions, and black-box warnings. Returns OpenFDA label records.
search_fdaView - Available
FRED
FRED (Federal Reserve Economic Data): 800K+ US & international economic time series — interest rates (DGS10, FEDFUNDS), inflation (CPIAUCSL), GDP, unemployment (UNRATE), payrolls, money supply (M2), exchange rates, and equity indices (SP500). FIRST CHOICE for any macroeconomic, monetary-policy, or markets time-series question. Returns the matched series with its latest value and as-of date. (Requires FRED_API_KEY.)
search_fredView - Available
GitHub
GitHub: search public repositories by name, description, topics. Use to find official implementations of papers, kernel / compiler / database projects, RFC drafts, and to gauge reproducibility / maintenance for any CS/ML work.
search_githubView - Available
GovInfo
GovInfo (GPO): Federal Register, public laws, US Code, Congressional reports, Compilation of Presidential Documents. Use for proposed rules, agency notices, statutes, and Congressional hearings. Complements eCFR (final regulations) by covering the rulemaking process.
search_govinfoView - Available
HPO
Human Phenotype Ontology. Use to map symptoms to phenotype terms and associated conditions.
search_hpoView - Available
Hugging Face Hub
Hugging Face Hub: ML models + datasets registry. Use for anything about pretrained models, benchmark datasets, model cards, licences, and downloads/likes signal. First-choice source for CS/ML reproducibility queries.
search_huggingfaceView - Available
ICD-10/11
ICD-10/11 diagnosis coding. Use to map a clinical description to a standardized diagnosis code.
search_icd10View - Available
LOINC
LOINC lab and observation codes. Use when you need the LOINC code for a specific lab test or measurement.
search_loincView - Available
NASA ADS
NASA ADS (Astrophysics Data System): canonical literature index for astrophysics, cosmology, planetary science, and space sciences. Use for any astronomy/astrophysics query — it covers ApJ, MNRAS, A&A, IAU, arXiv astro-ph, and more.
search_nasa_adsView - Available
NCBI Gene
Look up gene records (RefSeq, genomic context, orthologs, expression). Use when a gene symbol needs authoritative NCBI data.
search_ncbi_geneView - Available
OMIM
OMIM catalog of Mendelian disorders. Use for gene-disease relationships and inheritance patterns.
search_omimView - Available
OpenAlex
OpenAlex: the universal scholarly-works catalog. 260M+ papers across every discipline with citation counts, open-access metadata, and concept tags. Use as the FIRST-CHOICE literature source for any question that isn't obviously biomedical, especially CS/ML, physics, chemistry, earth sciences, social sciences, economics, or humanities.
search_openalexView - Available
openFDA Animal & Veterinary
openFDA Animal & Veterinary adverse events (FDA Center for Veterinary Medicine): adverse-event reports for animal drugs — species/breed treated, drug and route, VeDDRA-coded reactions, and outcome. FIRST CHOICE for veterinary pharmacovigilance and animal-drug safety questions. Search by active ingredient (e.g. 'carprofen'), reaction, or species.
search_openfda_vetView - Available
Open Targets
Query Open Targets for drug-target-disease associations, tractability, and evidence scores. Use for target identification and drug-repurposing hypotheses.
search_opentargetsView - Available
Orphanet
Orphanet rare-disease encyclopedia. Use for orphan disease prevalence, inheritance, and associated genes.
search_orphanetView - Available
Patent Litigation (USPTO OCE)
Patent litigation (USPTO OCE dataset): US district-court patent infringement cases — caption, case number, court, and filing/close dates. Use for patent-litigation history, party disputes, and 'has this been litigated' questions. Search by party or case name.
search_patent_litigationView - Available
Patents (PatentsView)
Patents (PatentsView via BigQuery): search US patents by title/abstract. FIRST CHOICE for patent-landscape, prior-art, FTO, and IP-strategy questions. Returns patent number, grant date, title, and abstract snippet, linked to Google Patents.
search_patentsView - Available
PubChem
PubChem: NIH compound database with structures (SMILES, InChI), physicochemical properties (MW, XLogP, TPSA), and GHS hazard classifications. FIRST CHOICE for any chemistry question that names a molecule, compound, or drug substance.
search_pubchemView - Available
PubMed
Search PubMed for peer-reviewed biomedical literature. Use for clinical research, drug efficacy studies, disease mechanisms, and treatment outcomes. Returns abstracts + PMIDs.
search_pubmedView - Available
RxNorm
Look up drug nomenclature, ingredients, brand/generic mappings, and drug-drug interactions. Use when you need canonical drug identifiers.
search_rxnormView - Available
SEC Financials (XBRL)
SEC Financials (XBRL company facts): structured financial-statement numbers for US public companies — revenue, net income, total assets, shareholders' equity, and diluted EPS — resolved by ticker or company name. Use when you need the actual reported figures (not the filing narrative — that's search_edgar). Returns the latest annual values with period-end dates.
search_sec_financialsView - Available
Semantic Scholar
Semantic Scholar: academic papers with citation counts and influence scores. Use when you need citation analysis or cross-discipline coverage.
search_semantic_scholarView - Available
UniProt
Search UniProt for protein sequences, functions, domains, PTMs, and cross-references. Use when the question involves a specific protein (gene symbol or accession).
search_uniprotView - Available
US Healthcare Providers (NPPES)
US Healthcare Providers (CMS NPPES NPI registry): resolve a US clinician or healthcare organization name to its NPI, practice city/state, and primary taxonomy (specialty). Use for 'who is this provider / what's their NPI' lookups and provider directories. Search by person name (e.g. 'Atul Gawande') or organization name.
search_us_providersView - Available
Zenodo
Zenodo: CERN-hosted artifact registry with persistent DOIs. Use to locate versioned datasets, software releases, and supplementary research artifacts that a paper depends on. Especially good for reproducibility questions.
search_zenodoView - Available
ACC List Guidelines
List recent American College of Cardiology (ACC) clinical practice guidelines. Returns ACC-authored guidelines published in JACC and other ACC journals, sorted by date. ACC guidelines cover coronary revascularization, valvular heart disease, chest pain evaluation, cardiac catheterization, and cardiovascular risk assessment.
tu_ACC_list_guidelinesView - Available
ADA Get Standards Section
Fetch detailed content from a specific ADA Standards of Care section by PMID. Retrieves the abstract and, when available via PubMed Central (PMC), a full-text snippet. Use ADA_list_standards_sections first to get PMIDs for each section. All ADA Standards sections are freely available in PMC. Returns title, abstract, full text snippet, and links.
tu_ADA_get_standards_sectionView - Available
ADA List Standards Sections
List all sections/chapters of the current ADA Standards of Medical Care in Diabetes. The ADA publishes the Standards of Care annually in Diabetes Care as a free supplement. Each section covers a specific clinical topic (e.g., glycemic goals, pharmacologic treatment, cardiovascular risk management, diabetes technology). Returns section titles, numbers, PMIDs, and PMC links. Use year parameter to access specific editions (default: 2026).
tu_ADA_list_standards_sectionsView - Available
ADA Search Standards
Search ADA (American Diabetes Association) guidelines and publications by topic using PubMed. Searches publications with ADA as corporate author, returning titles, abstracts, PMIDs, and PMC links. Useful for finding ADA recommendations on specific diabetes-related topics such as glycemic targets, insulin therapy, GLP-1 receptor agonists, SGLT2 inhibitors, gestational diabetes, diabetic kidney disease, or diabetes technology.
tu_ADA_search_standardsView - Available
ADMETAI Predict BBB Penetrance
Predicts blood-brain barrier (BBB) penetrance for a given list of molecules in SMILES format.
tu_ADMETAI_predict_BBB_penetranceView - Available
ADMETAI Predict Bioavailability
Predicts bioavailability endpoints (Bioavailability_Ma, HIA_Hou, PAMPA_NCATS, Caco2_Wang, Pgp_Broccatelli) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_bioavailabilityView - Available
ADMETAI Predict Clearance Distribution
Predicts clearance and distribution endpoints (Clearance_Hepatocyte_AZ, Clearance_Microsome_AZ, Half_Life_Obach, VDss_Lombardo, PPBR_AZ) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_clearance_distributionView - Available
ADMETAI Predict CYP Interactions
Predicts CYP enzyme interactions for a given list of molecules in SMILES format.
tu_ADMETAI_predict_CYP_interactionsView - Available
ADMETAI Predict Nuclear Receptor Activity
Predicts nuclear receptor activity endpoints (NR-AR-LBD, NR-AR, NR-AhR, NR-Aromatase, NR-ER-LBD, NR-ER, NR-PPAR-gamma) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_nuclear_receptor_activityView - Available
ADMETAI Predict Physicochemical Properties
Predicts physicochemical properties (molecular weight, logP, hydrogen bond acceptors/donors, Lipinski, QED, stereo centers, TPSA) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_physicochemical_propertiesView - Available
ADMETAI Predict Solubility Lipophilicity Hydration
Predicts solubility, lipophilicity, and hydration endpoints (Solubility_AqSolDB, Lipophilicity_AstraZeneca, HydrationFreeEnergy_FreeSolv) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_solubility_lipophilicity_hydrationView - Available
ADMETAI Predict Stress Response
Predicts stress response endpoints (SR-ARE, SR-ATAD5, SR-HSE, SR-MMP, SR-p53) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_stress_responseView - Available
ADMETAI Predict Toxicity
Predicts toxicity endpoints (AMES, Carcinogens_Lagunin, ClinTox, DILI, LD50_Zhu, Skin_Reaction, hERG) for a given list of molecules in SMILES format.
tu_ADMETAI_predict_toxicityView - Available
Advanced Literature Search Agent
Advanced multi-agent deep literature search system. This is a SEARCH-FIRST system: agents must extensively use Cortexa Tool Mesh literature search tools to gather information, NOT rely on their own knowledge. Required pipeline: (1) query_planner analyzes the query to identify key search aspects and generates 3-8 focused search queries; (2) FIRST call web_foundation_searcher (base web tools only) to produce seeds/sources/keywords; (3) For EACH seed query, parallel call ALL THREE specialized sear…
tu_advanced_literature_search_agentView - Available
AgingCohort Search
Search a curated registry of ~30 major longitudinal cohort studies relevant to aging research worldwide. Contains study metadata including design, sample size, age range, variable categories, and data access information. Use to find the right cohort for a specific research question, such as which studies measure iron intake, grip strength, or walking speed in older adults. Filters by country, study design (longitudinal/cross-sectional), minimum sample size, and specific variable availability.
tu_AgingCohort_searchView - Available
AHA ACC Get Guideline
Fetch the full text (or a detailed snippet) of a specific AHA (American Heart Association) or ACC (American College of Cardiology) guideline from PubMed Central using its PMID. Returns the abstract and, when the article is open-access in PMC, the full guideline text up to 6000 characters covering recommendations, evidence tables, and rationale. Use AHA_list_guidelines or ACC_list_guidelines first to obtain PMIDs. Many recent AHA/ACC guidelines are open-access in PMC.
tu_AHA_ACC_get_guidelineView - Available
AHA ACC Search Guidelines
Search AHA (American Heart Association) and ACC (American College of Cardiology) clinical practice guidelines by topic via PubMed. Covers guidelines published in Circulation, JACC, and other AHA/ACC journals. Useful for finding recommendations on hypertension, heart failure, atrial fibrillation, coronary artery disease, valvular heart disease, lipid management, cardiac arrest, and other cardiovascular conditions.
tu_AHA_ACC_search_guidelinesView - Available
AHA List Guidelines
List recent American Heart Association (AHA) clinical practice guidelines. Returns AHA-authored guidelines and scientific statements published in Circulation and other AHA journals, sorted by date. Covers cardiovascular guidelines on topics including hypertension, heart failure, cardiac arrest, stroke, arrhythmias, and prevention.
tu_AHA_list_guidelinesView - Available
AllenBrain Get Expression Datasets
Get gene expression experiment datasets from the Allen Mouse Brain Atlas for a given gene. Returns section data sets including plane of section, section thickness, and linked gene info. These datasets contain ISH (in situ hybridization) gene expression images. Example: 'Gad1' returns 22 ISH experiment datasets.
tu_AllenBrain_get_expression_datasetsView - Available
AllenBrain Get Structure
Get detailed information about a brain structure by its Allen Brain Atlas structure ID. Returns full structure record including name, hierarchy path, parent structure, color code. Example: structure_id=382 returns 'Field CA1' in hippocampus.
tu_AllenBrain_get_structureView - Available
AllenBrain Search Genes
Search for genes in the Allen Brain Atlas by gene symbol (acronym) or name. Returns gene metadata including Entrez ID, chromosome, homologene ID. Useful for finding gene IDs needed to query expression data. Example: 'Gad1' returns glutamate decarboxylase 1 records for mouse and rat.
tu_AllenBrain_search_genesView - Available
AllenBrain Search Structures
Search for brain structures/regions in the Allen Brain Atlas ontology by acronym or name. Returns structure hierarchy, atlas coordinates, and color coding. Example: 'CA1' returns hippocampal Field CA1 across multiple atlases.
tu_AllenBrain_search_structuresView - Available
Alliance Get Disease
Get disease summary information from the Alliance of Genome Resources by Disease Ontology (DO) ID. Returns disease name, definition, synonyms, and classification. The Alliance integrates disease annotations from multiple model organism databases. Example: 'DOID:162' returns cancer with definition and synonyms like 'malignant neoplasm'.
tu_Alliance_get_diseaseView - Available
Alliance Get Disease Genes
Get genes associated with a disease across all model organisms from the Alliance of Genome Resources. Uses Disease Ontology (DO) IDs to find genes linked to a disease from human, mouse, rat, zebrafish, fly, worm, and other organisms. Includes both direct annotations and inferred associations from model organism phenotypes. Example: 'DOID:162' (cancer) returns TP53, BRCA1, and thousands more.
tu_Alliance_get_disease_genesView - Available
Alliance Get Gene Phenotypes
Get phenotype annotations for a gene from the Alliance of Genome Resources. Returns phenotypes observed when the gene is mutated or knocked out, aggregated from model organism databases. Supports all model organisms (yeast, fly, worm, zebrafish, mouse, rat, frog). Example: 'HGNC:6081' (INS) returns phenotypes like 'Abnormal circulating insulin concentration'.
tu_Alliance_get_gene_phenotypesView - Available
Alliance Search Genes
Search for genes across all model organisms in the Alliance of Genome Resources. Searches by gene symbol, name, or keyword and returns matching genes from human, mouse, rat, zebrafish, fly, worm, yeast, and frog databases. Provides type-ahead suggestions. Example: 'insulin' returns INS (human), Ins1/Ins2 (mouse), ins (zebrafish), etc.
tu_Alliance_search_genesView - Available
Alphafold Get Annotations
Retrieve AlphaFold MUTAGEN annotations for a given UniProt accession. Returns experimental mutagenesis data mapped onto protein structures from UniProt. The qualifier must be a UniProt ACCESSION (e.g., 'P69905'). Note: Not all proteins have MUTAGEN annotations available in the database.
tu_alphafold_get_annotationsView - Available
Alphafold Get Prediction
Retrieve full AlphaFold 3D structure predictions for a given protein. IMPORTANT: The qualifier must be a UniProt ACCESSION (e.g., 'P69905' for HBA_HUMAN). Do NOT use UniProt entry names like 'HBA_HUMAN' - they will cause API errors. To find UniProt accession from a gene/protein name, use `UniProt_search` (e.g., query='gene:HBA' organism='human') or `UniProt_id_mapping` for ID conversion. Returns residue-level metadata including sequence, per-residue confidence scores (pLDDT), and structure down…
tu_alphafold_get_predictionView - Available
Alphafold Get Summary
Retrieve summary details of AlphaFold 3D models for a given protein. IMPORTANT: The qualifier must be a UniProt ACCESSION (e.g., 'Q5SWX9' for MEIOB_HUMAN). Do NOT use UniProt entry names like 'MEIOB_HUMAN' - they will cause API errors. To find UniProt accession from a gene/protein name, use `UniProt_search` (e.g., query='gene:MEIOB' organism='human') or `UniProt_id_mapping` for ID conversion. Returns lightweight information such as sequence length, coverage, confidence scores, experimental meth…
tu_alphafold_get_summaryView - Available
AlphaMissense Get Protein Scores
Get all AlphaMissense pathogenicity scores for a protein by UniProt ID. Returns comprehensive residue-level data for all possible missense variants across the entire protein. Useful for identifying pathogenic hotspots and overall protein vulnerability assessment.
tu_AlphaMissense_get_protein_scoresView - Available
AlphaMissense Get Residue Scores
Get AlphaMissense scores for all 20 possible amino acid substitutions at a specific protein position. Useful for saturation mutagenesis analysis and identifying pathogenic hotspots.
tu_AlphaMissense_get_residue_scoresView - Available
AlphaMissense Get Variant Score
Get AlphaMissense pathogenicity score for a specific missense variant. Input: UniProt ID and variant (e.g., 'p.R123H'). Returns score and classification (pathogenic/ambiguous/benign). State-of-the-art for VUS interpretation.
tu_AlphaMissense_get_variant_scoreView - Available
Anaconda Get Package
Get detailed information about a specific conda package from a specific channel on Anaconda.org. Returns the package's full metadata including description, all versions, platform availability, download counts, home URL, license, and owner details. Use the format '{channel}/{package_name}' (e.g., 'conda-forge/numpy', 'bioconda/samtools', 'defaults/scipy').
tu_Anaconda_get_packageView - Available
Anaconda Search Packages
Search Anaconda.org for conda packages across all channels (conda-forge, bioconda, defaults, etc.). Returns matching packages with name, owner/channel, summary, versions, platforms, and download counts. Anaconda.org hosts conda packages for Python, R, and other languages used in scientific computing, data science, and bioinformatics. Bioconda alone has 9,000+ bioinformatics packages.
tu_Anaconda_search_packagesView - Available
AOPWiki List Aops
List Adverse Outcome Pathways (AOPs) from AOPWiki. Returns AOP IDs, titles, and short names. AOPs describe causal chains from molecular initiating events through key events to adverse outcomes, used in toxicology and risk assessment.
tu_AOPWiki_list_aopsView - Available
ARCHS4 Get Gene Correlations
Get genes most co-expressed with a query gene from ARCHS4, based on Pearson correlation computed across 300K+ uniformly processed RNA-seq samples. Returns the top N most correlated genes with their Pearson correlation coefficients. Useful for identifying functionally related genes, predicting gene function, and building co-expression networks.
tu_ARCHS4_get_gene_correlationsView - Available
ARCHS4 Get Gene Expression
Get gene expression levels across human or mouse tissues and cell lines from ARCHS4 (All RNA-seq and ChIP-seq Sample and Signature Search). Returns box-plot statistics (min, Q1, median, Q3, max) in log2(TPM+1) units for each tissue/cell line, computed from 300K+ uniformly processed RNA-seq samples. Choose type='tissue' for organ/tissue expression or type='cellline' for cell line expression. Results are sorted by median expression (highest first).
tu_ARCHS4_get_gene_expressionView - Available
Arrayexpress Get Experiment
Get detailed information about a specific ArrayExpress experiment by its accession ID (E-GEOD, E-MTAB, etc.). Returns complete experiment metadata including title, description, protocols, experimental design, and publication information from the original ArrayExpress database.
tu_arrayexpress_get_experimentView - Available
Arrayexpress Get Experiment Files
Get list of files associated with an ArrayExpress experiment. Returns file metadata including file names, types, sizes, and download paths for raw data files, processed data files, and MAGE-TAB annotation files.
tu_arrayexpress_get_experiment_filesView - Available
Arrayexpress Get Experiment Samples
Get sample information for an ArrayExpress experiment. Returns sample metadata including sample names, characteristics (e.g., organism part, developmental stage), experimental factors, and technical details from the original ArrayExpress MAGE-TAB annotations.
tu_arrayexpress_get_experiment_samplesView - Available
Arrayexpress Search Experiments
Search ArrayExpress experiments - the original functional genomics database from EMBL-EBI. Searches microarray and RNA-seq experiments by keywords, species, or array platform. Returns experiment metadata including accession IDs (E-GEOD, E-MTAB), titles, descriptions, and study information. Note: Some newer ArrayExpress data is now hosted in BioStudies - use BioStudies tools if an accession is not found here.
tu_arrayexpress_search_experimentsView - Available
ArtIC Get Artwork
Get detailed information about a specific artwork from the Art Institute of Chicago collection by its numeric ID. Returns full metadata including title, artist, date, medium, dimensions, provenance, exhibition history, and IIIF image URL. No authentication required. Use ArtIC_search_artworks to find artwork IDs.
tu_ArtIC_get_artworkView - Available
ArtIC Search Artworks
Search the Art Institute of Chicago's collection of over 100,000 artworks using its public API. Returns artwork titles, artists, dates, mediums, and collection links. No authentication required. The collection includes Impressionist paintings (Seurat, Monet, Picasso), ancient art, photography, prints, and decorative arts. Useful for art research, museum data analysis, and cultural heritage studies.
tu_ArtIC_search_artworksView - Available
ArXiv Get Pdf Snippets
Fetch an arXiv paper's PDF and return bounded text snippets around provided terms. Uses markitdown to convert PDF to markdown. All arXiv papers are freely available (no paywall). Useful when key details are in the full text rather than abstract.
tu_ArXiv_get_pdf_snippetsView - Available
ArXiv Search Papers
Search arXiv for papers by keyword using the public arXiv API. Returns papers with title, abstract, authors, publication date, category, and URL.
tu_ArXiv_search_papersView - Available
Bgee Get Gene Expression
Get expression data for a gene across tissues and organs from the Bgee database. Returns where a gene is expressed with expression scores, confidence levels, and quality ratings. Supports RNA-Seq, Affymetrix, EST, and in situ hybridization data types. Requires both an Ensembl gene ID and NCBI taxonomy ID. Example: ENSG00000141510 (TP53) with species 9606 (human) returns expression across 145+ tissues including ventricular zone (score 95.11), ganglionic eminence, and monocytes.
tu_Bgee_get_gene_expressionView - Available
Bgee List Species
List all species available in the Bgee gene expression database. Returns species with taxonomy IDs, genome versions, and common names. Bgee currently covers 29+ animal species including major model organisms (human, mouse, zebrafish, fruit fly, C. elegans, rat) and additional vertebrates and invertebrates. Useful for discovering which species are available before querying gene expression data.
tu_Bgee_list_speciesView - Available
Bgee Search Genes
Search for genes in the Bgee comparative gene expression database across 29+ animal species. Bgee integrates RNA-Seq, Affymetrix, EST, and in situ hybridization data to provide curated expression calls. Search by gene symbol, name, or description. Returns matching genes with species information, biotype, and expression summaries. Example: searching 'TP53' returns tumor protein p53 across human, mouse, zebrafish, and other model organisms.
tu_Bgee_search_genesView - Available
BiGG Get Database Version
Get BiGG Models database version and last update date. Returns BiGG version number, API version, and timestamp of last database update. Use for: checking data currency, version control, documentation, reproducibility. Check this to ensure using latest metabolic models data.
tu_BiGG_get_database_versionView - Available
BiGG Get Metabolite
Get detailed metabolite information including formula, compartments, and database cross-references. Query from specific model or universal database. Returns metabolite names, formulas, compartment locations, and links to KEGG, MetaCyc, HMDB, ChEBI. Shows which models contain this metabolite. Use for: compound identification, formula lookup, database integration, metabolite annotation. Essential for understanding metabolic networks.
tu_BiGG_get_metaboliteView - Available
BiGG Get Model
Get detailed information for a specific metabolic model. Returns organism, genome accession, reaction/metabolite/gene counts, and publication reference (PMID). Use for: understanding model scope, checking model currency, finding publications, model selection. Essential metadata before downloading or analyzing a model. Example model IDs: iJO1366 (E. coli), iMM904 (S. cerevisiae), Recon3D (human).
tu_BiGG_get_modelView - Available
BiGG Get Model Reactions
Get all reactions in a metabolic model. Returns reaction IDs, names, and organisms for all biochemical reactions in the model. Use for: analyzing metabolic capabilities, finding specific pathways, reaction enumeration, model comparison. Follow with BiGG_get_reaction for detailed reaction information including stoichiometry and gene-reaction rules.
tu_BiGG_get_model_reactionsView - Available
BiGG Get Reaction
Get detailed reaction information including stoichiometry, metabolites, gene-reaction rules, and database links. Query from specific model or universal database. Returns reaction equation, participating metabolites with coefficients, GPR (gene-protein-reaction) associations, bounds, and cross-references (KEGG, MetaCyc, BRENDA). Use for: understanding reaction mechanisms, gene associations, flux constraints, pathway analysis. Essential for metabolic modeling.
tu_BiGG_get_reactionView - Available
BiGG List Models
Get list of all genome-scale metabolic models in BiGG database (85+ models). Returns model IDs, organisms, reaction counts, metabolite counts, and gene counts for bacteria, archaea, and eukaryotes. Models cover E. coli, yeast, human, and other organisms. Use for: discovering available models, finding organism-specific models, comparing model sizes, metabolic modeling projects. Essential first step for flux balance analysis.
tu_BiGG_list_modelsView - Available
BiGG Search
Search BiGG database for models, reactions, metabolites, or genes. Returns matching entries with IDs, names, and organism info. Search types: models (find organisms), reactions (find biochemical reactions), metabolites (find compounds), genes (find gene names/IDs). Use for: discovery, finding specific entities, cross-model searches, identifying pathways. Flexible search across entire database.
tu_BiGG_searchView - Available
BindingDB Get Ligands By PDB
Get binding affinity data for proteins by PDB structure ID. Returns ligands with measured affinities for the protein structure. Useful for structure-activity relationship analysis when you have co-crystal structures.
tu_BindingDB_get_ligands_by_pdbView - Available
BindingDB Get Ligands By Uniprot
Get binding affinity data (Ki, IC50, Kd) for a single protein by UniProt ID. Returns SMILES structures and measured affinities from BindingDB. Essential for drug discovery target analysis. Example: P00533 (EGFR) returns known inhibitors with binding constants.
tu_BindingDB_get_ligands_by_uniprotView - Available
BindingDB Get Ligands By Uniprots
Get binding affinity data for multiple proteins by UniProt IDs. Returns SMILES and affinities for all matching ligands. Use for comparing binding profiles across protein family members or paralogs.
tu_BindingDB_get_ligands_by_uniprotsView - Available
BindingDB Get Targets By Compound
Find protein targets for a compound by SMILES structure. Returns proteins with binding affinity data for similar compounds. Essential for polypharmacology analysis and off-target prediction.
tu_BindingDB_get_targets_by_compoundView - Available
BindingDB Search By Target
Search BindingDB for binding affinity data by target gene name or protein name. NOTE: BindingDB REST API is currently unavailable (requests time out). Use ChEMBL_get_target_activities or PubChem BioAssay as alternatives. To query by UniProt ID, use BindingDB_get_ligands_by_uniprot instead.
tu_BindingDB_search_by_targetView - Available
Bioconductor Search Packages
Search Bioconductor bioinformatics R packages by keyword, topic, or analysis type. Bioconductor provides 2200+ open-source R packages for bioinformatics and computational biology covering RNA-seq, single-cell, proteomics, epigenomics, metabolomics, flow cytometry, imaging, and more. Returns matching packages with title, description, usage count, and relevance score. Powered by R-universe search index. Use this to find packages for specific bioinformatics workflows, then use Bioconductor_get_pac…
tu_Bioconductor_search_packagesView - Available
BioImageArchive Get Study
Get detailed information about a specific BioImage Archive study by accession number. Returns comprehensive metadata including title, description, imaging methods, organism, and file information. Example: 'S-BIAD634' returns an annotated fluorescence image dataset.
tu_BioImageArchive_get_studyView - Available
BioImageArchive Search Bioimages
Search the BioImage Archive specifically for biological image datasets (BioImages collection). This targets the BioImages-specific collection at EBI, returning imaging studies with microscopy-specific metadata. Example: 'cell division' finds imaging datasets of cell division processes.
tu_BioImageArchive_search_bioimagesView - Available
BioImageArchive Search Studies
Search the BioImage Archive (EBI BioStudies) for biological imaging datasets. The BioImage Archive hosts open microscopy, imaging, and visualization data from life sciences research. Search by imaging modality (cryo-EM, fluorescence, confocal), organism, technique, or scientific topic. Returns study accessions, titles, and metadata. Example: 'fluorescence microscopy cell' finds fluorescence imaging studies.
tu_BioImageArchive_search_studiesView - Available
BiomarkerDiscoveryWorkflow
Discover and validate biomarkers for a specific disease condition using literature analysis, expression data, pathway enrichment, and clinical validation.
tu_BiomarkerDiscoveryWorkflowView - Available
BioModels Download Model
Download a specific file from a BioModels entry or get the download URL for the entire model as a COMBINE archive. Returns download URL and file metadata. Use this to access model files for simulation software (COPASI, CellDesigner, etc.). Specify filename parameter to download a specific file, or omit to get the COMBINE archive containing all files. Find model IDs using biomodels_search.
tu_BioModels_download_modelView - Available
BioModels Get Model
Get comprehensive metadata for a specific BioModels entry by its model identifier. Returns detailed information including model name, description, publication references, organism, biological processes, SBML level/version, creation/modification dates, and curator annotations. Use this to understand model details, citations, and biological context before downloading or using a model. Find model IDs using biomodels_search (e.g., 'BIOMD0000000469', 'MODEL1707110000').
tu_BioModels_get_modelView - Available
BioModels List Files
Get detailed file listing for a specific BioModels entry including file names, types, sizes, and metadata. Returns information about all files associated with the model (SBML files, figures, supplementary data, COMBINE archives). Use this before downloading to see what files are available and their formats. Requires model ID from biomodels_search or BioModels_get_model.
tu_BioModels_list_filesView - Available
Biomodels Search
Search for computational biological models in the EBI BioModels database by keyword. Returns models with identifiers, names, formats (typically SBML), and URLs. Use this to find systems biology models related to specific biological processes (e.g., 'glycolysis', 'apoptosis', 'cell cycle'), diseases, or pathways. Find model IDs to use with other BioModels tools for detailed information and downloads.
tu_biomodels_searchView - Available
BioModels Search Parameters
Search for specific parameters within BioModels entries. Find models containing parameters matching your query (e.g., reaction rates, initial concentrations, kinetic constants). Returns models with parameter names, values, and units. Essential for finding models with specific parameter values or discovering parameter ranges across multiple models. Use this for comparative analysis or parameter estimation studies.
tu_BioModels_search_parametersView - Available
BioPortal Annotate Text
Annotate biomedical text with ontology terms using BioPortal's Annotator (named entity recognition). Scans free text and identifies mentions of biomedical concepts from 900+ ontologies. Useful for extracting disease mentions, drug names, gene functions, phenotypes, and anatomical terms from clinical notes, papers, or abstracts. Example: annotate 'Melanoma is a malignant tumor of melanocytes' to find DOID:1909 (melanoma) and DOID:162 (cancer).
tu_BioPortal_annotate_textView - Available
BioPortal Get Concept
Get detailed information about a specific concept from a BioPortal ontology, including full definition, synonyms, CUI codes, and semantic types. Requires knowing the ontology acronym and concept IRI. Use BioPortal_search_ontology_terms first to find concept IDs, then use this tool for complete details. Example: get details for DOID:1909 (melanoma) from the DOID ontology, or GO:0006915 (apoptotic process) from the GO ontology.
tu_BioPortal_get_conceptView - Available
BioPortal Get Hierarchy
Get children, parents, or ancestors of a specific concept in a BioPortal ontology. Enables hierarchical browsing of ontology term relationships (is-a hierarchy). Use to explore narrower terms (children), broader terms (parents), or full ancestry path (ancestors). Example: get children of DOID:9351 (diabetes mellitus) to find type 1, type 2, gestational diabetes; get ancestors of HP:0001250 (Seizure) to see the full path to root.
tu_BioPortal_get_hierarchyView - Available
BioPortal Search Ontology Terms
Search across 900+ biomedical ontologies in BioPortal (NCBO) for concepts matching a query. BioPortal is the world's largest repository of biomedical ontologies including GO (gene function), HPO (phenotypes), DOID (diseases), CHEBI (chemicals), UBERON (anatomy), SNOMED, MeSH, and 900+ more. Returns matching concepts with labels, IDs, definitions, and synonyms. Optionally filter by specific ontologies. Example: search 'melanoma' to find disease terms across DOID, NCIT, MESH, SNOMED; search 'apop…
tu_BioPortal_search_ontology_termsView - Available
Bioregistry Get Registry
Get metadata for a biological database/ontology by its Bioregistry prefix. Returns name, description, homepage, ID pattern, URI format, example IDs, and synonyms. Covers 2600+ resources including UniProt, ChEBI, GO, PDB, Ensembl, HGNC, and many more.
tu_Bioregistry_get_registryView - Available
Bioregistry Resolve Reference
Resolve a compact identifier (prefix:id) to provider URLs across 2600+ biological databases. Given a database prefix (e.g., 'uniprot', 'chebi', 'go', 'pubmed') and an identifier, returns URLs from all known providers. Useful for cross-database linking and identifier resolution.
tu_Bioregistry_resolve_referenceView - Available
Bioregistry Search Registries
Search the Bioregistry for databases, ontologies, and resources by keyword. Use to find the correct prefix for identifier resolution, or to discover databases related to a topic (e.g., 'protein', 'gene ontology', 'metabolite').
tu_Bioregistry_search_registriesView - Available
BioRxiv Get Preprint
Get full metadata for a specific bioRxiv or medRxiv preprint by its DOI. Returns comprehensive metadata including title, authors, abstract, publication date, version, category, corresponding author, funding information, and URLs (HTML, PDF, XML). Response time: <1 second. Use when you have a bioRxiv DOI (format: 10.1101/YYYY.MM.DD.XXXXXX). To SEARCH for preprints by keywords, use EuropePMC_search_articles with 'SRC:PPR' filter (fast, relevance-ranked), or web_search with 'site:biorxiv.org'. bio…
tu_BioRxiv_get_preprintView - Available
BioRxiv List Recent Preprints
List recent bioRxiv or medRxiv preprints by date range. Returns up to 100 preprints posted between the given start and end dates, with title, authors, abstract, DOI, category, and publication status. The bioRxiv API returns results in batches of up to 100; use the cursor parameter for pagination (0 for first page, 100 for second, etc.). Use this to monitor new preprints in a date range or discover recent work in a field. For searching by DOI, use BioRxiv_get_preprint instead. For searching by k…
tu_BioRxiv_list_recent_preprintsView - Available
BioSamples Get Sample
Get detailed metadata for a specific biological sample from the EBI BioSamples database by accession. Returns sample name, organism, tissue, disease, experimental context, and links to associated data archives (ENA, ArrayExpress, EVA). BioSamples contains 60+ million samples from all major sequence and functional genomics archives. Example: SAMEA104228123 returns a metagenomic sample from the Qiita microbiome project.
tu_BioSamples_get_sampleView - Available
BioSamples Search
Search the EBI BioSamples database by text query across 60+ million biological samples. Returns matching samples with accessions, names, organisms, and key characteristics. Useful for finding source material metadata for sequencing, proteomics, and other experiments. Example: searching 'breast cancer' returns thousands of tumor samples with tissue/disease annotations.
tu_BioSamples_searchView - Available
BioSamples Search By Filter
Search the EBI BioSamples database with structured attribute filters. More precise than text search - filters by specific metadata fields like organism, tissue, or disease. Example: attribute='organism', value='Homo sapiens' returns human samples; attribute='disease', value='melanoma' returns melanoma-specific samples.
tu_BioSamples_search_by_filterView - Available
Biostudies Get Study
Get detailed information about a specific BioStudies study by accession number. Returns complete study metadata including attributes, sections, files, and links. Supports various accession formats (e.g., S-BSST123, E-GEOD-12345 for ArrayExpress). HTML responses are converted to Markdown.
tu_biostudies_get_studyView - Available
Biostudies Get Study Files
Get list of files associated with a BioStudies study. Returns file metadata including paths, sizes, types, and attributes. Files are extracted from the study's section hierarchy.
tu_biostudies_get_study_filesView - Available
Biostudies Search
Search BioStudies repository for biological studies across all collections. BioStudies is a comprehensive repository at EMBL-EBI that hosts diverse study types including genomics, transcriptomics, proteomics, imaging data, and more. Supports full-text search with pagination and sorting. HTML responses are automatically converted to Markdown using markitdown.
tu_biostudies_searchView - Available
Biostudies Search By Collection
Search BioStudies by specific collection (e.g., ArrayExpress, BioImages, EuropePMC). Collections are curated datasets within BioStudies that focus on specific data types or domains. HTML responses are automatically converted to Markdown.
tu_biostudies_search_by_collectionView - Available
BioTools Get Tool
Get detailed information about a specific bioinformatics tool from the ELIXIR Bio.tools registry by its biotoolsID. Returns comprehensive metadata including name, description, homepage, tool type (command-line, web app, database, etc.), EDAM topics and operations, supported input/output data formats, programming languages, operating systems, license, documentation links, and credits. Use biotoolsID values discovered via BioTools_search. Example: 'samtools' for SAMtools, 'blast' for NCBI BLAST.
tu_BioTools_get_toolView - Available
BioTools Search
Search the ELIXIR Bio.tools registry for bioinformatics software tools, databases, and web services. Bio.tools is the comprehensive registry of bioinformatics resources maintained by ELIXIR, containing 30,000+ entries. Returns tool names, descriptions, types, topics, and homepages. Use this to find software for specific bioinformatics tasks (e.g., 'sequence alignment', 'variant calling', 'protein structure prediction'). Example: search 'RNA-seq' to find RNA sequencing analysis tools.
tu_BioTools_searchView - Available
BioTools Search By Operation
Search the ELIXIR Bio.tools registry for bioinformatics tools by EDAM operation (what the tool does). Returns tools that perform a specific computational operation. Useful operations include: 'Sequence alignment', 'Structure prediction', 'Variant calling', 'Gene expression analysis', 'Phylogenetic inference', 'Protein function prediction', 'Molecular docking', 'Genome annotation', 'Visualisation', 'Data retrieval', 'Statistical calculation', 'Clustering'. Use this to find software for a specifi…
tu_BioTools_search_by_operationView - Available
BioTools Search By Topic
Search the ELIXIR Bio.tools registry for bioinformatics tools by EDAM scientific topic. Returns tools classified under a specific research domain. Useful topics include: 'Proteomics', 'Genomics', 'Transcriptomics', 'Structural biology', 'Phylogenetics', 'Metabolomics', 'Metagenomics', 'Epigenomics', 'Sequence analysis', 'Systems biology', 'Machine learning', 'Molecular interactions', 'Population genetics'. Use this to discover all available software for a specific research field.
tu_BioTools_search_by_topicView - Available
BioTools Search By Type
Search the ELIXIR Bio.tools registry for bioinformatics resources by tool type. Returns resources of a specific category. Available types: 'Command-line tool', 'Web application', 'Web API', 'Web service', 'Database portal', 'Desktop application', 'Library', 'Plug-in', 'Suite', 'Script', 'Workflow', 'Workbench', 'Ontology', 'Bioinformatics portal'. Use this to find, for example, all database portals or all web APIs for a research domain.
tu_BioTools_search_by_typeView - Available
BLAST Nucleotide Search
Search nucleotide sequences using NCBI BLAST blastn against nucleotide databases. Requires Biopython (pip install biopython). IMPORTANT: This tool uses NCBI's remote BLAST service which is inherently SLOW - expect 5-30 minutes per query depending on server load. This is normal NCBI behavior, not a bug. For faster results, use smaller databases like 'refseq_select_rna' instead of 'nt'.
tu_BLAST_nucleotide_searchView - Available
BLAST Protein Search
Search protein sequences using NCBI BLAST blastp against protein databases. Requires Biopython (pip install biopython). IMPORTANT: This tool uses NCBI's remote BLAST service which is inherently SLOW - expect 5-30 minutes per query depending on server load. This is normal NCBI behavior, not a bug. For faster results, use 'swissprot' or 'pdb' databases instead of 'nr'.
tu_BLAST_protein_searchView - Available
BMRB Get Entries By PDB ID
Find BMRB NMR entries associated with a PDB structure. Returns BMRB entry IDs linked to a given PDB accession, with match type (BLAST or author-provided). Useful for finding NMR chemical shift data for a protein with known crystal structure.
tu_BMRB_get_entries_by_pdb_idView - Available
BMRB Get Entries By Uniprot
Find BMRB NMR entries for a protein using its UniProt accession. Returns NMR data entries associated with the protein, allowing cross-referencing between sequence databases and NMR spectroscopy data.
tu_BMRB_get_entries_by_uniprotView - Available
BMRB Get Entry
Get NMR data entry from the Biological Magnetic Resonance Data Bank (BMRB). Returns the full NMR-STAR data for a BMRB entry including chemical shifts, coupling constants, relaxation data, structure restraints, and experimental conditions. BMRB archives NMR spectroscopy data for biological macromolecules (proteins, nucleic acids, carbohydrates) and small molecules. Entry IDs are integers (e.g., 15000 for villin headpiece, 4020 for ubiquitin).
tu_BMRB_get_entryView - Available
BMRB Get Entry Citation
Get the citation/publication information for a BMRB NMR entry in BibTeX format. Returns author list, title, journal, DOI, and year. Useful for citing NMR data sources.
tu_BMRB_get_entry_citationView - Available
BMRB Search By Keyword
Search BMRB (Biological Magnetic Resonance Data Bank) entries by keyword or molecule name. Returns matching BMRB entries with their IDs and summary information. Use the 'database' parameter to search either macromolecules (proteins, nucleic acids) or metabolomics small molecules.
tu_BMRB_search_by_keywordView - Available
BRENDA Get Enzyme Kinetics
Get comprehensive enzyme kinetic parameters WITHOUT requiring BRENDA credentials. Retrieves enzyme identity (name, catalytic activity, alternative names) from ExPASy ENZYME and kinetic parameters (Km, kcat, Ki, Vmax, kcat/Km) from SABIO-RK. If BRENDA credentials are set (BRENDA_EMAIL, BRENDA_PASSWORD), additionally fetches optimal pH, optimal temperature, and specific activity from BRENDA. Accepts EC number (e.g., 1.1.1.1) or enzyme name (e.g., alcohol dehydrogenase). Returns aggregated summary…
tu_BRENDA_get_enzyme_kineticsView - Available
BridgeDb Attributes
Get properties and attributes for a biological identifier from BridgeDb. Returns metadata such as molecular formula, molecular weight, InChI, InChIKey, synonyms, and other database-specific attributes. Especially useful for metabolites (HMDB, ChEBI, KEGG compound IDs) to get chemical properties, and for genes to get symbol and name information. System codes: Ch=HMDB, Ce=ChEBI, Ck=KEGG Compound, En=Ensembl, H=HGNC, S=UniProt.
tu_BridgeDb_attributesView - Available
BridgeDb Search
Search for biological identifiers by name using BridgeDb. Enter a gene name, metabolite name, or protein name and get matching identifiers from supported databases. Useful for finding database-specific IDs when you only know the common name. Searches across all database sources available for the specified organism. Returns identifier-database pairs. For example, searching 'glucose' returns matches from MetaCyc, KEGG, and other databases.
tu_BridgeDb_searchView - Available
BridgeDb Xrefs
Map a biological identifier across databases using BridgeDb. Given an identifier from one database (e.g., HMDB0000122 from HMDB), returns equivalent identifiers in all other supported databases (ChEBI, KEGG, PubChem, Wikidata, CAS, etc.). Supports genes, proteins, metabolites, and other biological entities across 45+ databases and 30+ organisms. System codes: Ch=HMDB, Ce=ChEBI, Ck=KEGG Compound, En=Ensembl, H=HGNC, S=UniProt, L=NCBI Gene, Cpc=PubChem-compound, Ik=InChIKey. Use this when you nee…
tu_BridgeDb_xrefsView - Available
BVBRC Get Genome
Get detailed genome information from BV-BRC (Bacterial and Viral Bioinformatics Resource Center, formerly PATRIC). Returns genome metadata including organism name, taxonomy, genome size, GC content, assembly status, host, disease, and isolation information. BV-BRC is the primary NIAID-funded resource for bacterial and viral pathogen genomics. Example: '83332.12' returns Mycobacterium tuberculosis H37Rv.
tu_BVBRC_get_genomeView - Available
BVBRC Get Protein Structure
Get detailed pathogen protein structure information from BV-BRC by PDB ID. Returns structure metadata including title, resolution, method, gene, organism, UniProt accession, PMID references, sequence, and taxonomic lineage. Useful for structural biology research on pathogen proteins. Example: '6VSB' returns the SARS-CoV-2 spike glycoprotein structure.
tu_BVBRC_get_protein_structureView - Available
BVBRC Get Taxonomy
Get detailed taxonomy information for a pathogen from BV-BRC by NCBI Taxonomy ID. Returns taxon name, rank, lineage, alternative names, and genome counts. Useful for resolving pathogen taxonomy and understanding classification hierarchy. Example: '2697049' returns SARS-CoV-2 taxonomy with full lineage from Viruses to species.
tu_BVBRC_get_taxonomyView - Available
BVBRC Search Amr
Search for antimicrobial resistance (AMR) phenotype data in BV-BRC. Returns resistance/susceptibility data for pathogens against specific antibiotics, including MIC values, testing methods, and evidence. Essential for AMR surveillance research. Example: search for methicillin resistance in Staphylococcus aureus.
tu_BVBRC_search_amrView - Available
BVBRC Search Epitopes
Search for pathogen epitopes (B-cell and T-cell) in BV-BRC. Returns epitope sequences, types, protein associations, and assay results for vaccine and immunology research. Covers epitopes from IEDB for bacteria and viruses including SARS-CoV-2, influenza, tuberculosis, and other pathogens. Example: search taxon_id 2697049 for SARS-CoV-2 spike glycoprotein epitopes.
tu_BVBRC_search_epitopesView - Available
BVBRC Search Genome Features
Search for genome features (genes, proteins, CDS) in BV-BRC pathogen genomes. Find specific genes like antimicrobial resistance genes (mecA, blaTEM), virulence factors, or any annotated feature across thousands of pathogen genomes. Example: search for 'mecA' gene (methicillin resistance) across all genomes.
tu_BVBRC_search_genome_featuresView - Available
BVBRC Search Genomes
Search for pathogen genomes in BV-BRC by organism name or keyword. Returns a list of matching genomes with metadata. Useful for finding bacterial or viral genome assemblies. Example: 'Staphylococcus aureus' returns all S. aureus genome assemblies.
tu_BVBRC_search_genomesView - Available
BVBRC Search Pathways
Search for metabolic pathways in pathogen genomes from BV-BRC. Returns KEGG pathway assignments with EC numbers, enzyme descriptions, and genome associations. Useful for understanding pathogen metabolism and identifying potential drug targets. Example: search for pathways in M. tuberculosis (taxon_id 1773).
tu_BVBRC_search_pathwaysView - Available
BVBRC Search Protein Structures
Search for pathogen protein structures in BV-BRC by organism (taxon_id), gene name, or experimental method. Returns PDB entries with titles, resolution, methods, and references. Covers viral and bacterial protein structures linked to BV-BRC genome annotations. Example: search for all SARS-CoV-2 protein structures with taxon_id 2697049.
tu_BVBRC_search_protein_structuresView - Available
BVBRC Search Specialty Genes
Search for specialty genes in BV-BRC including virulence factors, antibiotic resistance genes, drug targets, and human homologs across pathogen genomes. Data sourced from CARD, NDARO, VFDB, PATRIC_VF, and other curated databases. Essential for identifying key pathogenicity and resistance determinants. Example: search for 'mecA' (methicillin resistance gene in Staphylococcus).
tu_BVBRC_search_specialty_genesView - Available
BVBRC Search Subsystems
Search for functional subsystems (curated groups of functionally related proteins) in BV-BRC pathogen genomes. Returns subsystem classifications including superclass, class, subclass, and specific functional roles. Useful for understanding pathogen functional organization and comparing metabolic capabilities. Example: search for virulence-related subsystems in M. tuberculosis.
tu_BVBRC_search_subsystemsView - Available
BVBRC Search Surveillance
Search influenza and pathogen surveillance data in BV-BRC. Returns surveillance samples with host species, geographic location, subtype, and collection metadata. Essential for epidemiological tracking of influenza subtypes (H5N1, H1N1, H3N2) and other monitored pathogens across global regions. Example: search for H5N1 cases in North America.
tu_BVBRC_search_surveillanceView - Available
BVBRC Search Taxonomy
Search for pathogen taxonomy entries in BV-BRC by keyword. Returns matching taxa with names, ranks, genome counts, and lineage information. Useful for discovering pathogen taxon IDs for use with other BV-BRC tools. Example: 'Mycobacterium' returns the Mycobacterium genus and all its species.
tu_BVBRC_search_taxonomyView - Available
CADD Get Position Scores
Get CADD scores for all possible substitutions at a genomic position. Returns PHRED scores for A, C, G, T variants at the position. Useful for saturation mutagenesis analysis.
tu_CADD_get_position_scoresView - Available
CADD Get Range Scores
Get CADD scores for all variants in a genomic range (max 100bp). Returns all pre-computed deleteriousness scores in the region. Use for hotspot analysis or exon scanning.
tu_CADD_get_range_scoresView - Available
CADD Get Variant Score
Get CADD deleteriousness score for a specific variant. PHRED scores: >=20 top 1% deleterious (pathogenic threshold ~15-20), >=30 top 0.1%. Input: chromosome, position, ref, alt alleles. Supports GRCh37 and GRCh38.
tu_CADD_get_variant_scoreView - Available
Cancer Biomarkers Disease Target Score
Extract disease-target association scores from cancer biomarkers data. This includes known cancer biomarkers.
tu_cancer_biomarkers_disease_target_scoreView - Available
Cancer Gene Census Disease Target Score
Extract disease-target association scores from Cancer Gene Census. This provides curated cancer gene data.
tu_cancer_gene_census_disease_target_scoreView - Available
CancerPrognosis Get Gene Expression
Fetch gene expression values (RNA-seq) for a specific gene across cancer samples in a TCGA or cBioPortal study. Returns per-sample expression values with summary statistics (mean, median, min, max). Output can be combined with survival data for expression-based prognostic analysis. Supports 33 TCGA cancer types and any cBioPortal study with mRNA expression data.
tu_CancerPrognosis_get_gene_expressionView - Available
CancerPrognosis Get Study Summary
Get summary information for a cancer study including available molecular profiles (mutations, expression, CNV), survival-related clinical attributes, and sample counts. Useful for determining what data is available before querying expression or survival data. Lists all 33 supported TCGA cancer type abbreviations.
tu_CancerPrognosis_get_study_summaryView - Available
CancerPrognosis Get Survival Data
Retrieve patient-level overall survival (OS) and disease-free survival (DFS) data from a TCGA or cBioPortal cancer study. Returns per-patient survival times, event status, and age. Data can be directly used with Survival_kaplan_meier or Survival_log_rank_test tools for custom survival analyses. Supports 33 TCGA cancer types (BRCA, LUAD, COAD, GBM, etc.) and 400+ cBioPortal studies.
tu_CancerPrognosis_get_survival_dataView - Available
CancerPrognosis Search Studies
Search cBioPortal for cancer genomics studies by keyword. Find studies by cancer type, institution, or data type. Returns study IDs that can be used with other CancerPrognosis tools. Covers 400+ studies including all TCGA projects, AACR GENIE, MSK-IMPACT, and institutional cohorts.
tu_CancerPrognosis_search_studiesView - Available
CancerVar Classify Variant
Classify a somatic cancer variant using AMP/ASCO/CAP 2017 guidelines (CancerVar). Returns a Tier I–IV assignment: Tier I = Strong Clinical Significance (FDA-approved therapy association), Tier II = Potential Clinical Significance (investigational/preclinical evidence), Tier III = Unknown Clinical Significance, Tier IV = Benign/Likely Benign. Also returns 12 CBP evidence scores and an Oncogenicity Pathogenicity Index (OPAI, 0–1). Input requires genomic coordinates in hg19 or hg38. No API key nee…
tu_CancerVar_classify_variantView - Available
CATH Get Domain Summary
Get CATH structural classification for a specific protein domain from PDB. Domains are identified by PDB code + chain + domain number (e.g., 1cukA01). Returns the full CATH hierarchy (Class, Architecture, Topology, Homologous superfamily) and residue count. Example: '1cukA01' returns CATH ID 2.40.50.140.116.1.1.1.1 (Mainly Beta class, Nucleic acid-binding proteins superfamily) with 212 residues.
tu_CATH_get_domain_summaryView - Available
CATH Get Funfam
Get details for a specific functional family (FunFam) within a CATH superfamily. Returns FunFam name, description, member count, seed alignment size, DOPS quality score, and representative domain ID. FunFams represent groups of domains predicted to share the same biological function. Use CATH_list_funfams to browse FunFams within a superfamily, or CATH_get_domain_summary to find which FunFam a specific domain belongs to.
tu_CATH_get_funfamView - Available
CATH Get Superfamily
Get CATH protein structure superfamily information by CATH ID. CATH classifies protein domain structures into a four-level hierarchy: Class, Architecture, Topology, Homologous superfamily. Superfamilies group evolutionarily related domains. Returns classification name, number of families, and domain counts. Example: 2.40.50.140 returns 'Nucleic acid-binding proteins' with 227 S35 families and 2,879 total domains.
tu_CATH_get_superfamilyView - Available
CATH List Funfams
List functional families (FunFams) within a CATH superfamily. FunFams group protein domains that share the same function, identified by conserved specificity-determining residues. Returns FunFam numbers, names, member counts, and representative domain IDs. Example: superfamily 1.10.510.10 (Globin-like) has 4,524 FunFams including CaM kinase (2,698 members) and Myosin light chain kinase (1,234 members). Use CATH_get_superfamily to find superfamily IDs first.
tu_CATH_list_funfamsView - Available
CBioPortal Get Cancer Studies
Get list of cancer studies from cBioPortal
tu_cBioPortal_get_cancer_studiesView - Available
CBioPortal Get Cancer Types
Get all cancer types from cBioPortal including TCGA, Pan-Cancer Atlas, and other studies. Returns cancerTypeId, name, and clinical trial keywords for filtering studies.
tu_cBioPortal_get_cancer_typesView - Available
CBioPortal Get Clinical Attributes
Get available clinical attributes for a cancer study. Returns attribute IDs, names, and data types. Use to know what clinical data can be queried (e.g., stage, grade, survival).
tu_cBioPortal_get_clinical_attributesView - Available
CBioPortal Get Clinical Data
Get clinical data for all samples in a study. Returns patient-level data like tumor stage, histology, survival status. Filter by clinical attribute ID for specific data types.
tu_cBioPortal_get_clinical_dataView - Available
CBioPortal Get Gene Info
Get detailed information about a specific gene by Entrez Gene ID. Returns gene symbol, aliases, type, and chromosome location.
tu_cBioPortal_get_gene_infoView - Available
CBioPortal Get Gene Panel Genes
Get all genes in a specific gene panel. Essential for understanding what genes are covered when analyzing mutation frequency data from targeted sequencing studies.
tu_cBioPortal_get_gene_panel_genesView - Available
CBioPortal Get Gene Panels
Get all gene panels used in cBioPortal studies. Gene panels define which genes were sequenced in a study. Use to understand coverage before querying mutation data.
tu_cBioPortal_get_gene_panelsView - Available
CBioPortal Get Genes
Search for genes by keyword or query (gene symbol or alias). Alias: cBioPortal_search.
tu_cBioPortal_get_genesView - Available
CBioPortal Get Molecular Profiles
Get molecular profiles for a cancer study. Molecular profiles include mutation data, copy number alterations, mRNA expression, etc.
tu_cBioPortal_get_molecular_profilesView - Available
CBioPortal Get Mutations
Get mutation data for specific genes in a cancer study. This uses the updated cBioPortal API that requires molecular profiles.
tu_cBioPortal_get_mutationsView - Available
CBioPortal Get Patients
Get all patients in a cancer study
tu_cBioPortal_get_patientsView - Available
CBioPortal Get Sample Lists
Get all sample lists (cohort definitions) in a cancer study. Sample lists define groups like 'samples with mutations', 'samples with CNA data'. Use for targeted queries.
tu_cBioPortal_get_sample_listsView - Available
CBioPortal Get Samples
Get all samples in a cancer study
tu_cBioPortal_get_samplesView - Available
Cdc Data Get Dataset
Retrieve data from a specific CDC dataset on Data.CDC.gov. Requires a dataset ID (view ID) which can be found using cdc_data_search_datasets.
tu_cdc_data_get_datasetView - Available
CEDA Search Datasets
Search the CEDA (Centre for Environmental Data Analysis) data catalogue for UK and global environmental, climate, and atmospheric science datasets. CEDA archives over 450TB of data from atmospheric, earth observation, and climate science research. Returns dataset records with titles, abstracts, time periods, geographic extents, and access information. Essential for finding atmospheric measurements, climate model outputs, satellite data, and Earth observation datasets.
tu_CEDA_search_datasetsView - Available
CellMarker List Cell Types
List available cell types in the CellMarker 2.0 database for a given tissue and species. Returns all annotated cell types found in that tissue context. Use this to explore what cell types have been characterized in a specific tissue, or to get an overview of CellMarker's coverage. Supports both normal and cancer cell contexts.
tu_CellMarker_list_cell_typesView - Available
CellMarker Search By Cell Type
Search the CellMarker 2.0 database for marker genes of a specific cell type. Returns curated lists of marker genes that define a cell type, sourced from single-cell RNA-seq studies and experimental literature. Use this to find canonical markers for cell type annotation, or to build gene sets for cell scoring. Example: 'T cell' returns CD3D, CD3E, CD3G, etc. 'Macrophage' returns CD68, CD163, MARCO, etc.
tu_CellMarker_search_by_cell_typeView - Available
CellMarker Search By Gene
Search the CellMarker 2.0 database for cell types that express a given marker gene. Returns curated cell type annotations from single-cell RNA-seq studies and experimental literature. CellMarker 2.0 covers >500 cell types across >400 tissue types in human and mouse. Use this to identify which cell types a gene marks, enabling cell type annotation in scRNA-seq analysis. Example: CD3D marks T cells, CD68 marks macrophages, EPCAM marks epithelial cells.
tu_CellMarker_search_by_geneView - Available
CellMarker Search Cancer Markers
Search the CellMarker 2.0 database for cancer-specific cell markers. Returns marker genes and cell types found in cancer tissue contexts. Use this to identify tumor-specific markers, immune cell markers in the tumor microenvironment, or cancer stem cell markers. Filters results to 'Cancer cell' annotations only. At least one search parameter is required: cancer_type (tissue), gene_symbol, or cell_type.
tu_CellMarker_search_cancer_markersView - Available
Cellosaurus Get Cell Line Info
Get detailed information about a specific cell line using its Cellosaurus accession number (CVCL_ format).
tu_cellosaurus_get_cell_line_infoView - Available
Cellosaurus Query Converter
Convert natural language queries to Solr syntax for Cellosaurus API searches. Uses semantic similarity to map terms to appropriate fields.
tu_cellosaurus_query_converterView - Available
Cellosaurus Search Cell Lines
Search Cellosaurus cell lines using the /search/cell-line endpoint. Supports Solr query syntax for precise field-based searches.
tu_cellosaurus_search_cell_linesView - Available
CellPainting Get Screen Plates
Get the list of plates in a Cell Painting screen from the Image Data Resource (IDR). Returns plate IDs and names for the given screen. Use CellPainting_search_screens first to find valid screen IDs. Example: Get all plates in idr0016 screenA (screen_id=1251).
tu_CellPainting_get_screen_platesView - Available
CellPainting Get Well Data
Get well-level metadata and image links for a specific plate in a Cell Painting experiment from the Image Data Resource (IDR). Returns well IDs, row/column positions, well URLs, and links to microscopy images. Use CellPainting_get_screen_plates to find valid plate IDs. Example: Get the first 5 wells of plate 5104 from idr0016.
tu_CellPainting_get_well_dataView - Available
CellPainting Search Screens
List available Cell Painting screens/studies in the Image Data Resource (IDR). Returns screen names, IDs, descriptions, and URLs. Automatically filters to Cell Painting and morphological profiling studies (e.g. idr0016 Wawer ~30,000 compounds, JUMP-CP). Use the optional query parameter to further filter by keyword. Example: Discover which Cell Painting datasets are available in IDR.
tu_CellPainting_search_screensView - Available
CELLxGENE Download H5ad
Download original H5AD (HDF5-based AnnData) files from CELLxGENE datasets or get their URIs. Access unprocessed data from individual studies with complete metadata and analysis results. Requires dataset_id from CELLxGENE Discover. Note: Files can be large (GBs). Use for: accessing raw data, reproducibility, offline analysis, custom processing pipelines.
tu_CELLxGENE_download_h5adView - Available
CELLxGENE Get Cell Metadata
Query cell metadata from CELLxGENE Census (50M+ human/mouse single cells). CRITICAL: obs_value_filter is REQUIRED - unfiltered queries will timeout. Common filters: tissue_general (brain, heart, lung, liver, blood, kidney, skin, eye, intestine), cell_type (T cell, B cell, neuron, macrophage, fibroblast), disease (normal, COVID-19, Alzheimer disease, cancer). Start with: obs_value_filter='tissue_general == "lung" and disease == "normal"'. Returns cell type, tissue, disease, donor info. Use for:…
tu_CELLxGENE_get_cell_metadataView - Available
CELLxGENE Get Census Versions
Get list of available CELLxGENE Census versions with release dates and descriptions. The Census contains single-cell RNA-seq data from 50M+ cells (human, mouse, non-human primates). Latest LTS release: 2025-11-08. Prerequisites: Requires 'cellxgene-census' package (install: pip install Cortexa Tool Mesh[singlecell]). Use for: checking available data versions, selecting stable vs latest builds, understanding data updates.
tu_CELLxGENE_get_census_versionsView - Available
CELLxGENE Get Embeddings
Access pre-calculated cell embeddings (scVI, Geneformer) from CELLxGENE Census. Returns available embedding names or specific embedding data. Embeddings enable rapid visualization and analysis without recomputation. Use for: UMAP/t-SNE visualization, cell similarity analysis, transfer learning.
tu_CELLxGENE_get_embeddingsView - Available
CELLxGENE Get Expression Data
Query gene expression data from CELLxGENE Census (50M+ cells, 60K+ genes). CRITICAL: At least one filter REQUIRED - unfiltered queries timeout. Start with: obs_value_filter='tissue_general == "lung" and disease == "normal"' AND var_value_filter='feature_name in ["TP53", "EGFR"]'. Returns AnnData summary (dimensions, metadata). For full matrix data, use Python API with sufficient memory.
tu_CELLxGENE_get_expression_dataView - Available
CELLxGENE Get Gene Metadata
Query gene (variable) metadata from CELLxGENE Census. Returns gene symbols, Ensembl IDs, feature types, and summary statistics. Filter by gene name, feature type, or expression characteristics. Use for: finding genes of interest, validating gene symbols, exploring feature presence across datasets.
tu_CELLxGENE_get_gene_metadataView - Available
CELLxGENE Get Presence Matrix
Get feature presence matrix showing which genes are measured in which datasets. Returns sparse matrix dimensions and density. Useful for understanding data completeness and selecting datasets with genes of interest. Use for: checking gene coverage, dataset selection, quality assessment.
tu_CELLxGENE_get_presence_matrixView