Back to catalog
Cortexa research tool·structure·huggingface
HuggingFace Search Datasets
Search HuggingFace Hub for machine learning datasets by keyword. HuggingFace hosts 100,000+ open datasets spanning text, image, audio, tabular, and biological sequence data. Returns dataset IDs, authors, download counts, likes, tags (including format, modality, size category), and descriptions. Use this to find training/evaluation datasets for ML models, benchmark datasets, or domain-specific datasets (e.g., protein sequences, medical images, genomics).
Livelast probed 2026-04-30 23:07Z· 96msCortexa
Status
The most recent probe succeeded. Safe to call from the agent today.
- Last probed
- 1d ago
- Last success
- 1d ago
- Last latency
- 96ms
Schema
JSON Schema the agent (or your API call) must match.
{
"properties": {
"limit": {
"description": "Maximum number of results to return (default 20, max 1000)",
"type": [
"integer",
"null"
]
},
"search": {
"description": "Search query for dataset name or topic (e.g., 'protein sequence', 'medical imaging', 'sentiment analysis', 'genomics')",
"type": "string"
}
},
"required": [
"search"
],
"type": "object"
}Examples (1)
Default HuggingFace Search Datasets call
public-safeinput
{
"search": ""
}Expected response keys: success
Anonymous-safe example. Rate-limited; no sign-in required.
Identifiers
- Catalog ID
- tu_HuggingFace_search_datasets
- Tool name
- HuggingFace_search_datasets
- Added
- 2026-04-30 22:47Z
- Tags
- tooluniverse