STACKQUADRANT

NVIDIA-NeMo/Curator

Fine-tuning Tools

Scalable data pre processing and curation toolkit for LLMs

GitHub Metrics
Stars
1.4k
Forks
224
Open Issues
184
Watchers
19
Contributors
53
Weekly Commits
0
Language
Python
License
Apache-2.0
Last Commit
Mar 2, 2026
Created
Mar 14, 2024
Latest Release
v1.1.0
Release Date
Feb 23, 2026
Synced: Mar 3, 2026
Quality Scores
Documentation Qualityw: 20%
0.0
Community Healthw: 20%
0.0
Maintenance Velocityw: 15%
0.0
API Design & DXw: 20%
0.0
Production Readinessw: 15%
0.0
Ecosystem Integrationw: 10%
0.0
Tags
datadata-curationdata-prepdata-preparationdata-processingdata-processing-pipelinesdata-qualitydatacurationdatarecipesdeduplication
Radar
No scores yet