Odixcity Consulting is seeking a Dataset Curator to design, maintain, and optimize high-quality datasets for AI, ML, and LLM projects. The role involves collaborating with data scientists and engineers to gather, clean, annotate, and validate datasets, ensuring reliable AI model training and evaluation.
Key Responsibilities
Curate, collect, and structure datasets for AI and ML training purposes
Validate datasets for accuracy, completeness, and consistency
Annotate and label datasets according to project guidelines
Identify and correct data inconsistencies, duplicates, and anomalies
Maintain metadata and documentation for datasets
Collaborate with AI trainers and data engineers to define dataset requirements
Ensure datasets are ethically sourced and free from bias
Continuously monitor dataset quality and recommend improvements
Required Qualifications
Bachelor’s or Master’s degree in Data Science, Computer Science, Statistics, Information Systems, or a related field
5 to 8 years of proven experience in dataset curation, data analysis, or data management
Proficiency in Excel, Google Sheets, SQL, and/or Python for dataset handling
Knowledge of data cleaning, normalization, and transformation techniques
Familiarity with data annotation tools and platforms
Understanding of structured, semi-structured, and unstructured datasets
Experience with database management and version control systems
Awareness of AI dataset ethics and bias mitigation
Optional certifications: Google Data Analytics, Data Management/Curation, AI/Data Annotation Training
Experience handling large-scale datasets for AI, ML, or analytics projects
Interested and qualified candidates should send their CV
How to Apply
Apply via Email
Send your application via email with the provided subject line