The open-source tool for building high-quality datasets and computer vision models
Updated Jun 12, 2024 - Python
Repo for PSRC's Regional Travel Studies, 2014 onward
A Python library and its CLI for converting GrabCraft to schema (more specifically, Litematica schematic) files
Library Carpentry: OpenRefine
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
Data2Neo is a library that simplifies the conversion of data in relational format to a graph knowledge database.
Data Analysis with the Pandas Library 📊
pyDVL is a library of stable implementations of algorithms for data valuation and influence function computation
We leverage machine learning and data analysis to address real-world challenges in the copper industry. Our documentation encompasses data preprocessing, feature engineering, classification, regression, and model selection. Explore how we've enhanced predictive capabilities to optimize manufacturing solutions.
💻☕ This repository is a resource for learning data science, including learning materials and projects. It covers topics such as data analysis, machine learning, and programming.
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
A guide to all my projects
R package to clean and standardize epidemiological data
🗺️ Data Cleaning and Textual Data Visualization 🗺️
Test data management tool for any data source, batch or real-time
Prepping tables for machine learning
A toolbox of simple solutions for common data cleaning problems.