big-data

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.

python data-science machine-learning data-mining tutorial r big-data gpu cuda kaggle gbdt gbm gpu-computing decision-trees gradient-boosting coreml catboost categorical-features

Updated Jun 11, 2024
Python

apache / iotdb

Star

Apache IoTDB

java iot database big-data timeseries nosql tsdb

Updated Jun 11, 2024
Java

crate / crate

Star

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.

Updated Jun 11, 2024
Java

arkime / arkime

Star

Arkime is an open source, large scale, full packet capturing, indexing, and database system.

javascript c security big-data pcap network-monitoring nsm packet-capture

Updated Jun 11, 2024
JavaScript

delta-io / delta

Star

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

big-data spark analytics acid delta-lake

Updated Jun 11, 2024
Scala

apache / spark

Star

Apache Spark - A unified analytics engine for large-scale data processing

python java r scala sql big-data spark jdbc

Updated Jun 11, 2024
Scala

Improve this page

Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

big-data

Here are 4,036 public repositories matching this topic...

ClickHouse / ClickHouse

ytsaurus / ytsaurus

astrolabsoftware / fink-website

seung-lab / cloud-volume

quickwit-oss / quickwit

paradedb / paradedb

trinodb / trino

vespa-engine / vespa

apache / datafusion

trieu / leo-cdp-free-edition

apache / ozone

prestodb / presto

apache / helix

apache / beam

catboost / catboost

apache / iotdb

crate / crate

arkime / arkime

delta-io / delta

apache / spark

Improve this page

Add this topic to your repo