ClickHouse® is a real-time analytics DBMS
-
Updated
Jun 11, 2024 - C++
ClickHouse® is a real-time analytics DBMS
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Read and write Neuroglancer datasets programmatically.
Cloud-native search engine for observability. An open-source alternative to Datadog, Elasticsearch, Loki, and Tempo.
Postgres for Search and Analytics
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
AI + Data, online. https://vespa.ai
Apache DataFusion SQL Query Engine
The binary build of LEO CDP Free Edition for training purposes
Scalable, redundant, and distributed object store for Apache Hadoop
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.
Arkime is an open source, large scale, full packet capturing, indexing, and database system.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."