본문 바로가기

IT143

Project Nessie Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics Transactional Catalog for Data Lakes Git-inspired data version control Cross-table transactions and visibility Open data lake approach, supporting Hive, Spark, Dremio, AWS Athena, etc. Works with Apache Iceberg and Delta Lake tables Run as a docker image, AWS Lambda or fork it on GitHub Get in touch via our Google Group.. 2022. 10. 20.
Presto https://prestodb.io/ Presto | Distributed SQL Query Engine for Big Data Distributed SQL Query Engine for Big Data prestodb.io Presto: Fast and reliable SQL query engine for data analytics and the open lakehouse For data engineers who struggle with managing multiple query languages and interfaces to siloed databases and storage, Presto is the fast and reliable engine that provides one simple ANSI.. 2022. 10. 19.
ClickHouse https://clickhouse.com/ Fast Open-Source OLAP DBMS - ClickHouse sudo apt-get install apt-transport-https ca-certificates dirmngr sudo apt-key adv --keyserver hkp://keyserver.ubuntu.com:80 --recv 8919F6BD2B48D754 echo "deb https://packages.clickhouse.com/deb stable main" | sudo tee /etc/apt/sources.list.d/clickhouse.lis clickhouse.com ClickHouse® is a column-oriented database management system (D.. 2022. 10. 19.
Apache Pinot https://docs.pinot.apache.org/basics/concepts Concepts - Apache Pinot Docs In contrast to RDBMS schemas, multiple tables in Pinot (real-time or batch) can inherit a single schema definition. Tables are independently configured for concerns such as indexing strategies, partitioning, tenants, data sources, and/or replication. docs.pinot.apache.org Pinot is designed to deliver low latency queries o.. 2022. 10. 19.