FreeBSD.software

arrow-datafusion

45.0.0_8 (misc)

Apache Arrow DataFusion SQL Query Engine

DataFusion is an extensible query planning, optimization, and execution framework, written in Rust, that uses Apache Arrow as its in-memory format.

Features:

- SQL query planner with support for multiple SQL dialects
- DataFrame API
- Parquet, CSV, JSON, and Avro file formats are supported natively. Custom file formats can be supported by implementing a `TableProvider` trait.
- Supports popular object stores, including AWS S3, Azure Blob Storage, and Google Cloud Storage. There are extension points for implementing custom object stores.

Use Cases:

DataFusion is modular in design with many extension points and can be used without modification as an embedded query engine and can also provide a foundation for building new systems. Here are some example use cases:

- DataFusion can be used as a SQL query planner and query optimizer, providing optimized logical plans that can then be mapped to other execution engines.
- DataFusion is used to create modern, fast and efficient data pipelines, ETL processes, and database systems, which need the performance of Rust and Apache Arrow and want to provide their users the convenience of an SQL interface or a DataFrame API.

$ pkg install arrow-datafusion
arrow.apache.org/datafusion
Origin: misc/arrow-datafusion
Size: 46.9MiB
License: APACHE20
Maintainer: yuri@FreeBSD.org
Dependencies: 1 package
Required by: 0 packages

Dependencies (1)