Roadmap#

This page outlines the major projects that are actively in development and those planned for the near future.

In-Flight Projects#

CUDA GPU Support#

Adding support for GPU-accelerated compute over Vortex arrays using CUDA. This will allow encodings to be decompressed and evaluated directly on the GPU.

Lazy Array Evaluation#

Reworking array compute to use lazy evaluation, where operations build up an expression graph that is materialized on demand. This is a prerequisite for efficient GPU support, since it allows the runtime to batch and schedule work across devices.

Extension DTypes#

Introducing user-defined logical types that extend the built-in Vortex type system. Extension DTypes allow libraries and applications to register custom types with their own semantics while still benefiting from Vortex’s encoding, compression, and I/O infrastructure.

Upcoming Projects#

Scan API#

An abstract table-scan interface that positions Vortex as an interchange layer between data sources and query engines. The Scan API will support pluggable data sources and can be consumed over the C ABI for in-process integrations or over RPC for remote/distributed access.

Language Bindings Overhaul#

A comprehensive rework of the language bindings to expose plugin and extension points across the ecosystem:

  • Rust, C, C++, and Python will have first-class support for extending Vortex with custom encodings, compute functions, and DTypes.

  • Other languages (e.g. Java) will initially focus on reading and writing Arrow data to and from Vortex files.

Tensor Extension DType#

A built-in extension DType for multi-dimensional tensor data, enabling native support for fixed-shape and variable-shape tensors within Vortex arrays and files.

Variant DType#

A DType for representing arbitrarily nested, JSON-like data within Vortex arrays and files. This enables efficient columnar storage and querying of semi-structured data.