Skip to main content

Apache GraphAr

An open source, standard data file format for graph data storage and retrieval

Efficient format design

GraphAr is designed to be efficient for storage and retrieval of large-scale graph data with techniques such as chunking, columnar storage, and maintain CSR/CSC semantics.

Out-of-core queries

Designed for out-of-core scenarios, enabling the storage and querying of large-scale graphs outside of memory, such as in data lakes.

Cross-language support

Provides libraries in C++, Java, Scala with Spark, and Python with PySpark for generating, accessing, and transforming files in GraphAr format.