The Apache Arrow C++ library is the foundation of the Arrow project, providing high-performance data structures and algorithms for columnar data processing.
Pre-compiled Binaries
The easiest way to install Apache Arrow C++ is to use pre-compiled binary packages. For the most up-to-date installation instructions and packages, visit the official Apache Arrow installation page.
System Package Managers
Install Arrow C++ using your system’s package manager.
Verifying Installation
After installation, verify that Arrow C++ is correctly installed.
Check pkg-config
Query pkg-config for the Arrow package configuration. The query should output the installed Arrow version.
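The check described above can be sketched as a small shell function. This is a minimal sketch, assuming that Arrow registered its pkg-config module under the name `arrow`; the function name `check_arrow_pkgconfig` is hypothetical and used only for illustration.

```shell
# Print the Arrow C++ version known to pkg-config, or explain why it cannot.
# Assumption: the pkg-config module is named "arrow".
check_arrow_pkgconfig() {
  if ! command -v pkg-config >/dev/null 2>&1; then
    echo "pkg-config is not installed" >&2
    return 1
  fi
  if ! pkg-config --exists arrow; then
    echo "pkg-config cannot find the 'arrow' module" >&2
    return 1
  fi
  # --modversion prints the version recorded in the module's .pc file.
  pkg-config --modversion arrow
}
```

Calling `check_arrow_pkgconfig` after installation prints a version string on success and a short diagnostic on standard error otherwise.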
Building from Source
For development or custom configurations, you can build Arrow C++ from source.
As of Arrow 23.0.0, building from source requires a C++20-compatible compiler:
- GCC 11 or later
- Clang 12 or later
- MSVC 2019 16.10 or later
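Before starting a source build, it may help to confirm that the compiler on your `PATH` meets the minimums listed above. The following is a rough sketch only: the helper names `major_at_least` and `report_compiler` are hypothetical, and only the major version reported by `-dumpversion` is compared.

```shell
# Succeed when the major component of a dotted version is >= the minimum.
# Exits with status 2 when the major component is not a plain number.
major_at_least() (
  major=${1%%.*}
  case $major in
    ''|*[!0-9]*) exit 2 ;;
  esac
  [ "$major" -ge "$2" ]
)

# Report whether the named compiler looks new enough.
# $1: compiler command (for example g++), $2: minimum major version.
report_compiler() {
  if ! command -v "$1" >/dev/null 2>&1; then
    echo "$1: not found on PATH"
  elif major_at_least "$("$1" -dumpversion)" "$2"; then
    echo "$1: major version is at least $2"
  else
    echo "$1: older than $2, or version not recognized"
  fi
}
```

For example, `report_compiler g++ 11` and `report_compiler clang++ 12` check the GCC and Clang minimums. On macOS, `g++` is often an alias for Apple Clang, whose version numbers do not map onto upstream Clang releases, so treat the result as a hint rather than a guarantee.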
Common Features
When installing Arrow C++, you can enable additional features:
- Parquet: Read/write Apache Parquet files
- Dataset API: Work with multi-file datasets
- Flight RPC: High-performance data transport
- Compute: Expression evaluation and aggregations
- CUDA: GPU acceleration support
- S3: Amazon S3 filesystem support
- GCS: Google Cloud Storage support
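When building from source, each of the features above corresponds to an Arrow CMake option. The sketch below maps lowercase feature names to those options; the function name `arrow_feature_flags` and the lowercase keys are assumptions made for illustration, while the `-DARROW_*` names are Arrow's CMake options.

```shell
# Print one CMake option per requested feature, one per line.
# Exits non-zero on an unrecognized feature name.
arrow_feature_flags() (
  for feature in "$@"; do
    case $feature in
      parquet) printf '%s\n' -DARROW_PARQUET=ON ;;
      dataset) printf '%s\n' -DARROW_DATASET=ON ;;
      flight)  printf '%s\n' -DARROW_FLIGHT=ON ;;
      compute) printf '%s\n' -DARROW_COMPUTE=ON ;;
      cuda)    printf '%s\n' -DARROW_CUDA=ON ;;
      s3)      printf '%s\n' -DARROW_S3=ON ;;
      gcs)     printf '%s\n' -DARROW_GCS=ON ;;
      *) echo "unknown feature: $feature" >&2; exit 1 ;;
    esac
  done
)
```

Assuming a checkout of the Arrow repository in a directory named `arrow`, a configure step might then look like `cmake -S arrow/cpp -B build $(arrow_feature_flags parquet dataset)`.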
Troubleshooting
pkg-config not finding arrow
Ensure that PKG_CONFIG_PATH includes the directory that holds Arrow's .pc files, usually lib/pkgconfig under the Arrow installation prefix.
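One way to do that without adding duplicate entries is sketched below. The function name `add_pkg_config_dir` is hypothetical, and the directory you pass depends on where Arrow was installed on your system.

```shell
# Prepend a directory to PKG_CONFIG_PATH unless it is already listed.
add_pkg_config_dir() {
  case ":${PKG_CONFIG_PATH:-}:" in
    *":$1:"*) ;;  # already present, nothing to do
    *) PKG_CONFIG_PATH="$1${PKG_CONFIG_PATH:+:$PKG_CONFIG_PATH}" ;;
  esac
  export PKG_CONFIG_PATH
}
```

For an installation under `/usr/local` (an assumed prefix), you would run `add_pkg_config_dir /usr/local/lib/pkgconfig` and then retry the pkg-config query in the same shell.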
Linking errors when compiling
Make sure the link command includes every Arrow library your code uses, for example both Arrow and Parquet when you read Parquet files.
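Rather than listing libraries by hand, you can ask pkg-config for the flags. The following is a sketch under the assumption that the relevant modules (such as `arrow` and `parquet`) are visible to pkg-config; the function name `arrow_build_flags` is hypothetical.

```shell
# Print compiler and linker flags for the given pkg-config modules.
# Fails early, naming the first module that pkg-config cannot find.
arrow_build_flags() {
  for module in "$@"; do
    if ! pkg-config --exists "$module"; then
      echo "pkg-config module not found: $module" >&2
      return 1
    fi
  done
  pkg-config --cflags --libs "$@"
}
```

A compile line could then read `g++ -std=c++20 example.cc $(arrow_build_flags arrow parquet) -o example`, where `example.cc` stands in for your own source file. If the function reports a missing module, the corresponding Arrow component is probably not installed or not on PKG_CONFIG_PATH.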
Version conflicts
Remove old Arrow installations before installing a new version, so that headers and libraries from different releases are not mixed.
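To see whether an older copy is still present, it can help to list the Arrow shared libraries in the usual library directories first. This is only a sketch: the function name `find_arrow_libs` is hypothetical, and the directories to search are assumptions that vary by platform.

```shell
# List Arrow shared libraries found under the given directories.
# Directories that do not exist are skipped silently.
find_arrow_libs() {
  for dir in "$@"; do
    [ -d "$dir" ] || continue
    find "$dir" -name 'libarrow*.so*' -o -name 'libarrow*.dylib'
  done
}
```

For example, `find_arrow_libs /usr/lib /usr/local/lib` prints any matching files. How you remove them depends on how they were installed: use the same package manager for packaged installs, or delete the files from the install prefix for source builds.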
Next Steps
Now that you have Arrow C++ installed:
- Explore the Arrow C++ Examples
- Read the C++ API Documentation
- Learn about Memory Management
- Try working with Parquet Files