Apache ORC (Optimized Row Columnar) is a self-describing, type-aware columnar file format for large-scale data processing. It provides efficient compression, predicate pushdown, and indexing for analytic workloads.
Apache ORC (Optimized Row Columnar) is a self-describing, type-aware columnar file format for large-scale data processing. It provides efficient compression, predicate pushdown, and indexing for analytic workloads.