Skip to content

Latest commit

 

History

History
180 lines (160 loc) · 28.5 KB

CHANGELOG.md

File metadata and controls

180 lines (160 loc) · 28.5 KB

Changelog

53.0.0 (2024-08-31)

Full Changelog

Breaking changes:

Implemented enhancements:

  • Derive PartialEq and Eq for parquet::arrow::ProjectionMask #6329 [parquet]
  • Allow converting empty pyarrow.RecordBatch to arrow::RecordBatch #6318 [arrow]
  • Parquet writer should not write any min/max data to ColumnIndex when all values are null #6315 [parquet]
  • Parquet: Add union method to RowSelection #6307 [parquet]
  • Support writing UTC adjusted time arrow array to parquet #6277 [parquet]
  • A better way to resize the buffer for the snappy encode/decode #6276 [parquet]
  • parquet_derive: support reading selected columns from parquet file #6268
  • Tests for invalid parquet files #6261 [parquet]
  • Implement date_part for Duration #6245 [arrow]
  • Avoid unnecessary null buffer construction when converting arrays to a different type #6243 [parquet] [arrow]
  • Add parquet_opendal in related projects #6235
  • Look into optimizing reading FixedSizeBinary arrays from parquet #6219 [parquet] [arrow]
  • Add benchmarks for BYTE_STREAM_SPLIT encoded Parquet FIXED_LEN_BYTE_ARRAY data #6203 [parquet]
  • Make it easy to write parquet to object_store -- Implement AsyncFileWriter for a type that implements obj_store::MultipartUpload for AsyncArrowWriter #6200 [parquet]
  • Remove test duplication in parquet statistics tets #6185 [parquet]
  • Support BinaryView Types in C Schema FFI #6170 [arrow]
  • speedup take_byte_view kernel #6167 [arrow]
  • Add support for StringView and BinaryView statistics in StatisticsConverter #6164 [parquet]
  • Support casting BinaryView --> Utf8 and LargeUtf8 #6162 [arrow]
  • Implement filter kernel specially for FixedSizeByteArray #6153 [arrow]
  • Use LevelHistogram throughout Parquet metadata #6134 [parquet]
  • Support DoPutStatementIngest from Arrow Flight SQL 17.0 #6124 [arrow] [arrow-flight]
  • ColumnMetaData should no longer be written inline with data #6115 [parquet]
  • Implement date_part for Interval #6113 [arrow]
  • Implement Into<Arc<dyn Array>> for ArrayData #6104
  • Allow flushing or non-buffered writes from arrow::ipc::writer::StreamWriter #6099 [arrow]
  • Default block_size for StringViewArray #6094 [arrow]
  • Remove Statistics::has_min_max_set and ValueStatistics::has_min_max_set and use Option instead #6093 [parquet]
  • Upgrade arrow-flight to tonic 0.12 #6072
  • Improve speed of row converter by skipping utf8 checks #6058 [arrow]
  • Extend support for BYTE_STREAM_SPLIT to FIXED_LEN_BYTE_ARRAY, INT32, and INT64 primitive types #6048 [parquet]
  • Release arrow-rs / parquet minor version 52.2.0 (August 2024) #5998 [parquet] [arrow]

Fixed bugs:

  • Invalid ColumnIndex written in parquet #6310 [parquet]
  • comparison_kernels benchmarks panic #6283 [arrow]
  • Printing schema metadata includes possibly incorrect compression level #6270 [parquet]
  • Don't panic when creating Field from FFI_ArrowSchema with no name #6251 [arrow]
  • lexsort_to_indices should not fallback to non-lexical sort if the datatype is not supported #6226 [arrow]
  • Parquet Statistics null_count does not distinguish between 0 and not specified #6215 [parquet]
  • Using a take kernel on a dense union can result in reaching "unreachable" code #6206 [arrow]
  • Adding sub day seconds to Date64 is ignored. #6198 [arrow]
  • mismatch between parquet type is_optional codes and comment #6191 [parquet]

Documentation updates:

Closed issues:

  • Simplify take octokit workflow #6279
  • Make the bearer token visible in FlightSqlServiceClient #6253 [arrow] [arrow-flight]
  • Port take workflow to use oktokit #6242
  • Remove SchemaBuilder dependency from StructArray constructors #6138 [arrow]

Merged pull requests:

* This Changelog was automatically generated by github_changelog_generator