Skip to content

Releases: StarRocks/starrocks

3.2.14

09 Jan 02:34
b86884b
Compare
Choose a tag to compare

3.2.14

Release date: January 8, 2025

Improvements

  • Supports collecting statistics of Paimon tables. #52858
  • Included node information and histogram metrics in JSON metrics. #53735

Bug Fixes

Fixed the following issues:

  • The score of the Primary Key table index was not updated in the Commit phase. #41737
  • Incorrect execution plans for max(count(distinct)) when low-cardinality optimization is enabled. #53403
  • When the List partition column has NULL values, queries against the Min/Max value of the partition column will lead to incorrect partition pruning. #53235
  • Upload retries fail when backing up data to HDFS. #53679

3.3.8

06 Jan 01:54
e3816ec
Compare
Choose a tag to compare

3.3.8

Release date: January 3, 2025

Improvements

  • Added a cluster idle API to assist in determining cluster status. #53850
  • Included node information and histogram metrics in JSON metrics. #53735
  • Optimized the MemTable for Primary Key tables in shared-data clusters. #54178
  • Optimized memory usage and statistics for Primary Key tables in shared-data clusters. #54358
  • Introduced a limit on the number of partitions scanned per node for queries requiring full-table or large-scale partition scans, enhancing system stability by reducing scanning pressure on individual BE or CN nodes. #53747
  • Supports collecting statistics of Paimon tables. #52858
  • Supports configuration of S3 client request timeout for shared-data clusters. #54211

Bug Fixes

Fixed the following issues:

  • BE crashes caused by inconsistencies in the DelVec of Primary Key tables. #53460
  • Issues with lock release of Primary Key tables in shared-data clusters. #53878
  • Errors of UDFs nested in functions are not returned in query failures. #44297
  • Transactions are blocked at the Decommission phase because they depend on the original replicas. #49349
  • Queries against Delta Lake tables use relative paths instead of filenames for file retrieval. #53949
  • An error is returned when querying Delta Lake Shallow Clone tables. #54044
  • Case sensitivity issues when reading Paimon using JNI. #54041
  • An error is returned during INSERT OVERWRITE operations on Hive tables created in Hive. #53792
  • SHOW TABLE STATUS command does not validate view privileges. #53811
  • Missing FE metrics. #53058
  • Memory leaks in INSERT tasks. #53809
  • Concurrency issues caused by missing write locks in replication tasks. #54061
  • partition_ttl of tables in the statistics database does not take effect. #54398
  • Query Cache-related issues:
    • Crashes when Query Cache is enabled with Group Execution. #54363
    • Runtime Filter crashes. #54305
  • Issues with materialized view Union Rewrite. #54293
  • Missing padding in string updates for partial updates in Primary Key tables. #54182
  • Incorrect execution plans for max(count(distinct)) when low-cardinality optimization is enabled. #53403
  • Issues with changing the excluded_refresh_tables parameter of materialized views. #53394

Behavior Changes

  • Changed the default value of persistent_index_type for Primary Key tables in shared-data clusters to CLOUD_NATIVE, that is, enabled Persistent Index by default. #52209

3.1.17

03 Jan 11:19
67ae3b7
Compare
Choose a tag to compare

Release Date: January 3, 2025

Bug Fixes

Fixed the following issues:

  • Cross-cluster Data Migration Tool caused the Follower FE to crash during data synchronization and commit, due to not accounting for the deletion of partitions in the target cluster. #54061
  • BE in the target cluster might crash when synchronizing tables with DELETE operations using Cross-cluster Data Migration Tool. #54081
  • A bug in the BDBJE handshake mechanism where Leader FE would reject reconnection attempts from Follower FE when connection is being re-established, causing Follower FE nodes to exit. #50412
  • Duplicate memory statistics in FE leads to excessive memory usage. #53055
  • The statuses of the asynchronous materialized view refresh tasks are inconsistent across multiple FE nodes, which lead to inaccurate states of the materialized view during queries. #54236

3.2.13

13 Dec 06:58
f0965dc
Compare
Choose a tag to compare

Release date: December 13, 2024

Improvements

  • Supports setting a time range within which Base Compaction is forbidden for a specific table. #50120

Bug Fixes

Fixed the following issues:

  • The loadRowsRate field returned 0 after executing SHOW ROUTINE LOAD. #52151
  • The Files() function read columns that were not queried. #52210
  • Prometheus failed to parse materialized view metrics with special characters in their names. (Now materialized view metrics support tags.) #52782
  • The array_map function caused BE to crash. #52909
  • Metadata Cache issues caused BE to crash. #52968
  • Routine Load tasks were canceled due to expired transactions. (Now tasks are canceled only if the database or table no longer exists). #50334
  • Stream Load failures when submitted using HTTP 1.0. #53010 #53008
  • Issues related to Glue and S3 integration: #48433
    • Some error messages did not display the root cause.
    • Error messages for writing to a Hive partitioned table with the partition column of type STRING when Glue was used as the metadata service.
    • Dropping Hive tables failed without proper error messages when the user lacked sufficient permissions.
  • The storage_cooldown_time property for materialized views did not take effect when set to maximum. #52079

3.1.16

16 Dec 12:07
76526c0
Compare
Choose a tag to compare

Release date: December 16, 2024

Improvements

  • Optimized table-related statistics. #50316

Bug Fixes

Fixed the following issues:

  • Insufficient granularity in error code handling for disk full scenarios caused the BE to mistakenly identify disk errors and delete data. #51411
  • Stream Load failures when submitted using HTTP 1.0. #53010 #53008
  • Routine Load tasks were canceled due to expired transactions (now tasks are canceled only if the database or table no longer exists and paused when transactions expired). #50334
  • Unloading data using EXPORT with Broker to file:// resulted in a file rename error, causing the export to fail. #52544
  • If the join condition in an equal-join is an expression based on a low-cardinality column, the system may incorrectly push down a Runtime Filter predicate, leading to a BE crash. #50690

3.3.7

12 Dec 16:13
00177de
Compare
Choose a tag to compare

3.3.7

Release date: November 29, 2024

New Features

  • Added a new Materialized View parameter, excluded_refresh_tables, exclude tables that need to be refreshed. #50926

Improvements

  • Rewrote unnest(bitmap_to_array) as unnest_bitmap to improve performance. #52870
  • Reduced the write and delete operations of Txn logs. #42542

Bug Fixes

Fixed the following issues:

  • Failure to connect Power BI to external tables. #52977
  • Misleading FE Thrift RPC failure messages in logs. #52706
  • Routine Load tasks were canceled due to expired transactions (now tasks are canceled only if the database or table no longer exists). #50334
  • Stream Load failures when submitted using HTTP 1.0. #53010 #53008
  • Integer overflow of partition IDs. #52965
  • Hive Text Reader failed to recognize the last empty element. #52990
  • Issues caused by array_map in Join conditions. #52911
  • Metadata cache issues under high concurrency scenarios. #52968
  • The whole materialized view was refreshed when a partition was dropped from the base table. #52740

3.3.6

20 Nov 06:38
8f01cfa
Compare
Choose a tag to compare

3.3.6

Release date: November 18, 2024

Improvements

  • Optimized internal repair logic for Primary Key tables. #52707
  • Optimized the internal implementation of histograms of statistics. #52400
  • Supports adjusting log level via the FE configuration item sys_log_warn_modules to reduce Hudi Catalog logging. #52709
  • Supports constant folding in the yearweek function. #52714
  • Avoided push-down for Lambda functions. #52655
  • Divided the Query Error metric into three: Internal Error Rate, Analysis Error Rate, and Timeout Rate. #52646
  • Avoided constant expressions being extracted as common expressions within array_map. #52541
  • Optimized the Text-based Rewrite of materialized views. #52498

Bug Fixes

Fixed the following issues:

  • The unique_constraints and foreign_constraints parameters were incomplete in SHOW CREATE TABLE for cloud-native tables in shared-data clusters. #52804
  • Some materialized views were activated even when enable_mv_automatic_active_check was set to false. #52799
  • Memory usage is not reducing after stale memory flush. #52613
  • Resource leak caused by Hudi file-system views. #52738
  • Concurrent Publish and Update operations on Primary Key tables may cause issues. #52687
  • Failures to terminate queries on clients. #52185
  • Multi-column List partitions cannot be pushed down. #51036
  • Incorrect result due to the lack of hasnull property in ORC files. #52555
  • An issue caused by using uppercase column names in ORDER BY during table creation. #52513
  • An error was returned after running ALTER TABLE PARTITION (*) SET ("storage_cooldown_ttl" = "xxx"). #52482

Behavior Changes

  • In earlier versions, scale-in operations would fail if there were insufficient replicas for views in the _statistics_ database. Starting from v3.3.6, if nodes are scaled in to 3 or more, view replicas are set to 3; if there is only 1 node after the scale-in, view replicas are set to 1, allowing for successful scale-in. #51799

    Affected views include:

    • column_statistics
    • histogram_statistics
    • table_statistic_v1
    • external_column_statistics
    • external_histogram_statistics
    • pipe_file_list
    • loads_history
    • task_run_history
  • New Primary Key tables no longer allow __op as a column name, even if allow_system_reserved_names is set to true. Existing tables are unaffected. #52621

  • Expression-partitioned tables cannot have partition names modified. #52557

  • Deprecated FE parameters heartbeat_mgr_blocking_queue_size and profile_process_threads_num. #52236

  • Enabled persistent index on object storage by default for Primary Key tables in shared-data clusters. #52209

  • Disallowed manual changes to bucketing methods for tables with the random bucketing method. #52120

  • Backup and Restore-related parameter changes: #52111

    • make_snapshot_worker_count supports dynamic configuration.
    • release_snapshot_worker_count supports dynamic configuration.
    • upload_worker_count supports dynamic configuration. Its default value is changed from 1 to the number of CPU cores on the machine where the BE resides.
    • download_worker_count supports dynamic configuration. Its default value is changed from 1 to the number of CPU cores on the machine where the BE resides.
  • The return type of SELECT @@autocommit has changed from BOOLEAN to BIGINT. #51946

  • Added a new FE configuration item, max_bucket_number_per_partition, to control the maximum number of buckets per partition. #47852

  • Enabled memory usage checks by default for Primary Key tables. #52393

3.2.12

14 Nov 07:15
5f81e3e
Compare
Choose a tag to compare

Release date: October 23, 2024

Improvements

  • Optimized memory allocation and statistics in BE for certain complex query scenarios to avoid OOM. #51382
  • Optimized memory usage in FE in Schema Change scenarios. #50855
  • Optimized the job status display when querying the system-defined view information_schema.routine_load_jobs from Follower FE nodes. #51763
  • Supports Backup and Restore of with the List partitioned tables. #51993

Bug Fixes

Fixed the following issues:

  • The error message was lost after writing to Hive failed. #33167
  • The array_map function causes a crash when excessive constant parameters are used. #51244
  • Special characters in the PARTITION BY columns of expression partitioned tables cause FE CheckPoint failures. #51677
  • Accessing the system-defined view information_schema.fe_locks causes a crash. #51742
  • Querying generated columns causes an error. #51755
  • Optimize Table fails when the table name contains special characters. #51755
  • Tablets could not be balanced in certain scenarios. #51828

Behavior Changes

  • Supports dynamic modification of Backup and Restore-related parameters.#52111

3.3.5

24 Oct 03:08
6d81f75
Compare
Choose a tag to compare

3.3.5

Release date: October 23, 2024

New Features

  • Supports millisecond and microsecond precision in the DATETIME type.
  • Resource groups support CPU hard isolation.

Improvements

  • Optimized performance and extraction strategy for Flat JSON. #50696
  • Reduced memory usage for the following ARRAY functions:
  • Optimized error messages when loading Null values into List partition keys with the Not Null attribute. #51086
  • Optimized error messages for Files() when authentication fails in the Files function. #51697
  • Optimized internal statistics for INSERT OVERWRITE. #50417
  • Shared-data clusters support garbage collection (GC) for persistent index files. #51684
  • Added FE logs to help diagnose FE out-of-memory (OOM) issues. #51528
  • Supports recovering metadata from the metadata directory of FE. #51040

Bug Fixes

Fixed the following issues:

  • A deadlock issue caused by PIPE exceptions. #50841
  • Dynamic partition creation failures block subsequent partition creation. #51440
  • An error is returned for UNION ALL queries with ORDER BY. #51647
  • CTE in UPDATE statements causes hints to be ignored. #51458
  • The load_finish_time field in the system-defined view statistics.loads_history does not update as expected after a loading task is completed. #51174
  • UDTF mishandles multibyte UTF-8 characters. #51232

Behavior Changes

  • Modified the return content of the EXPLAIN statement. After the change, the return content is equivalent to EXPLAIN COST. You can configure the level of details returned by EXPLAIN using the dynamic FE parameter query_detail_explain_level. The default value is COSTS, with other valid values being NORMAL and VERBOSE. #51439

3.3.4

30 Sep 08:24
56bcf6f
Compare
Choose a tag to compare

3.3.4

Release date: September 30, 2024

New Features

  • Supports creating asynchronous materialized views on List Partition tables. #46680 #46808
  • List Partition tables now support Nullable partition columns. #47797
  • Supports viewing external file schema information using DESC FILES(). #50527
  • Supports viewing replication task metrics via SHOW PROC '/replications'. #50483

Improvements

  • Optimized data recycling performance for TRUNCATE TABLE in shared-data clusters. #49975
  • Supports intermediate result spilling for CTE operators. #47982
  • Supports adaptive phased scheduling to alleviate OOM issues caused by complex queries. #47868
  • Supports predicate pushdown for STRING-type date or datatime columns in specific scenarios. #50643
  • Supports COUNT DISTINCT computation on constant semi-structured data. #48273
  • Added a new FE parameter lake_enable_balance_tablets_between_workers to enable tablet balancing for tables in shared-date clusters. #50843
  • Enhanced query rewrite capabilities for generated columns. #50398
  • Partial Update now supports automatically populating columns with default values of CURRENT_TIMESTAMP. #50287

Bug Fixes

Fixed the following issues:

  • The error "version has been compacted" caused by an infinite loop on the FE side during Tablet Clone. #50561
  • ISO- formatted DATETIME types cannot be pushed down. #49358
  • In concurrent scenarios, data still existed after the tablet was deleted. #50382
  • Incorrect results returned by the yearweek function. #51065
  • An issue with low cardinality dictionaries in ARRAY during CTE queries. #51148
  • After FE restarts, partition TTL-related parameters were lost for materialized views. #51028
  • Data loss in columns defined with CURRENT_TIMESTAMP after upgrading. #50911
  • A stack overflow caused by the array_distinct function. #51017
  • Activation failures for materialized views after upgrading due to changes in default field lengths. You can avoid such issues by setting enable_active_materialized_view_schema_strict_check to false. #50869
  • Resource group property cpu_weight can be set to a negative value. #51005
  • Incorrect statistics for disk capacity information. #50669
  • Constant fold in the replace function. #50828

Behavior Changes

  • Changed the default replica number for external catalog-based materialized views from 1 to the value of the FE parameter default_replication_num (Default value: 3). #50931