Releases: StarRocks/starrocks
3.2.14
3.2.14
Release date: January 8, 2025
Improvements
- Supports collecting statistics of Paimon tables. #52858
- Included node information and histogram metrics in JSON metrics. #53735
Bug Fixes
Fixed the following issues:
- The score of the Primary Key table index was not updated in the Commit phase. #41737
- Incorrect execution plans for
max(count(distinct))
when low-cardinality optimization is enabled. #53403 - When the List partition column has NULL values, queries against the Min/Max value of the partition column will lead to incorrect partition pruning. #53235
- Upload retries fail when backing up data to HDFS. #53679
3.3.8
3.3.8
Release date: January 3, 2025
Improvements
- Added a cluster idle API to assist in determining cluster status. #53850
- Included node information and histogram metrics in JSON metrics. #53735
- Optimized the MemTable for Primary Key tables in shared-data clusters. #54178
- Optimized memory usage and statistics for Primary Key tables in shared-data clusters. #54358
- Introduced a limit on the number of partitions scanned per node for queries requiring full-table or large-scale partition scans, enhancing system stability by reducing scanning pressure on individual BE or CN nodes. #53747
- Supports collecting statistics of Paimon tables. #52858
- Supports configuration of S3 client request timeout for shared-data clusters. #54211
Bug Fixes
Fixed the following issues:
- BE crashes caused by inconsistencies in the DelVec of Primary Key tables. #53460
- Issues with lock release of Primary Key tables in shared-data clusters. #53878
- Errors of UDFs nested in functions are not returned in query failures. #44297
- Transactions are blocked at the Decommission phase because they depend on the original replicas. #49349
- Queries against Delta Lake tables use relative paths instead of filenames for file retrieval. #53949
- An error is returned when querying Delta Lake Shallow Clone tables. #54044
- Case sensitivity issues when reading Paimon using JNI. #54041
- An error is returned during
INSERT OVERWRITE
operations on Hive tables created in Hive. #53792 SHOW TABLE STATUS
command does not validate view privileges. #53811- Missing FE metrics. #53058
- Memory leaks in
INSERT
tasks. #53809 - Concurrency issues caused by missing write locks in replication tasks. #54061
partition_ttl
of tables in thestatistics
database does not take effect. #54398- Query Cache-related issues:
- Issues with materialized view Union Rewrite. #54293
- Missing padding in string updates for partial updates in Primary Key tables. #54182
- Incorrect execution plans for
max(count(distinct))
when low-cardinality optimization is enabled. #53403 - Issues with changing the
excluded_refresh_tables
parameter of materialized views. #53394
Behavior Changes
- Changed the default value of
persistent_index_type
for Primary Key tables in shared-data clusters toCLOUD_NATIVE
, that is, enabled Persistent Index by default. #52209
3.1.17
Release Date: January 3, 2025
Bug Fixes
Fixed the following issues:
- Cross-cluster Data Migration Tool caused the Follower FE to crash during data synchronization and commit, due to not accounting for the deletion of partitions in the target cluster. #54061
- BE in the target cluster might crash when synchronizing tables with DELETE operations using Cross-cluster Data Migration Tool. #54081
- A bug in the BDBJE handshake mechanism where Leader FE would reject reconnection attempts from Follower FE when connection is being re-established, causing Follower FE nodes to exit. #50412
- Duplicate memory statistics in FE leads to excessive memory usage. #53055
- The statuses of the asynchronous materialized view refresh tasks are inconsistent across multiple FE nodes, which lead to inaccurate states of the materialized view during queries. #54236
3.2.13
Release date: December 13, 2024
Improvements
- Supports setting a time range within which Base Compaction is forbidden for a specific table. #50120
Bug Fixes
Fixed the following issues:
- The
loadRowsRate
field returned0
after executing SHOW ROUTINE LOAD. #52151 - The
Files()
function read columns that were not queried. #52210 - Prometheus failed to parse materialized view metrics with special characters in their names. (Now materialized view metrics support tags.) #52782
- The
array_map
function caused BE to crash. #52909 - Metadata Cache issues caused BE to crash. #52968
- Routine Load tasks were canceled due to expired transactions. (Now tasks are canceled only if the database or table no longer exists). #50334
- Stream Load failures when submitted using HTTP 1.0. #53010 #53008
- Issues related to Glue and S3 integration: #48433
- Some error messages did not display the root cause.
- Error messages for writing to a Hive partitioned table with the partition column of type STRING when Glue was used as the metadata service.
- Dropping Hive tables failed without proper error messages when the user lacked sufficient permissions.
- The
storage_cooldown_time
property for materialized views did not take effect when set tomaximum
. #52079
3.1.16
Release date: December 16, 2024
Improvements
- Optimized table-related statistics. #50316
Bug Fixes
Fixed the following issues:
- Insufficient granularity in error code handling for disk full scenarios caused the BE to mistakenly identify disk errors and delete data. #51411
- Stream Load failures when submitted using HTTP 1.0. #53010 #53008
- Routine Load tasks were canceled due to expired transactions (now tasks are canceled only if the database or table no longer exists and paused when transactions expired). #50334
- Unloading data using
EXPORT
with Broker tofile://
resulted in a file rename error, causing the export to fail. #52544 - If the join condition in an equal-join is an expression based on a low-cardinality column, the system may incorrectly push down a Runtime Filter predicate, leading to a BE crash. #50690
3.3.7
3.3.7
Release date: November 29, 2024
New Features
- Added a new Materialized View parameter,
excluded_refresh_tables
, exclude tables that need to be refreshed. #50926
Improvements
- Rewrote
unnest(bitmap_to_array)
asunnest_bitmap
to improve performance. #52870 - Reduced the write and delete operations of Txn logs. #42542
Bug Fixes
Fixed the following issues:
- Failure to connect Power BI to external tables. #52977
- Misleading FE Thrift RPC failure messages in logs. #52706
- Routine Load tasks were canceled due to expired transactions (now tasks are canceled only if the database or table no longer exists). #50334
- Stream Load failures when submitted using HTTP 1.0. #53010 #53008
- Integer overflow of partition IDs. #52965
- Hive Text Reader failed to recognize the last empty element. #52990
- Issues caused by
array_map
in Join conditions. #52911 - Metadata cache issues under high concurrency scenarios. #52968
- The whole materialized view was refreshed when a partition was dropped from the base table. #52740
3.3.6
3.3.6
Release date: November 18, 2024
Improvements
- Optimized internal repair logic for Primary Key tables. #52707
- Optimized the internal implementation of histograms of statistics. #52400
- Supports adjusting log level via the FE configuration item
sys_log_warn_modules
to reduce Hudi Catalog logging. #52709 - Supports constant folding in the
yearweek
function. #52714 - Avoided push-down for Lambda functions. #52655
- Divided the Query Error metric into three: Internal Error Rate, Analysis Error Rate, and Timeout Rate. #52646
- Avoided constant expressions being extracted as common expressions within
array_map
. #52541 - Optimized the Text-based Rewrite of materialized views. #52498
Bug Fixes
Fixed the following issues:
- The
unique_constraints
andforeign_constraints
parameters were incomplete in SHOW CREATE TABLE for cloud-native tables in shared-data clusters. #52804 - Some materialized views were activated even when
enable_mv_automatic_active_check
was set tofalse
. #52799 - Memory usage is not reducing after stale memory flush. #52613
- Resource leak caused by Hudi file-system views. #52738
- Concurrent Publish and Update operations on Primary Key tables may cause issues. #52687
- Failures to terminate queries on clients. #52185
- Multi-column List partitions cannot be pushed down. #51036
- Incorrect result due to the lack of
hasnull
property in ORC files. #52555 - An issue caused by using uppercase column names in ORDER BY during table creation. #52513
- An error was returned after running
ALTER TABLE PARTITION (*) SET ("storage_cooldown_ttl" = "xxx")
. #52482
Behavior Changes
-
In earlier versions, scale-in operations would fail if there were insufficient replicas for views in the
_statistics_
database. Starting from v3.3.6, if nodes are scaled in to 3 or more, view replicas are set to 3; if there is only 1 node after the scale-in, view replicas are set to 1, allowing for successful scale-in. #51799Affected views include:
column_statistics
histogram_statistics
table_statistic_v1
external_column_statistics
external_histogram_statistics
pipe_file_list
loads_history
task_run_history
-
New Primary Key tables no longer allow
__op
as a column name, even ifallow_system_reserved_names
is set totrue
. Existing tables are unaffected. #52621 -
Expression-partitioned tables cannot have partition names modified. #52557
-
Deprecated FE parameters
heartbeat_mgr_blocking_queue_size
andprofile_process_threads_num
. #52236 -
Enabled persistent index on object storage by default for Primary Key tables in shared-data clusters. #52209
-
Disallowed manual changes to bucketing methods for tables with the random bucketing method. #52120
-
Backup and Restore-related parameter changes: #52111
make_snapshot_worker_count
supports dynamic configuration.release_snapshot_worker_count
supports dynamic configuration.upload_worker_count
supports dynamic configuration. Its default value is changed from1
to the number of CPU cores on the machine where the BE resides.download_worker_count
supports dynamic configuration. Its default value is changed from1
to the number of CPU cores on the machine where the BE resides.
-
The return type of
SELECT @@autocommit
has changed from BOOLEAN to BIGINT. #51946 -
Added a new FE configuration item,
max_bucket_number_per_partition
, to control the maximum number of buckets per partition. #47852 -
Enabled memory usage checks by default for Primary Key tables. #52393
3.2.12
Release date: October 23, 2024
Improvements
- Optimized memory allocation and statistics in BE for certain complex query scenarios to avoid OOM. #51382
- Optimized memory usage in FE in Schema Change scenarios. #50855
- Optimized the job status display when querying the system-defined view information_schema.routine_load_jobs from Follower FE nodes. #51763
- Supports Backup and Restore of with the List partitioned tables. #51993
Bug Fixes
Fixed the following issues:
- The error message was lost after writing to Hive failed. #33167
- The array_map function causes a crash when excessive constant parameters are used. #51244
- Special characters in the PARTITION BY columns of expression partitioned tables cause FE CheckPoint failures. #51677
- Accessing the system-defined view information_schema.fe_locks causes a crash. #51742
- Querying generated columns causes an error. #51755
- Optimize Table fails when the table name contains special characters. #51755
- Tablets could not be balanced in certain scenarios. #51828
Behavior Changes
- Supports dynamic modification of Backup and Restore-related parameters.#52111
3.3.5
3.3.5
Release date: October 23, 2024
New Features
- Supports millisecond and microsecond precision in the DATETIME type.
- Resource groups support CPU hard isolation.
Improvements
- Optimized performance and extraction strategy for Flat JSON. #50696
- Reduced memory usage for the following ARRAY functions:
- Optimized error messages when loading
Null
values into List partition keys with theNot Null
attribute. #51086 - Optimized error messages for Files() when authentication fails in the Files function. #51697
- Optimized internal statistics for
INSERT OVERWRITE
. #50417 - Shared-data clusters support garbage collection (GC) for persistent index files. #51684
- Added FE logs to help diagnose FE out-of-memory (OOM) issues. #51528
- Supports recovering metadata from the metadata directory of FE. #51040
Bug Fixes
Fixed the following issues:
- A deadlock issue caused by PIPE exceptions. #50841
- Dynamic partition creation failures block subsequent partition creation. #51440
- An error is returned for
UNION ALL
queries withORDER BY
. #51647 - CTE in UPDATE statements causes hints to be ignored. #51458
- The
load_finish_time
field in the system-defined viewstatistics.loads_history
does not update as expected after a loading task is completed. #51174 - UDTF mishandles multibyte UTF-8 characters. #51232
Behavior Changes
- Modified the return content of the
EXPLAIN
statement. After the change, the return content is equivalent toEXPLAIN COST
. You can configure the level of details returned byEXPLAIN
using the dynamic FE parameterquery_detail_explain_level
. The default value isCOSTS
, with other valid values beingNORMAL
andVERBOSE
. #51439
3.3.4
3.3.4
Release date: September 30, 2024
New Features
- Supports creating asynchronous materialized views on List Partition tables. #46680 #46808
- List Partition tables now support Nullable partition columns. #47797
- Supports viewing external file schema information using
DESC FILES()
. #50527 - Supports viewing replication task metrics via
SHOW PROC '/replications'
. #50483
Improvements
- Optimized data recycling performance for
TRUNCATE TABLE
in shared-data clusters. #49975 - Supports intermediate result spilling for CTE operators. #47982
- Supports adaptive phased scheduling to alleviate OOM issues caused by complex queries. #47868
- Supports predicate pushdown for STRING-type date or datatime columns in specific scenarios. #50643
- Supports COUNT DISTINCT computation on constant semi-structured data. #48273
- Added a new FE parameter
lake_enable_balance_tablets_between_workers
to enable tablet balancing for tables in shared-date clusters. #50843 - Enhanced query rewrite capabilities for generated columns. #50398
- Partial Update now supports automatically populating columns with default values of
CURRENT_TIMESTAMP
. #50287
Bug Fixes
Fixed the following issues:
- The error "version has been compacted" caused by an infinite loop on the FE side during Tablet Clone. #50561
- ISO- formatted DATETIME types cannot be pushed down. #49358
- In concurrent scenarios, data still existed after the tablet was deleted. #50382
- Incorrect results returned by the
yearweek
function. #51065 - An issue with low cardinality dictionaries in ARRAY during CTE queries. #51148
- After FE restarts, partition TTL-related parameters were lost for materialized views. #51028
- Data loss in columns defined with
CURRENT_TIMESTAMP
after upgrading. #50911 - A stack overflow caused by the
array_distinct
function. #51017 - Activation failures for materialized views after upgrading due to changes in default field lengths. You can avoid such issues by setting
enable_active_materialized_view_schema_strict_check
tofalse
. #50869 - Resource group property
cpu_weight
can be set to a negative value. #51005 - Incorrect statistics for disk capacity information. #50669
- Constant fold in the
replace
function. #50828
Behavior Changes
- Changed the default replica number for external catalog-based materialized views from
1
to the value of the FE parameterdefault_replication_num
(Default value:3
). #50931