Optimizing query performance
Starburst Galaxy and Starburst Enterprise platform (SEP) are fast, but there are still many opportunities to make them even faster, depending on how you write your queries.
Learn how to use EXPLAIN and ANALYZE to improve your query performance in this training video presented by one of our founders, Martin Traverso. For your convenience, we’ve divided the video training course up into topic sections, and provided links to the relevant parts of our documentation below.
The query lifecycle #
Knowing what’s happening under the hood in SQL can help you to write queries that capitalize on possible optimizations and avoid approaches that will cost you performance. This section provides an overview of what happens as a query is executed.
Topics:
Parsing Analysis Planning Optimization Scheduling and executionRunning time: ~12 min.
The EXPLAIN statement in detail #
If you want to understand what the Trino query engine is basing its decisions on as it executes a query, use the EXPLAIN statement. This section walks you through this very informative tool in detail.
Topics:
EXPLAIN (Starburst Galaxy open_in_new or SEP open_in_new) EXPLAIN vs EXPLAIN ANALYZE (Starburst Galaxy open_in_new or SEP open_in_new) Fragment structure, distribution, row layout, estimates, and performance stats in EXPLAIN ANALYZE (SEP open_in_new) Exchanges (SEP open_in_new)Click the links to read more on that topic in our reference manuals. Some considerations, such as exchanges, are handled for you by Starburst Galaxy.
Running time: ~20 min.
General optimizations #
The content in this section is more technique-oriented, and is a complex subject. We strongly suggest watching it all the way through thoroughly first to gain a broad awareness of how you write a query can affect its performance before trying these on your own. For further reading, we recommend our SEP pushdown documentation.
The SQL engine relies on table statistics to make decisions on optimizations. Enabling dynamic filtering can take optimizations even further. We recommend reading about these powerful features to ensure you are getting the best performance possible out of your SEP cluster. With Starburst Galaxy, this is handled for you:
Dynamic filtering Table statistics
Topics:
Constant folding Predicate pushdown Predicate pushdown into the Hive connector Hive partition pruning Hive bucket pruning Row group skipping for ORC and Parquet Limit, partial limit, and aggregation pushdown SkewRunning time: ~58 min.
SEP offers several properties to control how the optimizer handles certain operations. With Starburst Galaxy, this is handled for you.
Cost-based optimizations #
This section presents on overview of how cost-based optimizations work in Starburst clusters, and provides context for the following recommended reading:
Cost-based optimizations Cost in EXPLAIN
Topics:
Partitioned and broadcast joins Disabling cost-based optimizations Join reordering Table statistics Computing statistics with ANALYZE (Starburst Galaxy open_in_new or SEP open_in_new)Running time: ~13 min.
Is the information on this page helpful?
Yes
No
Optimizing query performanceThe query lifecycleThe EXPLAIN statement in detailGeneral optimizationsCost-based optimizations网址:Optimizing query performance https://mxgxt.com/news/view/1718479
相关内容
VTA: An Open, Customizable Deep Learning Acceleration StackS.M. The Performance
Thermal stability and wave permeability of high performance SiBN(C) ceramic fibers
雅思口语part2话题卡:Describe a performance you watched recently 欣赏过的演出
龙星计划课程信息检索OverviewofTextRetrievalPart
「专访」Peak Performance中国品牌负责人钮逸姝:要做最好看的户外运动产品
创业公司Celebrity Performance 评估名人影响力
江西宏柏新材料股份有限公司关于公司与 Momentive Performance Materials Inc. 签订合作意向书的公告
林 瑞
Zhengzhou Tik Tok

