a75dc80a76
### What changes were proposed in this pull request? Remove the unneeded embedded inline HTML markup by using the basic markdown syntax. Please see #28414 ### Why are the changes needed? Make the doc cleaner and easily editable by MD editors. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Manually build and check Closes #28451 from huaxingao/html_cleanup. Authored-by: Huaxin Gao <huaxing@us.ibm.com> Signed-off-by: Sean Owen <srowen@gmail.com>
125 lines
4.6 KiB
Markdown
125 lines
4.6 KiB
Markdown
---
|
|
layout: global
|
|
title: ANALYZE TABLE
|
|
displayTitle: ANALYZE TABLE
|
|
license: |
|
|
Licensed to the Apache Software Foundation (ASF) under one or more
|
|
contributor license agreements. See the NOTICE file distributed with
|
|
this work for additional information regarding copyright ownership.
|
|
The ASF licenses this file to You under the Apache License, Version 2.0
|
|
(the "License"); you may not use this file except in compliance with
|
|
the License. You may obtain a copy of the License at
|
|
|
|
http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
Unless required by applicable law or agreed to in writing, software
|
|
distributed under the License is distributed on an "AS IS" BASIS,
|
|
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
|
|
See the License for the specific language governing permissions and
|
|
limitations under the License.
|
|
---
|
|
|
|
### Description
|
|
|
|
The `ANALYZE TABLE` statement collects statistics about the table to be used by the query optimizer to find a better query execution plan.
|
|
|
|
### Syntax
|
|
|
|
```sql
|
|
ANALYZE TABLE table_identifier [ partition_spec ]
|
|
COMPUTE STATISTICS [ NOSCAN | FOR COLUMNS col [ , ... ] | FOR ALL COLUMNS ]
|
|
```
|
|
|
|
### Parameters
|
|
|
|
* **table_identifier**
|
|
|
|
Specifies a table name, which may be optionally qualified with a database name.
|
|
|
|
**Syntax:** `[ database_name. ] table_name`
|
|
|
|
* **partition_spec**
|
|
|
|
An optional parameter that specifies a comma separated list of key and value pairs
|
|
for partitions. When specified, partition statistics is returned.
|
|
|
|
**Syntax:** `PARTITION ( partition_col_name [ = partition_col_val ] [ , ... ] )`
|
|
|
|
* **[ NOSCAN `|` FOR COLUMNS col [ , ... ] `|` FOR ALL COLUMNS ]**
|
|
|
|
* If no analyze option is specified, `ANALYZE TABLE` collects the table's number of rows and size in bytes.
|
|
* **NOSCAN**
|
|
|
|
Collects only the table's size in bytes ( which does not require scanning the entire table ).
|
|
* **FOR COLUMNS col [ , ... ] `|` FOR ALL COLUMNS**
|
|
|
|
Collects column statistics for each column specified, or alternatively for every column, as well as table statistics.
|
|
|
|
### Examples
|
|
|
|
```sql
|
|
CREATE TABLE students (name STRING, student_id INT) PARTITIONED BY (student_id);
|
|
INSERT INTO students PARTITION (student_id = 111111) VALUES ('Mark');
|
|
INSERT INTO students PARTITION (student_id = 222222) VALUES ('John');
|
|
|
|
ANALYZE TABLE students COMPUTE STATISTICS NOSCAN;
|
|
|
|
DESC EXTENDED students;
|
|
+--------------------+--------------------+-------+
|
|
| col_name| data_type|comment|
|
|
+--------------------+--------------------+-------+
|
|
| name| string| null|
|
|
| student_id| int| null|
|
|
| ...| ...| ...|
|
|
| Statistics| 864 bytes| |
|
|
| ...| ...| ...|
|
|
| Partition Provider| Catalog| |
|
|
+--------------------+--------------------+-------+
|
|
|
|
ANALYZE TABLE students COMPUTE STATISTICS;
|
|
|
|
DESC EXTENDED students;
|
|
+--------------------+--------------------+-------+
|
|
| col_name| data_type|comment|
|
|
+--------------------+--------------------+-------+
|
|
| name| string| null|
|
|
| student_id| int| null|
|
|
| ...| ...| ...|
|
|
| Statistics| 864 bytes, 2 rows| |
|
|
| ...| ...| ...|
|
|
| Partition Provider| Catalog| |
|
|
+--------------------+--------------------+-------+
|
|
|
|
ANALYZE TABLE students PARTITION (student_id = 111111) COMPUTE STATISTICS;
|
|
|
|
DESC EXTENDED students PARTITION (student_id = 111111);
|
|
+--------------------+--------------------+-------+
|
|
| col_name| data_type|comment|
|
|
+--------------------+--------------------+-------+
|
|
| name| string| null|
|
|
| student_id| int| null|
|
|
| ...| ...| ...|
|
|
|Partition Statistics| 432 bytes, 1 rows| |
|
|
| ...| ...| ...|
|
|
| OutputFormat|org.apache.hadoop...| |
|
|
+--------------------+--------------------+-------+
|
|
|
|
ANALYZE TABLE students COMPUTE STATISTICS FOR COLUMNS name;
|
|
|
|
DESC EXTENDED students name;
|
|
+--------------+----------+
|
|
| info_name|info_value|
|
|
+--------------+----------+
|
|
| col_name| name|
|
|
| data_type| string|
|
|
| comment| NULL|
|
|
| min| NULL|
|
|
| max| NULL|
|
|
| num_nulls| 0|
|
|
|distinct_count| 2|
|
|
| avg_col_len| 4|
|
|
| max_col_len| 4|
|
|
| histogram| NULL|
|
|
+--------------+----------+
|
|
```
|