Table of Contents |
---|
Lateral View Syntax
Code Block |
---|
lateralView: LATERAL VIEW udtf(expression) tableAlias AS columnAlias (',' columnAlias)*
fromClause: FROM baseTable (lateralView)*
|
...
Lateral view is used in conjunction with user-defined table generating functions such as explode()
. As mentioned in Built-in Table-Generating Functions, a UDTF generates one zero or more output rows for each input row. A lateral view first applies the UDTF to each row of base table and then joins resulting output rows to the input rows to form a virtual table having the supplied table alias.
...
A lateral view with explode() can be used to convert adid_list
into separate rows using the query:
Code Block | ||||
---|---|---|---|---|
| ||||
SELECT pageid, adid
FROM pageAds LATERAL VIEW explode(adid_list) adTable AS adid;
|
...
Then in order to count the number of times a particular ad appears, count/group by can be used:
Code Block | ||||
---|---|---|---|---|
| ||||
SELECT adid, count(1)
FROM pageAds LATERAL VIEW explode(adid_list) adTable AS adid
GROUP BY adid;
|
...
For example, the following could be a valid query:
Code Block | ||||
---|---|---|---|---|
| ||||
SELECT * FROM exampleTable
LATERAL VIEW explode(col1) myTable1 AS myCol1
LATERAL VIEW explode(myCol1) myTable2 AS myCol2;
|
...
The query:
Code Block | ||||
---|---|---|---|---|
| ||||
SELECT myCol1, col2 FROM baseTable
LATERAL VIEW explode(col1) myTable1 AS myCol1;
|
...
A query that adds an additional LATERAL VIEW:
Code Block | ||||
---|---|---|---|---|
| ||||
SELECT myCol1, myCol2 FROM baseTable
LATERAL VIEW explode(col1) myTable1 AS myCol1
LATERAL VIEW explode(col2) myTable2 AS myCol2;
|
...
Info | ||
---|---|---|
| ||
Introduced in Hive version 0.12.0 |
User The user can specify the optional OUTER
keyword to make a row with NULL(s) for UDTF side in case the UDTF does not make any rows keyword to generate rows even when a LATERAL VIEW
usually would not generate a row. This happens when the UDTF used does not generate any rows which happens easily with explode
when the column to explode is empty. In this case the source row would never appear in the results. OUTER
can be used to prevent that and rows will be generated with NULL
values in the columns coming from the UDTF.
For example, the following query returns an empty result:
Code Block | ||
---|---|---|
| ||
SELEC select * fromFROM src LATERAL VIEW explode(array()) C AS a limit 10; |
But with the OUTER
keyword, keyword
Code Block | ||
---|---|---|
| ||
SELECT select * fromFROM src LATERAL VIEW OUTER explode(array()) C AS a limit 10; |
Will it will produce:
238 val_238 NULL
86 val_86 NULL
311 val_311 NULL
27 val_27 NULL
165 val_165 NULL
409 val_409 NULL
255 val_255 NULL
278 val_278 NULL
98 val_98 NULL
...