col1, col2的结果会去重:
select col1, col2
from the_table1
union
select col1, col2
from the_table2
col1, col2的结果不会去重:
select col1, col2
from the_table1
union all
select col1, col2
from the_table2
Spark的union
,unionAll
,unionByName
都不会去重