| layout | title | description | group |
|---|---|---|---|
| page | Hive Interpreter for Apache Zeppelin | Apache Hive data warehouse software facilitates querying and managing large datasets residing in distributed storage. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL. | interpreter |
{% include JB/setup %}
Hive Interpreter for Apache Zeppelin
Important Notice
The Hive Interpreter will be deprecated and merged into the JDBC Interpreter. You can get the same functionality by using the JDBC Interpreter with the settings and dependencies shown below.
Properties
| Property | Value |
|---|---|
| hive.driver | org.apache.hive.jdbc.HiveDriver |
| hive.url | jdbc:hive2://localhost:10000 |
| hive.user | hiveUser |
| hive.password | hivePassword |
Dependencies
| Artifact | Exclude |
|---|---|
| org.apache.hive:hive-jdbc:0.14.0 | |
| org.apache.hadoop:hadoop-common:2.6.0 | |
Configuration
| Property | Default | Description |
|---|---|---|
| default.driver | org.apache.hive.jdbc.HiveDriver | Class path of JDBC driver |
| default.url | jdbc:hive2://localhost:10000 | URL for connection |
| default.user | | ( Optional ) Username of the connection |
| default.password | | ( Optional ) Password of the connection |
| default.xxx | | ( Optional ) Other properties used by the driver |
| ${prefix}.driver | | Driver class path of %hive(${prefix}) |
| ${prefix}.url | | URL of %hive(${prefix}) |
| ${prefix}.user | | ( Optional ) Username of the connection of %hive(${prefix}) |
| ${prefix}.password | | ( Optional ) Password of the connection of %hive(${prefix}) |
| ${prefix}.xxx | | ( Optional ) Other properties used by the driver of %hive(${prefix}) |
This interpreter supports multiple configurations, distinguished by ${prefix}. You can define a separate set of connection properties under each prefix and select one in a paragraph with %hive(${prefix}).
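To make the prefix convention concrete, the following Python sketch groups flat `prefix.key` properties into one connection configuration per prefix. It only illustrates the naming scheme and is not Zeppelin's implementation; the function `group_by_prefix`, the `etl` prefix values, and the host `etl-host` are hypothetical, and it assumes that a plain `%hive` paragraph uses the `default` prefix.

```python
from collections import defaultdict


def group_by_prefix(properties):
    """Split flat 'prefix.key' properties into one dict per prefix."""
    connections = defaultdict(dict)
    for name, value in properties.items():
        # 'etl.url' -> prefix 'etl', key 'url'
        prefix, _, key = name.partition(".")
        connections[prefix][key] = value
    return dict(connections)


# Hypothetical settings: the 'etl' entries and 'etl-host' are made up.
properties = {
    "default.driver": "org.apache.hive.jdbc.HiveDriver",
    "default.url": "jdbc:hive2://localhost:10000",
    "etl.driver": "org.apache.hive.jdbc.HiveDriver",
    "etl.url": "jdbc:hive2://etl-host:10000",
    "etl.user": "hiveUser",
}

connections = group_by_prefix(properties)
print(connections["default"]["url"])  # assumed to serve %hive
print(connections["etl"]["url"])      # assumed to serve %hive(etl)
```

Under this reading, each prefix behaves like an independent connection profile, so adding a new data source only requires adding another set of `${prefix}.*` properties.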
Overview
The Apache Hive ™ data warehouse software facilitates querying and managing large datasets residing in distributed storage. Hive provides a mechanism to project structure onto this data and query the data using a SQL-like language called HiveQL. At the same time this language also allows traditional map/reduce programmers to plug in their custom mappers and reducers when it is inconvenient or inefficient to express this logic in HiveQL.
How to use
Basically, you can use
%hive
select * from my_table;
or
%hive(etl)
-- 'etl' is a ${prefix}
select * from my_table;
You can also run multiple queries, up to 10 by default. Changing this limit is not implemented yet.
Apply Zeppelin Dynamic Forms
You can leverage Zeppelin Dynamic Form inside your queries. You can use both the text input and select form parameterization features.
%hive
SELECT ${group_by}, count(*) as count
FROM retail_demo.order_lineitems_pxf
GROUP BY ${group_by=product_id,product_id|product_name|customer_id|store_id}
ORDER BY count ${order=DESC,DESC|ASC}
LIMIT ${limit=10};
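To show what the query above looks like once the form values are filled in, here is a small Python sketch that substitutes each `${name=default,option1|option2}` placeholder with a chosen value or its default. It is an approximation for illustration only, not Zeppelin's actual form parser; the helpers `collect_defaults` and `render` are hypothetical, and the sketch assumes that a bare `${group_by}` takes the default declared by the other `group_by` placeholder.

```python
import re

# Matches ${name}, ${name=default} and ${name=default,opt1|opt2}
FORM_PATTERN = re.compile(r"\$\{([^=}]+)(?:=([^,}]*)(?:,([^}]*))?)?\}")


def collect_defaults(query):
    """Return {form name: default value} for every form in the query."""
    defaults = {}
    for name, default, _options in FORM_PATTERN.findall(query):
        # An occurrence with an explicit default overrides an empty one.
        if default or name not in defaults:
            defaults[name] = default
    return defaults


def render(query, values=None):
    """Replace each form with the supplied value, else its default."""
    resolved = collect_defaults(query)
    resolved.update(values or {})
    return FORM_PATTERN.sub(lambda m: resolved[m.group(1)], query)


query = """SELECT ${group_by}, count(*) as count
FROM retail_demo.order_lineitems_pxf
GROUP BY ${group_by=product_id,product_id|product_name|customer_id|store_id}
ORDER BY count ${order=DESC,DESC|ASC}
LIMIT ${limit=10};"""

# With defaults only, then with two values picked in the forms.
print(render(query))
print(render(query, {"group_by": "store_id", "order": "ASC"}))
```

With the defaults, the rendered statement groups by `product_id`, orders descending, and limits the result to 10 rows; picking other values in the forms changes only the substituted fragments.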