本文主要记录配置安全的Impala集群集成Sentry的过程。Impala集群上配置了Kerberos认证,并且需要提前配置好Hive与Kerberos和Sentry的集成:
- 使用yum安装CDH Hadoop集群
- Hive配置kerberos认证
- Impala配置kerberos认证
-
1. 环境说明
系统环境:
操作系统:CentOs 6.6
- Hadoop版本:
CDH5.4 - JDK版本:
1.7.0_71 - 运行用户:root
集群各节点角色规划为:
192.168.56.121 cdh1 NameNode、ResourceManager、HBase、Hive metastore、Impala Catalog、Impala statestore、Sentry192.168.56.122 cdh2 DataNode、NodeManager、HBase、Hiveserver2、Impala Server192.168.56.123 cdh3 DataNode、HBase、NodeManager、Hiveserver2、Impala Server
2. 修改Impala配置
修改 /etc/default/impala 文件中的 IMPALA_SERVER_ARGS 参数,添加:
-server_name=server1-sentry_config=/etc/hive/conf/sentry-site.xml
在 IMPALA_CATALOG_ARGS 中添加:
-sentry_config=/etc/hive/conf/sentry-site.xml
/etc/hive/conf/sentry-site.xml 内容如下:
<?xml version="1.0" encoding="UTF-8"?><configuration><property><name>sentry.service.client.server.rpc-port</name><value>8038</value></property><property><name>sentry.service.client.server.rpc-address</name><value>cdh1</value></property><property><name>sentry.service.client.server.rpc-connection-timeout</name><value>200000</value></property><property><name>sentry.provider</name><value>org.apache.sentry.provider.file.HadoopGroupResourceAuthorizationProvider</value></property><property><name>sentry.hive.provider.backend</name><value>org.apache.sentry.provider.db.SimpleDBProviderBackend</value></property><property><name>sentry.metastore.service.users</name><value>hive</value><!--queries made by hive user (beeline) skip meta store check--></property><property><name>sentry.hive.server</name><value>server1</value></property><property><name>sentry.hive.testing.mode</name><value>true</value></property></configuration>
3. 重启Impala服务
在cdh1节点
4. 测试
5. 其他说明
如果要使用基于文件存储的方式配置Sentry store,则需要修改 /etc/default/impala 文件中的 IMPALA_SERVER_ARGS 参数,添加:
-server_name=server1-authorization_policy_file=/user/hive/sentry/sentry-provider.ini-authorization_policy_provider_class=org.apache.sentry.provider.file.LocalGroupResourceAuthorizationProvider
创建 sentry-provider.ini 文件并将其上传到 hdfs 的 /user/hive/sentry/ 目录:
$ cat /tmp/sentry-provider.ini[databases]# Defines the location of the per DB policy file for the customers DB/schema#db1 = hdfs://cdh1:8020/user/hive/sentry/db1.ini[groups]admin = any_operationhive = any_operationtest = select_filtered[roles]any_operation = server=server1->db=*->table=*->action=*select_filtered = server=server1->db=filtered->table=*->action=SELECTselect_us = server=server1->db=filtered->table=events_usonly->action=SELECT[users]test = testhive= hive$ hdfs dfs -rm -r /user/hive/sentry/sentry-provider.ini$ hdfs dfs -put /tmp/sentry-provider.ini /user/hive/sentry/$ hdfs dfs -chown hive:hive /user/hive/sentry/sentry-provider.ini$ hdfs dfs -chmod 640 /user/hive/sentry/sentry-provider.ini
注意:server1 必须和 sentry-provider.ini 文件中的保持一致。
