python连接
- 开始之前还要装一些依赖
pip install saslpip install thriftpip install thrift-saslpip install PyHive
from pyhive import prestoconn = presto.connect(protocol='https', host="~.com", port=~, username="你的账号", password="你的密码")cursor = conn.cursor()sql = "select * from hive.tmp.adjust_adid0408 limit 10"cursor.execute(sql)res = cursor.fetchall()print(res)for i in res:print(i)# 转为DataFramedf = pd.DataFrame(res)# 列名还要自己写,100多个字段难搞
使用python虽然可以连接,但是再数据分析上还是pandas用起来爽
pandas连接
from pyhive import prestoimport pandas as pdconn = presto.connect(protocol='https', host="~.com", port=~, username="#", password="#")df = pd.read_sql_query("select * from hrder_detail limit 20", conn)# df = pd.read_sql("select * from hrder_detail limit 20", conn)df

