解决spark hive插入数据异常Spark currently does NOT populate bucketed output_编程学问网

解决spark hive插入数据异常Spark currently does NOT populate bucketed output

hive | 2019-09-13 10:02:39

我是启动了 spark thriftserver，然后通过客户端连接hive,创建表成功，然后插入数据的时候出现异常。

1.在spark/sbin启动beeline连接hive

./beeline -u jdbc:hive2://master:10000

2.创建表

CREATE  TABLE t2 (id INT, name STRING) PARTITIONED BY (country STRING, state STRING)
CLUSTERED BY (id) INTO 8 BUCKETS
STORED AS ORC TBLPROPERTIES ('transactional'='true');

使用BUCKETS是为了支持 update delete能够修改和删除hive表数据

3.插入数据

beeline> INSERT INTO TABLE t2 PARTITION (country, state) VALUES (5,'刘','DD','DD');

出现异常

Error: org.apache.spark.sql.AnalysisException: Output Hive table `default`.`t2` is bucketed but Spark currently does NOT populate bucketed output which is compatible with Hive.; (state=,code=0)

4.解决方法

beeline> set hive.enforce.bucketing=false;
beeline> set hive.enforce.sorting=false;

然后再执行插入语句就能成功。

5.修改hive-site.xml

beeline set变量只是当前临时测试，要一直生效，就要修改hive-site.xml。

<property> 
<name>hive.enforce.bucketing</name> 
<value>false</value> 
</property> 
<property> 
<name>hive.enforce.sorting</name> 
<value>false</value> 
</property>

登录后即可回复登录 | 注册

相关文章

spark on hive 异常 `hivefileformat` doesn t match `parquetfileformat`spark操作hive orc transactional事务表异常解决spark hive插入数据异常spark currently does not populate bucketed output jdbc连接hive spark thriftserver异常unable to move source java jdbc通过spark连接hive 异常required field client protocol is unset spark hive 异常version information not found in metastore hive on spark异常failed to create spark client for spark session解决过程 hive on spark parquetdecodingexception 异常解决 hive on spark集群环境搭建 spark异常 could not locate executable null bin winutils.exe in the hadoop binaries spark hive 元数据异常 filenotfoundexception spark 异常 missing an output location for shuffle linux hadoop、hbase、hive、spark大数据分布式集群环境搭建 spark on yarn 异常 spark shuffle does not exist spark rdd写入数据到hbase nullpointerexception异常 spark从oracle导入数据到hive spark hive 异常 could not connect to meta store using any of the uris provided php mongocollection creates an index on the specified field s if it does not already exist spark hive Can not create the managed table('`xxx`'). The associated location('xxx') already exists elasticsearch异常query does not support [auto_generate_synonyms_phrase_query]

关注编程学问公众号