Spark + Iceberg API

Because of Spark version issues, we create the table through the Iceberg API directly.

IcebergApi.java

package org.example;

import org.apache.hadoop.conf.Configuration;
import org.apache.iceberg.Schema;
import org.apache.iceberg.Table;
import org.apache.iceberg.catalog.Catalog;
import org.apache.iceberg.catalog.TableIdentifier;
import org.apache.iceberg.hive.HiveCatalog;
import org.apache.iceberg.types.Types;
import org.apache.spark.SparkConf;
import org.apache.spark.sql.SparkSession;

public class IcebergApi {

    // Build a Hadoop Configuration from the Spark session so the Iceberg
    // HiveCatalog can find the Hive metastore and warehouse directory.
    public static Configuration getProperties() {
        System.out.println("start:-----");
        SparkSession spark = SparkSession.builder()
                .config(new SparkConf().setAppName("IcebergApi"))
                .enableHiveSupport()
                .getOrCreate();
        System.out.println("spark: " + spark);

        Configuration conf = spark.sparkContext().hadoopConfiguration();
        // conf.set("spark.sql.warehouse.dir", "/user/bigdata/hive/warehouse/");
        conf.set("hive.metastore.warehouse.dir", "/user/bigdata/hive/warehouse/");
        return conf;
    }

    // Create the Iceberg table testdb.ice_table2 through the Hive catalog.
    public static Table createTable() {
        Configuration conf = getProperties();
        Catalog catalog = new HiveCatalog(conf);
        System.out.println("catalog: " + catalog);

        TableIdentifier name = TableIdentifier.of("testdb", "ice_table2");
        System.out.println("name: " + name);

        // Iceberg schemas assign an explicit ID to every field.
        Schema schema = new Schema(
                Types.NestedField.required(1, "level", Types.StringType.get()),
                Types.NestedField.required(2, "event_time", Types.StringType.get())
        );
        System.out.println("schema: " + schema);

        Table table = catalog.createTable(name, schema);
        System.out.println("end:-----" + table);
        return table;
    }

    public static void main(String[] args) {
        createTable();
    }
}
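Note that the new HiveCatalog(conf) constructor only exists in older Iceberg releases. In newer versions (roughly 0.12 and later) the constructor is no-arg and the catalog is initialized explicitly; a minimal sketch, assuming one of those versions:

// Newer Iceberg API (version-dependent; verify against your release):
HiveCatalog catalog = new HiveCatalog();
catalog.setConf(conf);  // HiveCatalog implements Hadoop's Configurable
// "hive" is just a catalog name; metastore properties can go in the map
catalog.initialize("hive", java.util.Collections.emptyMap());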

Package the project, copy the jar to the Spark server, and then run:

spark-submit --class org.example.IcebergApi \
    --master yarn \
    --deploy-mode cluster \
    /home/bigdata/mhb/iceberg-api-1.0-SNAPSHOT-jar-with-dependencies.jar
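Because the job runs with --deploy-mode cluster, the System.out output above lands in the YARN container logs rather than on the submitting console. It can be retrieved with the application ID printed by spark-submit:

yarn logs -applicationId <application_id>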

Note: the Maven build must bundle all dependencies into the jar, so add the following plugin:
<plugin>
    <artifactId>maven-assembly-plugin</artifactId>
    <version>2.4.1</version>
    <configuration>
        <descriptorRefs>
            <descriptorRef>jar-with-dependencies</descriptorRef>
        </descriptorRefs>
    </configuration>
</plugin>
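As written, the assembly goal still has to be invoked explicitly, e.g. mvn clean package assembly:single. To have mvn package produce the fat jar automatically, the single goal can be bound to the package phase; a standard binding (not part of the original snippet), placed inside the plugin element:

<executions>
    <execution>
        <phase>package</phase>
        <goals>
            <goal>single</goal>
        </goals>
    </execution>
</executions>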





The table is created successfully.
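To sanity-check the result, the table can be loaded back through the same HiveCatalog and its metadata printed; a minimal sketch reusing the catalog from createTable():

// Assumes the same catalog/conf setup as in IcebergApi above.
Table loaded = catalog.loadTable(TableIdentifier.of("testdb", "ice_table2"));
System.out.println(loaded.schema());    // field IDs, names, and types
System.out.println(loaded.location());  // warehouse path backing the table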
