hadoop - Write Parquet format to HDFS using Java API with out using Avro and MR -


what simple way write parquet format hdfs (using java api) directly creating parquet schema of pojo, without using avro , mr?

the samples found outdated , uses deprecated methods uses 1 of avro, spark or mr.

effectively, there not lot of sample available reading/writing apache parquet files without of external framework.

the core parquet library parquet-column can find test files reading/writing directly : https://github.com/apache/parquet-mr/blob/master/parquet-column/src/test/java/org/apache/parquet/io/testcolumnio.java

you need use same functionality hdfs file. can follow sow question : accessing files in hdfs using java

updated : respond deprecated parts of api : avrowritesupport should replaced avroparquetwriter , check parquetwriter it's not deprecated , can used safely.

regards,

loïc


Comments

Popular posts from this blog

amazon web services - S3 Pre-signed POST validate file type? -

c# - Check Keyboard Input Winforms -