hadoop - Write Parquet format to HDFS using Java API with out using Avro and MR -
what simple way write parquet format hdfs (using java api) directly creating parquet schema of pojo, without using avro , mr?
the samples found outdated , uses deprecated methods uses 1 of avro, spark or mr.
effectively, there not lot of sample available reading/writing apache parquet files without of external framework.
the core parquet library parquet-column can find test files reading/writing directly : https://github.com/apache/parquet-mr/blob/master/parquet-column/src/test/java/org/apache/parquet/io/testcolumnio.java
you need use same functionality hdfs file. can follow sow question : accessing files in hdfs using java
updated : respond deprecated parts of api : avrowritesupport should replaced avroparquetwriter , check parquetwriter it's not deprecated , can used safely.
regards,
loïc
Comments
Post a Comment