i026e/py-parquet-viewer

Parquet Viewer

A GUI application for viewing *.parquet files, built on PyArrow and the Qt framework. Requires Python 3.8+.

Screenshots

Data tab

Schema tab

Export

Installation

Windows

Download and extract the zip archive for Windows x64 from the Releases page.

Launch the parquet-viewer.exe file.

As python package

Optionally, create a virtual environment.

Copy the link to the .whl file from the Releases page and pass it to pip install.

For example:

# python3 -m venv .venv
# source .venv/bin/activate

pip3 install https://github.com/i026e/py-parquet-viewer/releases/download/v0.1.0/parquet_viewer-0.1.0-py3-none-any.whl

Usage

Filters

The viewer supports filters in an SQL-like syntax:

column is null, column is not null: applicable to columns of any type, including structs and lists
column ~ '%substring%': selects rows containing the specified substring; string columns only
column = true, column = false: for boolean columns
column == 123.123, column > 123, column >= 123, column < 123, column <= 123: for string and number columns
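
To make the meaning of each filter form concrete, here is a minimal plain-Python sketch that applies equivalent predicates to a small list of rows. It is only an illustration of the semantics, not the viewer's implementation: the sample rows, the `like` and `select` helpers, the reading of `%` as "any run of characters" (as in SQL LIKE), and the choice to drop null values from pattern matches and comparisons are all assumptions.

```python
import re

# Hypothetical sample rows standing in for a Parquet table; None models a null.
ROWS = [
    {"name": "alpha", "active": True, "score": 123.123, "tags": ["a"]},
    {"name": "beta", "active": False, "score": 200.0, "tags": None},
    {"name": None, "active": None, "score": None, "tags": []},
]


def like(value, pattern):
    """Assumed SQL LIKE semantics: '%' matches any run of characters."""
    if value is None:
        return False  # assumption: a null never matches a pattern
    regex = ".*".join(re.escape(part) for part in pattern.split("%"))
    return re.fullmatch(regex, value, flags=re.DOTALL) is not None


def select(rows, predicate):
    """Return the rows for which the predicate holds."""
    return [row for row in rows if predicate(row)]


# tags is null / tags is not null (an empty list is not null)
is_null = select(ROWS, lambda r: r["tags"] is None)
not_null = select(ROWS, lambda r: r["tags"] is not None)

# name ~ '%lph%'
contains = select(ROWS, lambda r: like(r["name"], "%lph%"))

# active = true
is_true = select(ROWS, lambda r: r["active"] is True)

# score >= 123 (assumption: rows with a null score are excluded)
at_least = select(ROWS, lambda r: r["score"] is not None and r["score"] >= 123)

for label, rows in [
    ("tags is null", is_null),
    ("tags is not null", not_null),
    ("name ~ '%lph%'", contains),
    ("active = true", is_true),
    ("score >= 123", at_least),
]:
    print(label, "->", [row["name"] for row in rows])
```

Note that a pattern without a leading `%`, such as `'lph%'`, would under this reading only match values that start with `lph`.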

Development

Clone the repository from GitHub:

git clone git@github.com:i026e/py-parquet-viewer.git

Create a virtual environment and install the dependencies with Poetry:

python3 -m venv .venv
source .venv/bin/activate
pip3 install poetry
poetry install

To run the application:

poetry run parquet-viewer [path/to/parquet/file]
# OR
./parquet-viewer [path/to/parquet/file]

QTDesigner

To change the UI layout, you need Qt Designer.

Run the qt_designer/run_qt_designer.sh script to launch it with the required plugins loaded.

Build WHL

poetry build

Build EXE

poetry run build-exe

Attribution