biodatageeks/polars-bio

polars-bio - Next-gen Python DataFrame operations for genomics!

polars-bio is a Python library for genomics built on top of Polars, Apache Arrow, and Apache DataFusion. It provides a DataFrame API for genomic data and is designed to be blazing fast, memory-efficient, and easy to use.
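To make the kind of interval operation such a DataFrame API targets more concrete, here is a minimal plain-Python sketch of two of them: finding overlapping interval pairs and counting overlaps per interval. It deliberately does not use polars-bio. The `Interval` type, the function names `overlap` and `count_overlaps`, and the 1-based inclusive coordinate convention are all assumptions made for illustration, not the library's API.

```python
from collections import defaultdict
from typing import NamedTuple


class Interval(NamedTuple):
    """Hypothetical genomic interval; coordinates assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int


def overlap(left, right):
    """Return every (left, right) pair on the same chromosome sharing at least one base.

    Naive nested-loop version for clarity; real libraries use indexed or
    tree-based algorithms to avoid comparing every pair.
    """
    by_chrom = defaultdict(list)
    for r in right:
        by_chrom[r.chrom].append(r)

    pairs = []
    for l in left:
        for r in by_chrom.get(l.chrom, []):
            # Two closed intervals overlap when each starts before the other ends.
            if l.start <= r.end and r.start <= l.end:
                pairs.append((l, r))
    return pairs


def count_overlaps(left, right):
    """For each interval in `left`, count the intervals in `right` that overlap it."""
    return [len(overlap([l], right)) for l in left]


reads = [
    Interval("chr1", 100, 200),
    Interval("chr1", 300, 400),
    Interval("chr2", 50, 60),
]
targets = [
    Interval("chr1", 150, 350),
    Interval("chr1", 401, 500),
    Interval("chr2", 10, 49),
]

pairs = overlap(reads, targets)
counts = count_overlaps(reads, targets)
print(pairs)
print(counts)
```

The nested loop is quadratic in the number of intervals per chromosome, which is why performance is the main selling point of specialised libraries for these operations.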

Key Features

Single-thread performance 🏃‍

Benchmark plots: overlap-single.png, count-overlaps-single.png, coverage-single.png

Parallel performance 🏃‍🏃‍

Benchmark plots: overlap-parallel.png, count-overlaps-parallel.png, coverage-parallel.png

Read the documentation