Polars¶

About

Polars is a high‑performance DataFrames library with interfaces for Rust, Python, Node.js, and R, plus a SQL context. It is powered by a multithreaded, vectorized query engine and written in Rust.

Install

pip install 'polars[pyarrow]' sqlalchemy-cratedb

Synopsis

Write Polars dataframe to CrateDB.

example.py

import polars as pl
import sqlalchemy as sa
from sqlalchemy_cratedb import insert_bulk

CRATEDB_URI = "crate://crate:crate@localhost:4200"
TABLE_NAME = "example"

df = pl.from_pandas(makeTimeDataFrame(rows=500_000, freq="s"))
engine = sa.create_engine(CRATEDB_URI)
df.write_database(
    engine="sqlalchemy",
    connection=engine,
    table_name=TABLE_NAME,
    if_table_exists="replace",
    engine_options={
        "method": insert_bulk,
        "chunksize": 20_000,
    },
)

Quickstart example

Create the file example.py including the synopsis code shared above. Complete the example by using the makeTimeDataFrame() function.

def makeTimeDataFrame(rows=5_000, freq = "B"):
    import numpy as np
    import pandas as pd
    return pd.DataFrame(
        np.random.default_rng(2).standard_normal((rows, 4)),
        columns=pd.Index(list("ABCD"), dtype=object),
        index=pd.date_range("2000-01-01", periods=rows, freq=freq),
    )

Start CrateDB using Docker or Podman, then invoke the example program.

docker run --rm --publish=5432:5432 docker.io/crate '-Cdiscovery.type=single-node'

pip install 'polars[pyarrow]' sqlalchemy-cratedb pandas
python example.py

Full example

Connect to CrateDB and CrateDB Cloud using Polars.

Includes basic examples of how to use Polars with CrateDB.

https://github.com/crate/cratedb-examples/tree/main/by-dataframe/polars