duckdb

create a polars dataframe containing unique values from a set of CSVs

create a polars dataframe containing unique values from a set of CSVs Question: I have +3000 CSVs with +10 columns. What I need is to get all unique values from just two of these. I am able to read unique values in polars: import polars as pl df1 = pl.read_csv("test1.biobank.tsv.gz", sep=’t’, dtype={"#chrom": pl.Utf8}, n_threads=8, columns=["#chrom", …

Total answers: 2