My preferred way is to just use the DuckDB Java API. I haven't seen anything better in terms of performance or efficiency. Also, a SQL query is often easier to write.
theanonymousone 4 hours ago [-]
Yes. It has bothered me for a long time too. Maybe the best mix is a dataframe library with basic operations (column select, non-null filtering, etc.) that also allows SQL for more complex stuff?
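To make the "basic operations" half of that idea concrete, here is a rough sketch using only the Java standard library. Everything in it (the `MiniFrame` class and its `addColumn`, `select`, and `dropNulls` methods) is hypothetical and invented for illustration; it is not the API of any library mentioned in this thread. The SQL half is left out on purpose, since that part would presumably be delegated to an existing engine.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical mini data frame: illustrative only, not a real library API.
class MiniFrame {
    // Column name -> column values; LinkedHashMap keeps column order stable.
    private final Map<String, List<Object>> columns = new LinkedHashMap<>();

    // Adds (or replaces) a column; values are copied defensively.
    MiniFrame addColumn(String name, List<?> values) {
        columns.put(name, new ArrayList<>(values));
        return this;
    }

    int rowCount() {
        return columns.isEmpty() ? 0 : columns.values().iterator().next().size();
    }

    List<String> columnNames() {
        return new ArrayList<>(columns.keySet());
    }

    List<Object> column(String name) {
        return columns.get(name);
    }

    // Column select: keep only the named columns, in the order given.
    MiniFrame select(String... names) {
        MiniFrame out = new MiniFrame();
        for (String name : names) {
            if (!columns.containsKey(name)) {
                throw new IllegalArgumentException("No such column: " + name);
            }
            out.addColumn(name, columns.get(name));
        }
        return out;
    }

    // Non-null filter: keep only rows whose value in the given column is not null.
    MiniFrame dropNulls(String name) {
        List<Object> key = columns.get(name);
        if (key == null) {
            throw new IllegalArgumentException("No such column: " + name);
        }
        MiniFrame out = new MiniFrame();
        for (Map.Entry<String, List<Object>> entry : columns.entrySet()) {
            List<Object> kept = new ArrayList<>();
            for (int i = 0; i < key.size(); i++) {
                if (key.get(i) != null) {
                    kept.add(entry.getValue().get(i));
                }
            }
            out.addColumn(entry.getKey(), kept);
        }
        return out;
    }

    public static void main(String[] args) {
        MiniFrame df = new MiniFrame()
            .addColumn("name", Arrays.asList("a", null, "c"))
            .addColumn("age", Arrays.asList(1, 2, 3))
            .addColumn("city", Arrays.asList("x", "y", "z"));

        // Select two columns, then drop the row where "name" is null.
        MiniFrame result = df.select("name", "age").dropNulls("name");
        System.out.println(result.columnNames() + " rows=" + result.rowCount());
    }
}
```

Anything beyond this (joins, group-by, window functions) is where handing a SQL string to an embedded engine would probably be less work than growing the method list.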
jurgenaut23 6 hours ago [-]
Why replicate a bad API? Please build a Polars-like Java data frame library.
rickette 8 hours ago [-]
Congrats on putting this out there. As you said, there isn't a de facto pandas-like library in Java. For Kotlin, though, there is one: https://github.com/Kotlin/dataframe
skanga 7 hours ago [-]
What about Tablesaw, Apache Arrow? How does this compare ...
https://github.com/jtablesaw/tablesaw
https://github.com/dflib/dflib