Rusty Conover
Data infrastructure engineer and founder of Query.Farm. Building DuckDB extensions, scaling data platforms, and writing about what I learn along the way.
Recent Activity
View all →Rusty spent the day advancing batched window aggregation support in duckdb/duckdb. He opened PR #22143 introducing aggregate_window_batch_t, a new callback variant that optimizes window function evaluation by pre-collecting subframes for all output rows and invoking the batch callback once, rather than looping through individual rows. This reduces function call overhead for custom aggregate window operations and demonstrates a thoughtful approach to performance in the aggregation pipeline.
Rusty added a health endpoint to vgi-rpc-python, extending the RPC service with a new monitoring capability. This addition supports better operational observability for deployments relying on the library.
Rusty had a busy day maintaining and updating the DuckDB ecosystem, with particular focus on the community extensions and addressing build issues in the core DuckDB library.
GeoSilo vs GeoArrow + Byte Stream Split: Two Approaches to Geometry Compression
A detailed comparison of GeoSilo's delta-integer encoding against GeoArrow's columnar float64 layout with Parquet BYTE_STREAM_SPLIT, covering compression ratios, precision tradeoffs, and when each approach wins.
VGI Injector: A Tiny HTTPS Download-and-Execute Binary in Zig
I needed a self-contained binary that downloads a program over HTTPS and exec's it — small enough to run in a FROM scratch container with nothing else. Go and Rust couldn't get small enough. Zig could.
TIME Data Type Compatibility Across Databases
A survey of the TIME data type across 14 databases, comparing supported ranges, maximum values, and whether the special value 24:00:00 is accepted.
Telemetry for DuckDB Extensions Without the Pain
I open-sourced the telemetry client I use across Query.Farm's DuckDB extensions. It's two files, one function call, and it only collects platform and version info.
Releasing vgi-rpc: An RPC Framework Built on Apache Arrow
I built an RPC framework for Python that uses Apache Arrow IPC as the wire format and Python Protocol classes as the interface definition. No .proto files, no codegen — just type annotations.
Acronym-Aware Case Conversions in the DuckDB Inflector Extension
The Inflector extension for DuckDB now supports configurable acronyms, so case conversions preserve terms like HTML, API, and URL as fully uppercase — configured through a native DuckDB setting.
DuckDB Extension Development Workshop
DuckDB Developer Meeting #1 — Amsterdam, Netherlands
Building on Flight: Real-World Lessons from the DuckDB Airport Extension
Apache Arrow Summit 2025 — Paris, France
What I've Been Working On
Full activity log →I've been orchestrating infrastructure upgrades and ecosystem stewardship across the DuckDB and Apache Arrow landscapes, from future-proofing dozens of extensions for compatibility to shipping production-ready implementations across multiple languages. February brought a visual identity overhaul for the vgi-rpc ecosystem and classified operations across several undisclosed projects, while January balanced deep performance archaeology with meaningful open-source contributions and a timely security disclosure. Throughout these months, I've maintained a rhythm of both high-impact core improvements and meticulous ecosystem gardening that keeps the broader developer community moving forward.
- Led a DuckDB 1.5 compatibility push across the Query-farm extension portfolio, opening 40+ PRs to standardize and future-proof extensions like airport, shellfs, inflector, and webmacro while aligning documentation structures.
- Contributed upstream enhancements to Apache Arrow across four language implementations, adding custom metadata support for RecordBatch IPC messages and fixing silent data drops in dictionary serialization.
- Shipped a complete vgi-rpc ecosystem with production-ready implementations in Python, C++, Go, and TypeScript, including unified branding, Cloudflare documentation infrastructure, and standardized licensing across all repositories.