Rusty Conover
Data infrastructure engineer and founder of Query.Farm. Building DuckDB extensions, scaling data platforms, and writing about what I learn along the way.
Recent Activity
Rusty worked across multiple codebases, with primary focus on the Apache Arrow project, making 6 commits to apache/arrow.
Rusty focused on feature parity across the vgi-rpc language implementations, adding cancel() support for streaming RPC calls to all three client libraries.
Rusty spent the day advancing batched window aggregation support in duckdb/duckdb. He opened PR #22143 introducing aggregate_window_batch_t, a new callback variant that optimizes window function evaluation by pre-collecting subframes for all output rows and invoking the batch callback once, rather than looping through individual rows. This reduces function call overhead for custom aggregate window operations.
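The per-row versus batched callback idea can be sketched in a few lines. This is an illustration in Python, not DuckDB's C++ API and not the code from the PR: the function names, the `Frame` type, and the sum aggregate are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical frame type: a half-open [start, end) row range in the partition.
Frame = Tuple[int, int]


def evaluate_per_row(
    values: Sequence[float],
    frames: Sequence[Frame],
    row_callback: Callable[[Sequence[float]], float],
) -> List[float]:
    """Invoke the aggregate callback once per output row."""
    return [row_callback(values[start:end]) for start, end in frames]


def evaluate_batched(
    values: Sequence[float],
    frames: Sequence[Frame],
    batch_callback: Callable[[List[Sequence[float]]], List[float]],
) -> List[float]:
    """Collect every row's subframe first, then invoke the callback once."""
    subframes = [values[start:end] for start, end in frames]
    return batch_callback(subframes)


def sum_row(frame: Sequence[float]) -> float:
    # Illustrative per-row aggregate.
    return sum(frame)


def sum_batch(subframes: List[Sequence[float]]) -> List[float]:
    # Illustrative batch aggregate: one call produces all output rows.
    return [sum(frame) for frame in subframes]


values = [1.0, 2.0, 3.0, 4.0]
# Sliding frame: the current row plus one preceding row.
frames = [(max(0, i - 1), i + 1) for i in range(len(values))]

per_row_result = evaluate_per_row(values, frames, sum_row)
batched_result = evaluate_batched(values, frames, sum_batch)
```

Both paths produce the same values; the batched path crosses the callback boundary once instead of once per output row, which is where the overhead saving would come from.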
GeoSilo vs GeoArrow + Byte Stream Split: Two Approaches to Geometry Compression
A detailed comparison of GeoSilo's delta-integer encoding against GeoArrow's columnar float64 layout with Parquet BYTE_STREAM_SPLIT, covering compression ratios, precision tradeoffs, and when each approach wins.
VGI Injector: A Tiny HTTPS Download-and-Execute Binary in Zig
I needed a self-contained binary that downloads a program over HTTPS and exec's it — small enough to run in a FROM scratch container with nothing else. Go and Rust couldn't get small enough. Zig could.
TIME Data Type Compatibility Across Databases
A survey of the TIME data type across 14 databases, comparing supported ranges, maximum values, and whether the special value 24:00:00 is accepted.
Telemetry for DuckDB Extensions Without the Pain
I open-sourced the telemetry client I use across Query.Farm's DuckDB extensions. It's two files, one function call, and it only collects platform and version info.
Releasing vgi-rpc: An RPC Framework Built on Apache Arrow
I built an RPC framework for Python that uses Apache Arrow IPC as the wire format and Python Protocol classes as the interface definition. No .proto files, no codegen — just type annotations.
Acronym-Aware Case Conversions in the DuckDB Inflector Extension
The Inflector extension for DuckDB now supports configurable acronyms, so case conversions preserve terms like HTML, API, and URL as fully uppercase — configured through a native DuckDB setting.
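As a rough sketch of what acronym-aware conversion means, here is the idea in plain Python. It is not the Inflector extension's implementation or its function names; `to_pascal_case`, `to_title_case`, and the `acronyms` parameter are hypothetical stand-ins for the extension's setting-driven behavior.

```python
import re
from typing import Iterable, List


def split_words(text: str) -> List[str]:
    """Split on any run of non-alphanumeric characters (underscores, spaces, dashes)."""
    return [word for word in re.split(r"[^A-Za-z0-9]+", text) if word]


def convert_word(word: str, acronyms: Iterable[str]) -> str:
    """Uppercase the word if it is a configured acronym, otherwise capitalize it."""
    known = {acronym.upper() for acronym in acronyms}
    return word.upper() if word.upper() in known else word.capitalize()


def to_pascal_case(text: str, acronyms: Iterable[str] = ()) -> str:
    return "".join(convert_word(word, acronyms) for word in split_words(text))


def to_title_case(text: str, acronyms: Iterable[str] = ()) -> str:
    return " ".join(convert_word(word, acronyms) for word in split_words(text))


configured = ["HTML", "API", "URL"]  # assumed acronym list for illustration

plain = to_pascal_case("html_api_client")
aware = to_pascal_case("html_api_client", configured)
title = to_title_case("parse url string", configured)
```

Without the acronym list, `html_api_client` becomes `HtmlApiClient`; with it, the configured terms stay fully uppercase.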
DuckDB Extension Development Workshop
DuckDB Developer Meeting #1 — Amsterdam, Netherlands
Building on Flight: Real-World Lessons from the DuckDB Airport Extension
Apache Arrow Summit 2025 — Paris, France
What I've Been Working On
I've been deep in ecosystem stewardship these past few months, carrying out infrastructure upgrades and cross-language contributions while shipping production-ready systems. In February I standardized and future-proofed dozens of DuckDB extensions for compatibility and launched the complete vgi-rpc ecosystem across four languages with unified branding. The work balanced systematic infrastructure improvements with practical features, from weather apps with dog-walking safety alerts to upstream Arrow enhancements that fixed silent data loss bugs.
• Led a DuckDB 1.5 compatibility initiative across the extension portfolio, opening 40+ PRs to standardize documentation and future-proof extensions like airport, shellfs, webmacro, and inflector.
• Contributed upstream enhancements to Apache Arrow across JavaScript, Swift, Rust, and Go, adding custom metadata support for RecordBatch IPC messages and fixing silent data drops in production systems.
• Shipped a complete vgi-rpc ecosystem with production-ready implementations in Python, C++, Go, and TypeScript, including unified branding, documentation sites, and Apache 2.0 licensing.