← Back to All Plugins Cloud Native

GIS Data Conversion Pipeline

Transform geospatial source data into cloud-native optimized formats using DuckDB as the core processing engine. High-performance batch processing for cloud deployment.

Key Features

High-performance data preparation for cloud infrastructure

DuckDB Processing Engine

Unified processing using DuckDB with spatial and httpfs extensions for high-performance analytical queries.

📦

Multi-Format Support

Vector formats (GPKG, SHP, GeoJSON, FlatGeoBuf), LiDAR (LAS/LAZ), and Raster (GeoTIFF) in one pipeline.

Cloud Storage Upload

Direct upload to AWS S3, Azure Blob Storage, or Google Cloud Storage with automatic bucket management.

📈

Hilbert Curve Sorting

Spatial sorting using Hilbert curves for optimized query performance on cloud-native formats.

🛠

Batch Processing

Process entire directories with per-file error isolation. Continue processing even if individual files fail.

💻

CLI Interface

Command-line interface for automation, scripting, and integration with CI/CD pipelines.

🔧

Format Optimization

Convert to cloud-optimized formats like FlatGeoBuf, COPC, and COG for efficient cloud access.

📊

Progress Tracking

Real-time progress reporting with detailed logging for monitoring large batch operations.

Supported Formats

Comprehensive format support for all your geospatial data

Vector Input

GeoPackage, Shapefile, GeoJSON, FlatGeoBuf, GeoParquet, and other OGR-supported formats.

Vector Output

FlatGeoBuf, GeoParquet, Cloud-Optimized GeoPackage with spatial indexing and sorting.

LiDAR Processing

LAS/LAZ point cloud conversion to COPC (Cloud-Optimized Point Cloud) for streaming access.

Raster Processing

GeoTIFF to COG (Cloud-Optimized GeoTIFF) with tiling, overviews, and internal compression.

Cloud Targets

AWS S3, Azure Blob Storage, Google Cloud Storage with presigned URLs and credential support.

HTTP Access

Read remote files directly via HTTP/HTTPS using DuckDB httpfs extension for cloud datasets.

Performance Benefits

Optimized for cloud-native geospatial workflows

Hilbert Spatial Sorting

Improves spatial query performance by 10-100x for common bbox queries on large datasets.

Cloud-Optimized Formats

Enable HTTP range requests for partial file access without downloading entire datasets.

DuckDB Columnar Engine

Analytical query performance with columnar storage and parallel processing capabilities.

Optimize for Cloud Deployment

Transform your geospatial data into cloud-native formats optimized for cloud storage and query performance.

Get the Plugin