Uncategorized

How to Run Scalable Geospatial Analysis with Apache Sedona – Right From Your Laptop

I spend a lot of time talking about scaling geospatial analysis—massive datasets, remote sensing archives, distributed computation—but sometimes the best way to start is right from your laptop. In this tutorial, we’ll set up a local Apache Spark environment using Apache Sedona, a powerful extension for scalable geospatial processing. We’ll connect to remote cloud-hosted data, […]

Scaling GIS Workflows with COGs, Airflow, and Apache Iceberg

Top of the stack: what we need to do with COGs. COGs (Cloud-Optimized GeoTIFFs) are one of the most promising tools we have for making raster data truly cloud-native. They let you stream just the pieces you need, work remotely, and plug into modern geospatial systems without downloading giant files. But after working closely with […]

Spatial Joins: A Comprehensive Guide

Spatial joins combine data based on location instead of a common key. In a spatial join, attributes from one dataset are attached to another by evaluating how their geometries relate in space. For example, you could join a list of customer coordinates to sales territories by finding which territory polygon contains each customer point. This […]
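To make the customer-and-territory example concrete, here is a minimal sketch in plain Python. It is not how a production library does it: it assumes simple, non-overlapping polygons given as lists of (x, y) vertices, uses a ray-casting point-in-polygon test, and loops over every point and polygon pair with no spatial index. The names `point_in_polygon`, `spatial_join`, and the sample data are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Optional, Tuple

Point = Tuple[float, float]
Polygon = List[Point]  # vertices in order; the last vertex connects back to the first


def point_in_polygon(point: Point, polygon: Polygon) -> bool:
    """Ray-casting test: cast a ray to the right and count edge crossings."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            # x-coordinate where the edge meets that horizontal line.
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside  # an odd number of crossings means inside
    return inside


def spatial_join(
    customers: Dict[str, Point], territories: Dict[str, Polygon]
) -> Dict[str, Optional[str]]:
    """Attach to each customer the first territory whose polygon contains it."""
    joined: Dict[str, Optional[str]] = {}
    for name, location in customers.items():
        joined[name] = next(
            (
                territory
                for territory, polygon in territories.items()
                if point_in_polygon(location, polygon)
            ),
            None,  # no territory contains this customer
        )
    return joined


# Hypothetical sample data: two square sales territories side by side.
territories = {
    "west": [(0, 0), (5, 0), (5, 5), (0, 5)],
    "east": [(5, 0), (10, 0), (10, 5), (5, 5)],
}
customers = {"alice": (1.0, 2.0), "bob": (7.5, 3.0), "carol": (12.0, 1.0)}

result = spatial_join(customers, territories)
print(result)
```

The nested loop compares every customer against every territory, which is fine for a handful of features but is the part that spatial indexes and distributed engines exist to avoid on larger datasets.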

The Top Geospatial Python Packages: What’s Driving Their Growth?

Geospatial Python has seen an explosion in adoption, with several key libraries surpassing 10 million downloads this year. As geospatial analysis becomes increasingly critical in data science, urban planning, environmental monitoring, and AI applications, understanding why these libraries are growing can provide insight into where the industry is heading. Here’s a breakdown of the most […]