Uncategorized

Apache Sedona Tutorial: Scalable Spatial Joins and Geospatial Processing with Spark

June 6, 2025 Matt Forrest Comments Off

If you’ve ever tried to run a spatial join on millions of features and watched your machine cry for help, it’s time to level up with Apache Sedona.

In this new video, I walk through a complete hands-on tutorial for using Sedona in Python via JupyterLab. We’ll load spatial data, perform scalable spatial operations, and explore what makes Sedona a powerful choice for modern geospatial data processing.

What You’ll Learn

How to set up a local Sedona environment with Spark
Performing spatial joins and transformations at scale
Running everything inside a familiar Jupyter notebook

Whether you’re coming from GeoPandas or PostGIS, this is a great entry point to start working with distributed spatial computing in Python.

Resources

📦 GitHub Repo (Notebook + Setup Instructions): https://github.com/mbforr/sedona-tutorial
🗂️ D ownload the sample data used in the tutorial

Sedona makes it possible to scale up your geospatial work without giving up the tools you love. If you’ve been wanting to bridge the gap between desktop GIS and big data, this is your next step.

Apache Sedona Tutorial: Scalable Spatial Joins and Geospatial Processing with Spark

What You’ll Learn

Resources

Matt Forrest

Spatial Lab

Policies

Spatial SQL

Join Us

Get the newsletter

Apache Sedona Tutorial: Scalable Spatial Joins and Geospatial Processing with Spark

What You’ll Learn

Resources

Matt Forrest

Spatial Lab

Policies

Spatial SQL

Join Us