Can One Deep-Learning Detector Find Trees Across Any LiDAR Platform?

We explain the proposed tree-center detection framework in three stages.

Fig. 1: Pipeline of the proposed framework (Reproduced from Fig. 1 in [1]).

Make Points Comparable
We first prepare the data and use Discrete Morse Theory (DMT) to convert a raw point cloud into structure-based clusters and anchor points, which are more stable than density-based neighborhoods.
Input: forest LiDAR point cloud (TLS / MLS / ULS)
Terrain normalization: estimate ground surface and convert heights to height above ground
Height slicing: split the scene into thin vertical slices for stable, layer-wise processing
DMT-based clustering: in each slice, DMT groups points by following height-driven discrete flows and produces a representative anchor for each cluster
Output: per-slice structure-based clusters + anchors for the next stage
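The terrain normalization and height slicing steps above can be sketched as follows. This is a minimal illustration, not the framework's implementation: the ground surface is approximated by the lowest point in each XY grid cell, and the function names, the cell size, and the slice thickness are assumptions chosen for the example. The DMT-based clustering step is not reproduced here.

```python
import numpy as np

def normalize_heights(points, cell=1.0):
    """Convert z to height above a coarse ground estimate.

    Assumption: ground height in each XY cell is the lowest z in that cell.
    points: (N, 3) array of x, y, z coordinates.
    """
    xy_idx = np.floor((points[:, :2] - points[:, :2].min(axis=0)) / cell).astype(int)
    # Flatten the 2D cell index into a single key per point
    n_cols = xy_idx[:, 1].max() + 1
    keys = xy_idx[:, 0] * n_cols + xy_idx[:, 1]
    ground = np.full(keys.max() + 1, np.inf)
    np.minimum.at(ground, keys, points[:, 2])  # per-cell minimum z
    out = points.copy()
    out[:, 2] = points[:, 2] - ground[keys]
    return out

def slice_by_height(points, thickness=0.5):
    """Split normalized points into thin vertical slices.

    Returns a dict mapping slice index to the points in that slice.
    """
    idx = np.floor(points[:, 2] / thickness).astype(int)
    return {int(i): points[idx == i] for i in np.unique(idx)}
```

Calling slice_by_height(normalize_heights(points)) yields one point set per height band, which is the form of input the per-slice clustering stage expects.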

Topology Features + AI
We learn compact features from the DMT anchors and project them onto a fixed 2D ground map.
Point encoding: use a small neural network (MLP: Multi-Layer Perceptron, a stack of linear layers with nonlinear activations between them) to embed each point into a feature vector
Anchor feature seeding: summarize member-point features inside each DMT cluster to form an anchor descriptor plus a few general geometry cues
Anchor context exchange: connect nearby anchors using kNN (k-Nearest Neighbors) and update them with a lightweight message-passing block
Local anchor–point update: refine features by aggregating information within each cluster neighborhood using an efficient attention-style pooling
Grid projection: pool anchor features onto a fixed-resolution 2D ground grid for each slice, producing per-slice feature maps
Output: a stack of 2D slice maps (one map per height slice)
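To make the data flow of this stage concrete, the sketch below walks one slice through anchor feature seeding, a kNN context exchange, and grid projection. It is a simplified stand-in under stated assumptions: mean pooling replaces the learned cluster summary, a plain neighbor average replaces the message-passing block, max pooling is used for the grid projection, and all function names and parameters are hypothetical. The learned MLP and the attention-style update are omitted.

```python
import numpy as np

def seed_anchor_features(point_feats, cluster_ids, n_clusters):
    """Mean-pool member-point features into one descriptor per cluster."""
    sums = np.zeros((n_clusters, point_feats.shape[1]))
    np.add.at(sums, cluster_ids, point_feats)
    counts = np.bincount(cluster_ids, minlength=n_clusters)[:, None]
    return sums / np.maximum(counts, 1)  # empty clusters stay at zero

def knn_context(anchor_xy, anchor_feats, k=2):
    """Blend each anchor with the mean of its k nearest anchors.

    Assumption: a fixed 50/50 blend stands in for learned message passing.
    """
    k = min(k, len(anchor_xy) - 1)
    if k <= 0:
        return anchor_feats.copy()
    d = np.linalg.norm(anchor_xy[:, None, :] - anchor_xy[None, :, :], axis=-1)
    np.fill_diagonal(d, np.inf)  # an anchor is not its own neighbor
    nbrs = np.argsort(d, axis=1)[:, :k]
    return 0.5 * anchor_feats + 0.5 * anchor_feats[nbrs].mean(axis=1)

def project_to_grid(anchor_xy, anchor_feats, origin, cell, grid_shape):
    """Max-pool anchor features into a fixed-resolution grid of shape (H, W, C)."""
    grid = np.full(grid_shape + (anchor_feats.shape[1],), -np.inf)
    ij = np.floor((anchor_xy - origin) / cell).astype(int)
    inside = np.all((ij >= 0) & (ij < np.array(grid_shape)), axis=1)
    np.maximum.at(grid, (ij[inside, 0], ij[inside, 1]), anchor_feats[inside])
    grid[~np.isfinite(grid)] = 0.0  # cells without anchors become zero
    return grid
```

Running project_to_grid once per height slice and stacking the results gives the stack of 2D slice maps described as this stage's output.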

Predict Tree Locations
We fuse the slice maps across height and directly predict tree-center locations on the ground plane.
Slice fusion: merge the stacked 2D slice maps from top to bottom using an efficient attention-style fusion with a learnable gate for each height band
Detection head: predict (1) a tree-center confidence map and (2) a sub-cell offset map on the ground grid
Center decoding: convert high-confidence grid cells + offsets into final 2D stem-center coordinates
Output: tree-center locations for the entire plot
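The center decoding step can be illustrated with a small peak-picking routine. This is a sketch under assumptions rather than the published decoder: a cell counts as a detection if its confidence passes a threshold and is the maximum of its 3x3 neighborhood, offsets are taken to be fractions of a cell measured from the cell's lower corner, and grid rows and columns are mapped to x and y respectively. The function name, threshold, and conventions are hypothetical.

```python
import numpy as np

def decode_centers(conf, offsets, origin, cell, threshold=0.5):
    """Convert a confidence map and an offset map into 2D center coordinates.

    conf:    (H, W) tree-center confidence per grid cell
    offsets: (H, W, 2) sub-cell offsets, assumed to lie in [0, 1)
    origin:  (2,) world coordinates of the grid's lower corner
    cell:    grid cell size in the same units as origin
    """
    H, W = conf.shape
    padded = np.pad(conf, 1, mode="constant", constant_values=-np.inf)
    # Stack the 3x3 neighborhood of every cell (including the cell itself)
    neigh = np.stack([padded[di:di + H, dj:dj + W]
                      for di in range(3) for dj in range(3)])
    # Keep cells that are local maxima and confident enough;
    # exact ties between neighbors would all be kept
    is_peak = (conf >= neigh.max(axis=0)) & (conf >= threshold)
    rows, cols = np.nonzero(is_peak)
    cells = np.stack([rows, cols], axis=1).astype(float)
    return origin + (cells + offsets[rows, cols]) * cell
```

The local-maximum test plays the role of non-maximum suppression, so a single stem that activates several adjacent cells is reported once.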

We use Boreal3D [2], a public synthetic benchmark of boreal mixed forests (spruce, pine, birch). For each forest plot, the dataset provides three platform-specific point clouds (TLS, MLS, and ULS) generated from the same underlying forest geometry. This design isolates platform effects (viewpoint, sampling density, and occlusion) from scene differences, making it suitable for evaluating cross-platform robustness. In total, the dataset contains 48,403 trees and includes plots of varying difficulty (Easy, Medium, TwoLayer). For training efficiency, each plot is further divided into three non-overlapping subplots.

Fig. 2: Sample plot from the Boreal3D dataset (Reproduced from Fig. 3 in [1]). Colors represent each point's z-coordinate before normalization, ranging from purple (low) to red (high).
