LaSoFly's Mathematical Path to Ground Truth
Learn how LaSoFly generates high-quality training data through blind labeling, the Hungarian Algorithm, and automatic consensus calculation.
Duration: approx. 18 minutes
Background
Automatic detection of objects...
- Forestry
- Urban greening
- Climate research
- Disaster prevention
Neural networks can do this excellently...
LaSoFly's Mission
LaSoFly is not...
LaSoFly is a specialized...
- Fast circle-based
- Multi-annotator
- Automatic quality
- Release only high-quality
The Core Principle
Circle-based Annotation
Tree crowns are approximately circular...
- Left click: Draw new circle
- Right-drag: Adjust radius
- Left double-click: Delete
- Mouse wheel: Enlarge
Multi-Annotator Workflow
Multiple annotators...
- Blind:
- Parallel:
- Independent:
Automatic Consensus Calculation
After all annotations...
- System compares
- Hungarian Algorithm
- IoU-Scores
- Quality Scores
- Admin reviews
Comparison to Other Tools
| Produkt | Stärke | Schwächen | Deshalb LaSoFly |
|---|---|---|---|
| Label Studio | Generic labeling tool | No domain-specific consensus | Consensus-Matching direkt eingebaut Label Studio is generic. LaSoFly specializes in consensus: multiple annotators label the same image blindly and independently. The system automatically compares annotations, matches objects via Hungarian Algorithm, and calculates quality scores. |
| SAM (Segment Anything) | Powerful AI segmentation | Not a labeling tool | Organisiert echte Ground Truth SAM auto-segments but isn't a labeling system. LaSoFly is a complete system: humans label blindly, LaSoFly aggregates multiple annotations into validated training labels. |
| CVAT | Multi-user, open-source | No circle consensus | Spezialisiert auf Circle-Consensus CVAT is generic multi-user without automatic consensus. LaSoFly has circle-specific annotation and automated consensus matching using the Hungarian Algorithm to match and score circles. |
| Roboflow | Cloud-based, easy | No quality levels | Mit Qualitätsbewertung Roboflow is cloud-based and simple but lacks quality levels and consensus validation. LaSoFly with multiple annotators and mathematical quality scoring ensures only high-consensus annotations are released for training. |
| Labelbox | Professional platform | No blind labeling | Mit Blind-Labeling Labelbox is professional but annotators see previous work. LaSoFly with blind labeling: annotators don't see each other's work, ensuring objective, unbiased consensus scores. |
| Scale AI | High-quality service | External, expensive | In-House Kontrolle Scale AI is outsourcing with high costs and external dependencies. LaSoFly is self-managed in-house with full control over data, processes, and quality standards. |
| Amazon SageMaker | Cloud integration | AWS lock-in | Keine Vendor-Lock-in SageMaker is tightly coupled to AWS and expensive. LaSoFly runs on your infrastructure, independent of cloud providers. Docker-based: flexible, portable, and cost-effective. |
| Hasty.ai | AI-assisted annotation | No blind labeling | 100% manuelle Kontrolle Hasty.ai uses AI assistants for semi-automatic annotation. LaSoFly is 100% manual: humans decide, not models. This provides true ground truth without AI bias, essential for training data. |
| VIA (VGG) | Simple, open-source | No consensus | Mit Consensus & Qualitätsmetriken VIA is simple and open-source but lacks multi-annotator management and consensus. LaSoFly with automatic consensus matching, inter-annotator agreement scoring, and quality levels ensures validated training-ready data. |
| BaKIM (Uni Bamberg) | Trained ensemble for UAV forest aerial photography | Trained on 1.6 cm UAV imagery, no public weights | Methodological role model, not a substitute on DOP20 BaKIM is a peer-reviewed ensemble (Mask2Former/ResNet/EfficientDet) for UAV imagery (1.6–1.8 cm GSD) on managed forests and city parks. Three simultaneous domain shifts to DOP20 residential areas (11× resolution gap, forest → residential scenes, forest genera → urban genera) make direct reuse infeasible. LaSoFly adopts the BaKIM ensemble architecture, out-of-distribution test design, and honest limit communication. |
| BAMFORESTS (DLR/Uni Bamberg) | 105 ha annotated UAV imagery, 27,160 tree crowns, CC BY 4.0 | A dataset, not a model — and all four AOIs are forest/park, no residential scenes | Benchmark reference, not training material for DOP20 BAMFORESTS is the peer-reviewed Bamberg Benchmark Forest Dataset (Troles et al. 2024, Remote Sensing 16). Four AOIs (Hain, Stadtwald, Tretzendorf-1/2) are all forest or park, dominated by Pinus/Fagus/Quercus. Hessian residential areas need their own annotated dataset in the target domain — that is exactly what LaSoFly builds. |
| LaSoFly | Specialized for objects | Only circles | Spezialisiert auf Consensus LaSoFly specializes in fast circle annotation with multi-annotator consensus. Blind labeling, automatic matching via Hungarian Algorithm, inter-annotator agreement scoring, and quality levels ensure only high-quality data is released for training. |
The Methodological Journey
How LaSoFly emerged: five clearly bounded iterations from deterministic image processing to a multi-modal AI architecture with quality-assured training data.
Heuristics
10 iterations NDVI · Otsu · Watershed
→ fails structurally
DeepForest
Pretrained RetinaNet (US forests)
→ 115 instead of 50 detections
LaSoFly
Own consensus platform · multi-annotator
→ reliable data foundation
TrustScore
Weighted consensus · Gold-Tiles
→ school gamification
nDSM
3D height data · RGB + height dual-stream
→ separates tree from shrub
Iteration 1 — Classical image processing. Ten iterations of an NDVI + Otsu + watershed pipeline on hessian DOP20 orthophotos. Works on industrial/rail sites with isolated trees, fails structurally on residential areas: at 40–60 actual trees the pipeline plateaus at 34 detections. The missing information is not parameter-level — it is not in the RGB image.
Iteration 2 — Pretrained AI model. DeepForest (RetinaNet, University of Florida, trained on US NEON forest plots) was the obvious shortcut. On the same residential test: 115 detections instead of ~50, false positives on house roofs, sandboxes, parked cars, max confidence 0.67 instead of the usual 0.85–0.95. Diagnosis: domain gap. Five other pretrained models (Ventura Urban, SAM2, Detectree2, ArcGIS Tree Detection, BAMFORESTS) showed the same pattern.
Iteration 3 — Own consensus platform. The unavoidable consequence: a dataset of our own — and a mechanism that secures its quality. Multi-annotator blind labeling, Hungarian matching for circle assignment, five quality levels Q0–Q4, audit trail, full DSGVO-compliant on-premise stack.
Iteration 4 — TrustScore against the elderberry dilemma. A pure consensus has a known weakness: if 25 children consistently mark the same elderberry shrub as a tree, the consensus is high but the annotation is wrong. Four trust stages (novice/experienced/verified/expert), gold-tile based promotion, weighted aggregation, Q4 only with expert involvement. A hundred-fold school consensus can no longer overrule an expert judgment.
Iteration 5 — The missing dimension: height data. From RGB alone you cannot tell a 12 m tree from a 3 m shrub — and for the downstream temperature prognosis that matters. The hessian land survey (HVBG) provides DGM1 (terrain) and DOM1 (surface) as open data since February 2022. The difference — the nDSM — gives the local object height per square meter. Used in a dual-stream DeepLabv3+ (ISPRS 2022) and as didactic warning in the school PWA.
Why LaSoFly is Better
- ✓ Domain-specific
- ✓ Blind Consensus
- ✓ No ML expertise
- ✓ Automatic quality
- ✓ Human-in-the-loop
Ready for the technical details?
To Quality Assurance