UAIV is a large-scale low-altitude multimodal dataset designed for urban intelligence and fine-grained governance analysis, with a strong focus on scene understanding, spatio-temporal reasoning, and physics-aware image restoration. Unlike conventional UAV datasets, UAIV emphasizes:
The dataset is built upon a self-operated UAV acquisition system, covering long-term, large-scale real urban environments with strong diversity in scene types (urban/rural/industrial/natural), illumination (day/night), weather (rain/snow/fog/haze), and flight altitudes.
UAV acquisition → multi-modal alignment → annotation → multi-task learning
Tasks: Scene classification, semantic/instance segmentation, object counting, OCR, environment understanding.
UAIV emphasizes context-rich and governance-oriented perception. Each sample corresponds to real urban governance scenarios: illegal construction, open burning, environmental pollution, infrastructure monitoring. Aligned multi-modal signals (RGB, IR, metadata) enable joint reasoning about object presence, scene semantics, and environmental status.
Tasks: Change detection, cross-time Re-ID.
UAIV provides strictly aligned multi-temporal data across different time spans, flight routes, and environmental conditions. For change detection, it distinguishes true semantic changes from appearance variations (illumination/weather). Cross-time Re-ID enables identity-consistent learning under viewpoint and altitude shifts.
Tasks: Rain/snow/fog removal, cross-weather translation, weather-aware degradation modeling.
Real-world paired/unpaired weather-degraded data (rain, snow, fog, haze) captured under consistent UAV trajectories. Preserves physical consistency: illumination variation, atmospheric scattering, sensor response differences. Enables recovery of intrinsic scene properties rather than only appearance translation.
UAIV has been deployed in real-world urban governance systems using Xuzhou (Jiangsu Province, China) as a pilot region, achieving full coverage of the Huaihai Economic Zone. Based on UAIV, we developed:
The system supports automated analysis for illegal construction detection, environmental anomaly monitoring, infrastructure status assessment, leading to significant improvement in semantic understanding under complex conditions, enhanced event detection accuracy, reduced manual inspection costs, and faster decision-making.
📢 User Feedback: “UAIV provides strong coverage of complex real-world scenarios, robust representation of multi-scale objects, and high-quality semantic consistency — a reliable foundation for large-scale model training and deployment.”
UAIV is a large-scale multimodal dataset designed for urban fine-grained governance and low-altitude remote sensing intelligence.
Standardized benchmarks and baselines for scene understanding, change detection, and image restoration will be released in future versions.
Current status: Partially released · Open subset (~30%) available now.
Full dataset will be released in future versions.
👉 ScienceDB (official release): https://www.scidb.cn/detail?dataSetId=203705443be44f7882bb9ddfd7d401da
👉 GitHub: https://github.com/JennyZhang0810/LowAltitude-Multimodal-Dataset
👉 Project Page: https://jennyzhang0810.github.io/LowAltitude-Multimodal-Dataset/
📌 The dataset is officially released on ScienceDB. Please use the ScienceDB link for data download.
This dataset was designed and led by the author, covering the full pipeline of data acquisition, organization, and annotation system design. We sincerely thank:
📄 MIT License · UAIV Project · Open for Research & Industrial Collaboration
© 2025 UAIV Team | Low-Altitude Urban Intelligence Dataset