Automatic building footprint extraction using remote sensing data within the City of Cape Town

dc.contributor.advisorAdeleke, Adedayo
dc.contributor.emailkhumelenimakungo5@gmail.comen_US
dc.contributor.postgraduateMakungo, Khumeleni
dc.date.accessioned2024-05-23T08:10:54Z
dc.date.available2024-05-23T08:10:54Z
dc.date.created2024-09-20
dc.date.issued2023-11-30
dc.descriptionDissertation (MSc Geoinformatics)—University of Pretoria, 2023.en_US
dc.description.abstractIn the City of Cape Town Metropolitan (CoCT), South Africa, GIS analysts currently delineate building footprints by digitizing aerial imagery and stereo-aerial images. This approach requires a lot of manual work. It takes a long time, is expensive, and inefficient. Recent studies have explored automatic and semi-automatic methods for extracting building footprints. Automatic extraction of building footprints from remotely sensed data is useful for urban planning, service delivery, and humanitarian efforts. However, there is currently no readily available method that can automatically extract footprints while considering the unique characteristics of the landscape, such as formal residential areas, industrial zones, and informal settlements. Therefore, the main goal of this research is to find a suitable and efficient spatial analysis method that accurately extracts building footprints of different sizes and shapes within the City of Cape Town, South Africa, using high-resolution aerial imagery and LiDAR-derived nDSM. To achieve this goal, a literature review is conducted to explore different building footprint extraction algorithms. The review identified Mask Regional Convolutional Neural Network (R-CNN) as an effective algorithm for instance segmentation and object extraction. Thus, an experiment is conducted to implement Mask R-CNN models that extract building footprints from aerial imagery and LiDAR-derived normalized Digital Surface Model (nDSM) for each of the three areas: formal residential, industrial, and informal settlements. The training focused on the Blaauwberg district, which includes formal residential areas, industrial zones, and informal settlements. Each trained model is separately tested on testing datasets for formal residential, industrial areas, and informal settlements. Evaluation metrics such as precision, recall, F1-score, and Average Precision (AP) score are calculated for each model to assess their performance in extracting building footprints from aerial imagery and LiDAR-derived nDSM in formal residential, industrial areas, and informal settlements. The Mask R-CNN algorithm proved to be very effective in extracting building footprints from high-resolution aerial imagery and LiDAR-derived nDSM in formal residential areas, achieving satisfactory precision, recall, F1-score, and AP score. In industrial areas, the Mask R-CNN algorithm is found to be highly effective in extracting footprints from LiDAR-derived nDSM. However, when extracting shacks in densely populated settlements, the Mask R-CNN algorithm performed inadequately, with an AP score of 0.28 and 0.31 from aerial imagery and LiDAR-derived nDSM, respectively. Nevertheless, the fusion of footprints extracted from LiDAR-derived nDSM and high-resolution aerial imagery improved the AP score to 0.52. Hence, this study concludes that the Mask R-CNN algorithm is highly effective in extracting building footprints in formal residential areas from both aerial imagery and LiDAR-derived nDSM, as well as industrial building footprints from LiDAR-derived nDSM. For optimal performance in informal settlements, the fusion of footprints extracted from aerial imagery and LiDAR-derived nDSM is necessary. Overall, these trained Mask R-CNN models demonstrated satisfactory performance. To enhance the existing 2D building footprint layer, these models can supplement by extracting building footprints. This updated layer will be more comprehensive and current. Various departments within the CoCT can utilize this layer for infrastructure planning, service delivery planning, land use planning, and change detection. For better performance, it is recommended to add more informal and industrial training datasets with sufficient roof variability. Fine-tuning the Mask R-CNN models will ensure accurate extraction of shacks and industrial building footprints by allowing the models to learn effectively.en_US
dc.description.availabilityUnrestricteden_US
dc.description.degreeMSc (Geoinformatics)en_US
dc.description.departmentGeography, Geoinformatics and Meteorologyen_US
dc.description.facultyFaculty of Natural and Agricultural Sciencesen_US
dc.description.sdgSDG-09: Industry, innovation and infrastructureen_US
dc.description.sdgSDG-11: Sustainable cities and communitiesen_US
dc.identifier.citation*en_US
dc.identifier.doi10.25403/UPresearchdata.25847125en_US
dc.identifier.otherS2024en_US
dc.identifier.urihttp://hdl.handle.net/2263/96189
dc.language.isoen_USen_US
dc.publisherUniversity of Pretoria
dc.rights© 2023 University of Pretoria. All rights reserved. The copyright in this work vests in the University of Pretoria. No part of this work may be reproduced or transmitted in any form or by any means, without the prior written permission of the University of Pretoria.
dc.subjectUCTDen_US
dc.subjectSustainable Development Goals (SDGs)en_US
dc.subjectDeep Learningen_US
dc.subjectMask R-CNNen_US
dc.subjectSegmentationen_US
dc.subjectHigh-Resolution aerial imageryen_US
dc.subjectLiDARen_US
dc.subjectNormalized Digital Surface Modelen_US
dc.subjectDetectionen_US
dc.subjectBuilding Footprinten_US
dc.titleAutomatic building footprint extraction using remote sensing data within the City of Cape Townen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Makungo_Automatic_2024.pdf
Size:
4.32 MB
Format:
Adobe Portable Document Format
Description:
Dissertation

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: