فصلنامه علمی- پژوهشی اطلاعات جغرافیایی « سپهر»

فصلنامه علمی- پژوهشی اطلاعات جغرافیایی « سپهر»

ارزیابی عملکرد بهینه‌سازها در مدل های یادگیری عمیق U-Net و ResNet-34 برای طبقه‌بندی دقیق کاربری اراضی از تصاویر هوایی

نوع مقاله : مقاله پژوهشی

نویسندگان
1 دانشجوی کارشناسی ارشد فتوگرامتری دانشکده مهندسی عمران، آب و محیط‌زیست، دانشگاه شهید بهشتی، تهران، ایران
2 دانشجوی کارشناسی مهندسی نقشه‌برداری دانشکده مهندسی عمران، آب و محیط‌زیست، دانشگاه شهید بهشتی، تهران، ایران
3 استادیاردانشکده مهندسی عمران، آب و محیط زیست، دانشگاه شهید بهشتی، تهران، ایران
4 دانشیار دانشکده مهندسی عمران، آب و محیط‌زیست، دانشگاه شهید بهشتی، تهران، ایران
چکیده
مطالعه کاربری اراضی و تغییرات آن در دوره‌های زمانی نقشی کلیدی در مدیریت منابع طبیعی و برنامه‌ریزی شهری ایفا می‌کند. یکی از روش‌های بهینه و مقرون به صرفه در این زمینه استفاده از الگوریتم‌ها و روش‌های طبقه‌بندی تصاویر هوایی و سنجش از دور است. طبقه‌بندی تصاویر هوایی با استفاده از مدل‌های یادگیری عمیق یکی از روش‌های پیشرفته در پردازش داده‌های مکانی محسوب می شود که بهبود صحت و کارایی آن به انتخاب مدل مناسب و تنظیم بهینه‌ فراپارامترها بستگی دارد. در این پژوهش، عملکرد دو مدل یادگیری عمیق U-Net و ResNet-34 در ترکیب با شش بهینه‌ساز مختلف شامل SGD، Adam، RMSprop، Adagrad، Nadam و AdamW برای طبقه‌بندی تصاویر هوایی بررسی شده است. نتایج نشان داد که ResNet-34 در تمامی معیارهای ارزیابی عملکرد بهتری نسبت به U-Net ارائه داده است. بالاترین صحت کلی در مدل ResNet-34 با بهینه‌ساز RMSprop برابر با 87.54% بود، درحالی‌که همین بهینه‌ساز در مدل U-Net صحت77.17% را به دست آورد. بهینه‌ساز Adam نیز در ResNet-34 صحت83.97% و در U-Net صحت 63.30% را ارائه داد. در مقابل، بهینه‌ساز Adagrad ضعیف‌ترین عملکرد را داشت و همگرایی کندی نشان داد. تحلیل معیارهای ضریب کاپا، امتیاز ژاکارد و امتیاز F1 تأیید کرد که بهینه‌سازهای تطبیقی مانند RMSprop و Adam تأثیر مثبتی بر بهبود صحت مدل‌ها دارند. نتایج این تحقیق نشان داد که انتخاب مدل مناسب و بهینه‌ساز کارآمد نقش مهمی در افزایش صحت و کاهش خطای مدل‌های یادگیری عمیق دارد. در مطالعات آینده، بررسی بهینه‌سازی ترکیبی روش‌های کلاسیک و تطبیقی و استفاده از مدل‌های پیشرفته‌تر مانند ++ U-Net و بهینه‌سازهای نوظهور نظیر Lion، AdaBelief و RAdam پیشنهاد می‌شود.
کلیدواژه‌ها

موضوعات


عنوان مقاله English

Evaluation of optimizers' performance in U-Net and ResNet-34 deep learning models for accurate land use classification from aerial images

نویسندگان English

Mahdi Farhangi 1
Mohammad mahdi Esfandiary 2
Asghar Milan 3
Saeid Sadeghian 4
1 Master's student in photogrammetry, Faculty of civil engineering, Water and environmental engineering, Shahid Beheshti University, Tehran, Iran
2 Bachelor's student in surveying engineering, Faculty of civil engineering, Water and environmental engineering, Shahid Beheshti University, Tehran, Iran
3 Assistant professor, Faculty of civil, Water and environmental engineering, Shahid Beheshti University. Tehran, Iran
4 Associate professor, Faculty of civil engineering, Water and environmental engineering, Shahid Beheshti University, Tehran, Iran
چکیده English

Extended Abstract
Introduction
Accurate land use classification using deep learning models has become a crucial aspect of remote sensing and geographic information systems (GIS), enabling precise monitoring and management of natural and urban environments. In recent years, convolutional neural networks (CNNs) have gained significant attention in image segmentation and classification tasks, particularly due to their ability to automatically extract hierarchical spatial features. Among these, U-Net and ResNet-34 have demonstrated remarkable success in remote sensing applications, with U-Net excelling in pixel-wise segmentation tasks and ResNet-34 offering deeper feature extraction capabilities. Despite the success of deep learning models, their performance is highly influenced by the choice of optimization algorithm. Optimizers play a fundamental role in adjusting model parameters, impacting convergence speed, computational efficiency, and generalization ability. Selecting an appropriate optimizer is crucial to ensuring that the model effectively learns complex spatial patterns while minimizing classification errors. However, the comparative impact of different optimizers on U-Net and ResNet-34 for land use classification remains underexplored. This study aims to evaluate the effectiveness of six optimizers—SGD, Adam, RMSprop, Adagrad, Nadam, and AdamW—when applied to U-Net and ResNet-34 for land use classification using high-resolution aerial imagery. The research primarily focuses on assessing how different optimization techniques influence classification accuracy, model stability, and computational efficiency in remote sensing applications. The findings provide insights into the most suitable optimizer for training deep learning models for land use classification, thereby assisting future research and practical implementations.
Materials and Methods
The dataset used in this study comprises aerial images from selected regions in Poland, which were obtained from high-resolution remote sensing data sources. These images were pre-processed and divided into three sets: training, validation, and testing. In total, 616 labeled images (256×256 pixels each) were utilized, with 462 images allocated for training, 154 for validation, and 15 for final testing. Given the limited dataset size, data augmentation techniques such as horizontal and vertical flipping, rotation at different angles, and brightness adjustments were applied to enhance model generalization and prevent overfitting. Both U-Net and ResNet-34 were implemented using Python, TensorFlow, and Keras, and the training was conducted in Google Colab's cloud environment to utilize GPU acceleration for efficient computation. Each model was trained using the six selected optimizers, with hyperparameters tuned to achieve optimal performance. Model evaluation was conducted based on multiple performance metrics, including Overall Accuracy, Kappa Coefficient, F1-score, Jaccard Index, Mean Absolute Error, and Allocation Discrepancy. To better understand the impact of each optimizer on model convergence and training stability, training and validation loss curves were analyzed, allowing for an assessment of optimization efficiency and the prevention of issues such as gradient vanishing or overfitting. Furthermore, the training epochs were scaled using a custom epoch compression technique, ensuring that the optimization progress was clearly visualized without excessive data compression.
Results and Discussion
The experimental results confirmed that ResNet-34 consistently outperformed U-Net across all evaluation metrics, emphasizing the importance of deeper architectures in enhancing classification accuracy. The superior performance of ResNet-34 can be attributed to its residual learning framework, which facilitates efficient feature propagation and mitigates gradient vanishing issues, making it particularly effective for remote sensing image classification. Among the evaluated optimizers, RMSprop and Adam yielded the highest classification accuracies, ensuring faster convergence and lower classification errors. The best accuracy achieved in ResNet-34 was 87.54% using RMSprop, whereas in U-Net, the highest accuracy was 77.17%. These results suggest that adaptive optimizers like RMSprop and Adam dynamically adjust learning rates, leading to more efficient weight updates and improved generalization. Conversely, Adagrad demonstrated the weakest performance, achieving 83.71% accuracy in ResNet-34 and 77.87% in U-Net, which can be attributed to its aggressive learning rate decay, causing stagnation in later training epochs. Similarly, SGD exhibited slower convergence, resulting in lower classification accuracy compared to adaptive methods. These findings align with previous research, which suggests that adaptive optimizers, particularly Adam and RMSprop, enhance deep learning model generalization and classification precision. Additionally, comparisons with prior studies revealed that many recent works utilize Mean Intersection over Union (MIoU) and Frequency-Weighted IoU (FWIoU) as key performance indicators. Although these metrics were not directly employed in the present study, their relevance in remote sensing classification underscores the importance of optimizing segmentation models for both pixel-wise accuracy and spatial consistency.
Conclusion
This study demonstrated that the choice of optimizer significantly impacts the accuracy, efficiency, and stability of U-Net and ResNet-34 models in land use classification. ResNet-34 consistently achieved higher accuracy compared to U-Net, reinforcing the advantage of deeper architectures in aerial image processing. Among the optimizers, RMSprop and Adam emerged as the most effective, delivering faster convergence rates, higher classification accuracies, and improved generalization. Meanwhile, SGD and Adagrad exhibited slower convergence and lower classification accuracy, indicating that their static learning rate mechanisms are less suitable for complex remote sensing datasets.
For future research, it is recommended to explore hybrid optimization strategies, where SGD is employed during early training stages to enhance generalization, followed by adaptive optimizers like Adam or RMSprop to accelerate convergence and stabilize learning. Moreover, further investigations into emerging optimization techniques, such as Lion, AdaBelief, and RAdam, could provide valuable insights into their potential applications in deep learning-based remote sensing tasks. Additionally, the integration of advanced architectures, such as U-Net++, UNet3+, and Transformer-based segmentation models, could further improve classification precision by capturing multi-scale spatial features more effectively. Expanding the dataset to include multi-resolution and multi-spectral aerial imagery would also enable a more comprehensive assessment of optimizer adaptability across diverse geospatial environments. Overall, these findings emphasize that careful selection of the optimizer is essential for maximizing deep learning model performance in remote sensing applications. By incorporating advanced optimization techniques and architectural innovations, future studies can further enhance the accuracy and robustness of land use classification models, ultimately contributing to more efficient environmental monitoring and urban planning.

کلیدواژه‌ها English

Land use classification
Aerial imagery
Optimizer
U-Net
ResNet
1- Audebert, N., Le Saux, B., & Lefèvre, S. (2018). Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks. ISPRS Journal of Photogrammetry and Remote Sensing, 140, 20–32. Retrieved from https://doi.org/10.1016/j.isprsjprs.2017.11.011
2- Baek, W. K., Lee, M. J., & Jung, H. S. (2024). Land Cover Classification From RGB and NIR Satellite Images Using Modified U-Net Model. IEEE Access. Retrieved from https://doi.org/10.1109/ACCESS.2024.3401416
3- Bera, S., & Shrivastava, V. K. (2020). Analysis of various optimizers on deep convolutional neural network model in the application of hyperspectral remote sensing image classification. International Journal of Remote Sensing, 41(7), 2664–2683. Retrieved from https://doi.org/10.1080/01431161.2019.1694725
4- Boguszewski, A., Batorski, D., Ziemba-Jankowska, N., Dziedzic, T., & Zambrzycka, A. (2020). LandCover.ai: Dataset for Automatic Mapping of Buildings, Woodlands, Water and Roads from Aerial Imagery.
5- Chen, B., Xia, M., & Huang, J. (2021). Mfanet: A multi-level feature aggregation network for semantic segmentation of land cover. Remote Sensing. Retrieved from https://doi.org/10.3390/rs13040731
6- Clark, A., Phinn, S., & Scarth, P. (2023). Optimised U-Net for Land Use–Land Cover Classification Using Aerial Photography. PFG – Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 91(2), 125–147. Retrieved from https://doi.org/10.1007/s41064-023-00233-3
7- Digra, M., Dhir, R., & Sharma, N. (2022). Land use land cover classifcation of remote sensing images based on the deep learning approaches: a statistical analysis and review. Arabian Journal of Geosciences, 15(10). Retrieved from https://doi.org/10.1007/s12517-022-10246-8
8- Ding, X., Wang, Z., Peng, S., Shao, X., & Deng, R. (2024). Research on Land Use and Land Cover Information Extraction Methods for Remote Sensing Images Based on Improved Convolutional Neural Networks. ISPRS International Journal of Geo-Information, 13(11). Retrieved from https://doi.org/10.3390/ijgi13110386
9- Dozat, T. (2016). Incorporating Nesterov Momentum into Adam. ICLR Workshop, (1), 2013–2016.
10- Duchi, J., Elad, H., & Yoram, S. (2012). Adaptive Subgradient Methods for Online Learning and Stochastic Optimization. In Journal ofMachine Learning Research 12 (2011) 2121-2159 (Vol. 37, pp. 2122–2159). IEEE.
11- El fallah, K., El kharrim, K., & Belghyti, D. (2024). Land use land cover change detection by using remote sensing in Meknes province, Morocco with an indicator based (DPSIR) approach. Vegetos. Retrieved from https://doi.org/10.1007/s42535-024-01110-z
12- Fan, X., Ding, W., Li, X., Li, T., Hu, B., & Shi, Y. (2024). An Improved U-Net Infrared Small Target Detection Algorithm Based on Multi-Scale Feature Decomposition and Fusion and Attention Mechanism. Sensors, 24(13), 4227. Retrieved from https://doi.org/10.3390/s24134227
13- Ghaznavi, A., Saberioon, M., Brom, J., & Itzerott, S. (2024). Comparative performance analysis of simple U-Net, residual attention U-Net, and VGG16-U-Net for inventory inland water bodies. Applied Computing and Geosciences, 21(July 2023), 100150. Retrieved from https://doi.org/10.1016/j.acags.2023.100150
14- Giang, T. L., Dang, K. B., Le, Q. T., Nguyen, V. G., Tong, S. S., & Pham, V. M. (2020). U-net convolutional networks for mining land cover classification based on high-resolution UAV imagery. IEEE Access, 8, 186257–186273. Retrieved from https://doi.org/10.1109/ACCESS.2020.3030112
15- Hassan, E., Shams, M. Y., Hikal, N. A., & Elmougy, S. (2023). The effect of choosing optimizer algorithms to improve computer vision tasks : a comparative study. Multimedia Tools and Applications.
16- Hinton, G., Srivastava, N., & Swersky, S. (2012). Lecture 6a Overview of mini-batch gradient descent. In Neural Networks for Machine Learning (Vol. 4, pp. 1–31).
17- Kampffmeyer, M., Salberg, A. B., & Jenssen, R. (2016). Semantic Segmentation of Small Objects and Modeling of Uncertainty in Urban Remote Sensing Images Using Deep Convolutional Neural Networks. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 680–688. Retrieved from https://doi.org/10.1109/CVPRW.2016.90
18- Kingma, D. P., & Ba, J. (2014). Adam: A Method for Stochastic Optimization. 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings.
19- Koringa, H., & Patel, M. (2024). Automated, Quick, and Precise Building Extraction From Aerial Images Using Ll-Unet Model. Radioelectronic and Computer Systems, (2(110)), 41–51. Retrieved from https://doi.org/10.32620/reks.2024.2.04
20- Lakshminarayana, B., & Rao, K. G. (2010). Artificial neural networks in spectral-spatial landuse classification. ISH Journal of Hydraulic Engineering, 16, 64–73. Retrieved from https://doi.org/10.1080/09715010.2010.10515016
21- Li, Z., Zhang, H., Lu, F., Xue, R., Yang, G., & Zhang, L. (2022). Breaking the resolution barrier: A low-to-high network for large-scale high-resolution land-cover mapping using low-resolution labels. ISPRS Journal of Photogrammetry and Remote Sensing. Retrieved from https://doi.org/10.1016/j.isprsjprs.2022.08.008
22- Loshchilov, I., & Hutter, F. (2019). Decoupled weight decay regularization. In 7th International Conference on Learning Representations, ICLR 2019.
23- Naanjam, R., & Farnood Ahmadi, F. (2024). An improved self-training network for building and road extraction in urban areas by integrating optical and radar remotely sensed data. Earth Science Informatics, 17(3), 2159–2176. Retrieved from https://doi.org/10.1007/s12145-024-01270-1
24- Pagliardini, M., Ablin, P., & Grangier, D. (2024). The AdEMAMix Optimizer: Better, Faster, Older, 1–42.
25- Pontius, R. G., & Millones, M. (2011). Death to Kappa: Birth of quantity disagreement and allocation disagreement for accuracy assessment. International Journal of Remote Sensing, 32(15), 4407–4429. Retrieved from https://doi.org/10.1080/01431161.2011.552923
26- Robbins, H., & Monro, S. (1951). A Stochastic Approximation Method. The Annals OfMathematical Statistics, 22(3), 400–407.
27- Saeedizadeh, N., Jalali, S. M. J., Khan, B., Kebria, P. M., & Mohamed, S. (2024). A new optimization approach based on neural architecture search to enhance deep U-Net for efficient road segmentation. Knowledge-Based Systems, 296(December 2023), 111966. Retrieved from https://doi.org/10.1016/j.knosys.2024.111966
28- Strahler, A. H. (1980). The use of prior probabilities in maximum likelihood classification of remotely sensed data. Remote Sensing of Environment, 10(2), 135–163. Retrieved from https://doi.org/10.1016/0034-4257(80)90011-5
29- Talukdar, S., Singha, P., Shahfahad, Mahato, S., Praveen, B., & Rahman, A. (2020). Dynamics of ecosystem services (ESs) in response to land use land cover (LU/LC) changes in the lower Gangetic plain of India. Ecological Indicators, 112(September 2019), 106121. Retrieved from https://doi.org/10.1016/j.ecolind.2020.106121
30- Vasavi, S., Likhitha, A. L., Premchand, V. S., & Yasaswini, J. (2024). Object Classification by Effective Segmentation of Tree Canopy Using U-Net Model. Journal of Advances in Information Technology, 15(3), 422–434. Retrieved from https://doi.org/10.12720/jait.15.3.422-434
31- Weng, L., Pang, K., Xia, M., Lin, H., Qian, M., & Zhu, C. (2023). Sgformer: A Local and Global Features Coupling Network for Semantic Segmentation of Land Cover. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. Retrieved from https://doi.org/10.1109/JSTARS.2023.3295729
32- Xu, W., Deng, X., Guo, S., Chen, J., Sun, L., Zheng, X., … Wang, X. (2020). High-resolution u-net: Preserving image details for cultivated land extraction. Sensors (Switzerland), 20(15), 1–23. Retrieved from https://doi.org/10.3390/s20154064
33- Zeiler, M. D. (2012). ADADELTA: An Adaptive Learning Rate Method.
34- Zhang, C., Sargent, I., Pan, X., Li, H., Gardiner, A., Hare, J., & Atkinson, P. M. (2019). Joint Deep Learning for land cover and land use classification. Remote Sensing of Environment, 221(May 2018), 173–187. Retrieved from https://doi.org/10.1016/j.rse.2018.11.014