Application of deep learning neural networks for solving the problem of forest fire segmentation on satellite images
The aim of the work is to develop algorithms for the semantic segmentation of forest fire areas on satellite images. Despite the active development of computer vision algorithms, today there are a number of problems in this area that have not been fully solved and do not provide the required accuracy of work. Therefore, today there is a need for the development of algorithms and software that provide high quality image segmentation. The analysis of existing algorithms for image segmentation was carried out and it was revealed that the most suitable algorithms for solving this problem are deep learning neural networks. The machine learning libraries Keras, TensorFlow and PyTorch were reviewed. The library performance was tested on a set of 60,000 images. In the process of research, the PyTorch library showed the best results, so it was decided to use it to develop algorithms. Convolutional neural network consisting of 20 layers has been developed. The neural network was trained using a generated set of 50 images of Earth remote sensing with a resolution of 8000x8000. The set of images was selected from the Landsat 8 satellite database. The main selection criteria concerned the size of the scene, as well as the number of images taken by the satellite during the day. The generated set of images contains data of the following classes: forest fire (red); burnt-out area (black); smoke from a fire (white); reservoirs (blue); forest (green). For a set of images, augmentation was performed, that is, modification of the data for training. Using this method improves the generalizing ability of the neural network, adds new training examples that the neural network has not yet seen and does not provide an opportunity to retrain. As augmentation, the following modifications were performed: image rotation by an arbitrary degree; compression along the axes; stretching along the axes; mirroring along the axes; Gaussian Blur; change in brightness and contrast. The training included 50 epochs, each of which contains 2000 iterations. When choosing an algorithm for learning a neural network, the following algorithms were considered: Adam - adaptive moment estimation; Adagrad - adaptive gradient; RMSProp - gradient descent with momentum. During the research, the best results were obtained using the Adam algorithm. A comparison of the results of the proposed neural network with some analogues is presented. A comparative study of the accuracy of the segmentation algorithms was carried out on a set of reference and test images subjected to noise distortions. To compare the segmentation results, the boundaries of the segmented objects were used, which is a set of points that do not depend on the shading of the segments. To measure the segmentation results, two metrics were used: mean and Hausdorff distance. The study of the quality of work of a number of algorithms showed that they behave unstably when the image is noisy and blurred. Thus, we can conclude that it is advisable to clean the image from noise and increase its clarity before the segmentation procedure. The accuracy of the developed neural network is 94.22%. For the classes of objects, the accuracy was the following: fire - 93.6%; burnt-out area - 95.7; smoke - 87.6; reservoirs - 96.9; forest - 97.3. This result is the best in comparison with the presented analogs. However, the developed system is somewhat inferior to some analogues in terms of such indicators as fire, burnt out area, smoke. However, in such classes as forests, reservoirs, it wins.
Keywords
neural networks,
semantic segmentation,
computer vision,
image processing,
images of the Earth's surfaceAuthors
Vik Ksenia V. | Tomsk Polytechnic University | kvv11@tpu.ru |
Druki Alexey A. | Tomsk Polytechnic University | druki@tpu.ru |
Grigoriev Dmitriy S. | Tomsk Polytechnic University | trygx@tpu.ru |
Spitsyn Vladimir G. | Tomsk Polytechnic University | spvg@tpu.ru |
Всего: 4
References
Bundzel M., Hashimoto S. Object identification in dynamic images based on the memory-prediction theory of brain function // Journal of Intelligent Learning Systems and Applications. 2010. V. 2, № 4. P. 212-220.
Tawfiq A., Ahmed J. Object detection and recognition by usingenhanced Speeded Up Robust Feature // International Journal of Computer Science and Network Security. 2016. V. 16, № 4. P. 66-71.
Park S., Yoo J.H. Realtime face recognition with SIFT based local feature points for mobile devices // The 1st International Conference of Artificial Intelligence, Modelling and Simulation (AIMS 13). Malaysia, 2013. P. 304-308.
Mammeri A., Boukerche A., Khiari E. MSER based text detection and communication algorithm for autonomous vehicles // IEEE Symposium of Computers and Communication. Messina, Italy. 2016. P. 456-460.
Tore V., Chawan P.M. FAST Clustering based feature subset selection algorithm for high dimensional data // International Journal of Computer Science and Mobile Computing. 2016. V. 5, № 7. P. 234-238
Dalal N., Triggs B. Histograms of oriented gradients for human detection // IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR). San Diego, USA. 2005. V. 1. P. 886-893.
Mohey D.E. Enhancement bagofwords model for solving the challenges of sentiment analysis // International Journal of Advanced Computer Science and Applications. 2016. V. 7, № 1. P. 244-251.
Kecman V., Melki G. Fast online algorithms for Support Vector Machines // IEEE South East Conference. Virginia, USA. 2016. P. 26-31.
Le Cun Y., Bengio Y. Convolutional networks for images, speech and time series // The handbook of brain theory and neural networks. 1998. V. 7, № 1. P. 255-258.
Tensor Flow: the Python Deep Learning library. URL: https://www.tensorflow.org (accessed: 02.12.2020).
Deep Learning Frameworks: a Survey of TensorFlow, Torch, Theano, Caffe, Neon, and the IBM Machine Learning Stack. URL: https://www.microway.com/hpc-tech-tips/deep-leaming-frameworks-survey-tensorflow-torch-theano-caffe-neon-ibm-machine-learning-stack (accessed: 02.12.20).
Ronneberger O., Fischer P., Brox T. U-Net: Convolutional networks for biomedical image segmentation // International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI-2015). 2015. V. 93. P. 234-241.
Paszke A., Chaurasia A., Kim S., Culurciello E. ENet: a Deep Neural Network Architecture for Real-Time Semantic Segmentation // 5th International Conference on Learning Representations. 2017. Toulon, France. 2017. P. 1-10.
Badrinarayanan V., Kendall A., Cipolla R. SegNet: a deep convolutional encoder-decoder architecture for image segmentation // IEEE transactions on pattern analysis and machine intelligence. 2017. V. 39, № 12. P. 54-62.
Sharing Earth Observation Resourses. URL: https://directory.eoportal.org/web/eoportal/satellite-missions/l/landsat-8-ldcm (accessed: 02.12.20).
Kingma D.P. Adam: a Method for Stochastic Optimization // International Conference of Learning Representations. San Diego, USA. 2015. P. 1-13.
Nguyen V., Kim H., Jun S. A Study on Real-Time Detection Method of Lane and Vehicle for Lane Change Assistant System Using Vision System on Highway // Engineering Science and Technology. 2018. V. 21. P. 822-833.
El-Khatib S.A. Image segmentation using a mixed and exponential particle-based algorithm // Computer Science and Cybernetics. 2015. № 1. P. 126-133.
Koltsov P.P. The use of metrics in a comparative study of the quality of work of image segmentation algorithms // Informatics and Its Applications. 2011. № 5. P. 53-63.
Bundzel, M. & Hashimoto, S. (2010) Object identification in dynamic images based on the memory-prediction theory of brain function. Journal of Intelligent Learning Systems and Applications. 2(4). pp. 212-220. DOI: 10.4236/jilsa.2010.24024
Tawfiq, A. & Ahmed, J. (2016) Object detection and recognition by usingenhanced Speeded Up Robust Feature. International Journal of Computer Science and Network Security. 16(4). pp. 66-71.
Park, S. & Yoo, J.H. (2013) Realtime face recognition with SIFT based local feature points for mobile devices. The 1st Interna tional Conference of Artificial Intelligence, Modelling and Simulation (AIMS 13). pp. 304-308. DOI: 10.1109/AIMS.2013.56
Mammeri, A., Boukerche, A. & Khiari, E. (2016) MSER based text detection and communication algorithm for autonomous vehicles. IEEE Symposium of Computers and Communication. pp. 456-460. DOI: 10.1109/ISCC.2016.7543902
Tore, V. & Chawan, P.M. (2016) FAST Clustering based feature subset selection algorithm for high dimensional data. Interna tional Journal of Computer Science and Mobile Computing. 5(7). pp. 234-238. DOI: 10.1109/TKDE.2011.181
Dalal, N. & Triggs, B. (2005) Histograms of oriented gradients for human detection. IEEE Computer Society Conference on Com puter Vision and Pattern Recognition (CVPR). 1. pp. 886-893. DOI: 10.1109/CVPR.2005.177
Mohey, D.E. (2016) Enhancement bag-of-words model for solving the challenges of sentiment analysis. International Journal of Advanced Computer Science and Applications. 7(1). pp. 244-251. DOI: 10.14569/IJACSA.2016.070134
Kecman V. & Melki G. (2016) Fast online algorithms for Support Vector Machines. IEEE South East Conference. pp. 26-31. DOI: 10.1109/SECON.2016.7506733
Le Cun, Y. & Bengio, Y. (1998) Convolutional networks for images, speech and time series. The Handbook of Brain Theory and Neural Networks. 7(1). pp. 255-258.
TensorFlow: The Python Deep Learning library. [Online] Available from: https://www.tensorflow.org (Accessed: 2nd December 2020).
Deep Learning Frameworks. (n.d.) A Survey of TensorFlow, Torch, Theano, Caffe, Neon, and the IBM Machine Learning Stack. [Online] Available from: https://www.microway.com/hpc-tech-tips/deep-learning-frameworks-survey-tensorflow-torch-theano-caffe-neon-ibm-machine-learning-stack (Accessed: 2nd December 2020).
Ronneberger, O., Fischer, P. & Brox, T. (2015) U-Net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI-2015). 93. pp. 234-241. DOI: 10.1007/978-3-319-24574-4_28
Paszke, A., Chaurasia, A., Kim, S. & Culurciello, E. (2017) ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation. 5-th International Conference on Learning Representations. pp. 1 -10.
Badrinarayanan, V., Kendall, A. & Cipolla, R. (2017) SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE transactions on pattern analysis and machine intelligence. 39(12). pp. 54-62. DOI: 10.1109/TPAMI.2016.2644615
Sharing Earth Observation Resourses. (n.d.) [Online] Available from: https://directory.eoportal.org/web/eoportal/satellite-missions/l/landsat-8-ldcm (Accessed: 2nd December 2020).
Kingma, D.P. (2015) Adam: a Method for Stochastic Optimization. International Conference of Learning Representations. pp. 1-13.
Nguyen, V., Kim, H. & Jun, S. (2018) A Study on Real-Time Detection Method of Lane and Vehicle for Lane Change Assistant System Using Vision System on Highway. Engineering Science and Technology. 21. pp. 822-833. DOI: 10.1016/j.jestch.2018.06.006
El-Khatib, S.A. Image segmentation using a mixed and exponential particle-based algorithm. Computer Science and Cybernetics. 1. pp. 126-133.
Koltsov, P.P. (2011) The use of metrics in a comparative study of the quality of work of image segmentation algorithms. Informatics and its Applications. 5. pp. 53-63.