WEB OF SCIENCE
SCOPUS
Machine-learning algorithms have attracted much attention in a wide range of areas. Because machine-learning algorithms can extract patterns from data automatically, it is possible to model the input-output function of a system using a machine-learning algorithm. In cognitive radio networks, machine-learning-based spectrum sensing schemes depend on the positioning of the nodes. During building extraction tasks based on high-resolution aerial images, large-scale datasets are required, with customized architecture necessary to process the datasets. Moreover, when there is a domain gap between the training and test data, trained models fail to segment objects for unseen images. In this Theses, the challenges facing machine-learning-based systems and solutions are discussed.
The first application is for spectrum sensing in cognitive radio networks. The hidden primary user (PU) problem, however, is a critical issue in cognitive radio networks because spectrum sensing nodes (SNs) can misclassify the degree of spectrum occupancy. To cope with this, machine-learning-based cooperative spectrum sensing schemes (CSSs) have been proposed. CSSs that do not consider node placement, however, continue to be affected by the hidden PU problem. In this paper, we present a method by which to place SNs to guarantee the performance of machine-learning-based CSSs. We verify that the hidden PU problem causes some overlap in the data distribution, which deteriorates of the spectrum sensing ability. Based on Kullback-Leibler divergence, analytical expressions for the spectrum-sensing coverage of a single SN are derived. We then propose a strategy for placing a few SNs to cover the entire area of the PU and prove the feasibility of the proposed method experimentally.
The second application is related to deep-learning architecture for semantic segmentation from high-resolution aerial images. Extracting manufactured features such as buildings, roads, and water from aerial images is critical for urban planning, traffic management, and industrial development. Recently, convolutional neural networks (CNNs) have become a popular strategy to use to capture contextual features automatically.
We design a multi-object segmentation system and propose an algorithm that utilizes pyramid pooling layers to improve U-Net. Test results indicate that U-Net with pyramid pooling layers, referred to as UNetPPL, learns fine-grained classification maps and outperforms other algorithms, specifically FCN and U-Net, achieving a mean intersection of union (mIOU) value of 79.52 and pixel accuracy of 87.61% for four types of objects (buildings, roads, water, and background).
The final application is domain adaptation for building extraction. CNN-based semantic segmentation models garnered much attention in relation to remote sensing and achieved remarkable performance during the extraction of buildings from high -resolution aerial images. However, limited generalization for unseen images remains. When there is a domain gap between the training and test datasets, CNN-based segmentation models that are trained using a training dataset fail to segment buildings in the test dataset. In this paper, we propose domain-adaptive transfer attack (DATA)-based segmentation networks for building extraction from aerial images. The proposed system utilizes jointly both domain transfers and adversarial attacks. Based on the DATA scheme, the distribution of input images can be shifted to that of target images while maintaining the semantic spaces. This reduces the domain gap and expands the generalization of the segmentation model. From test results with two different datasets (i.e., the Inria aerial image labeling dataset and the Massachusetts building dataset), it is verified that the performance when extracting buildings is improved to 0.16% and 7.12%, respectively.