A Graph Convolutional Network-Based Sensitive Information Detection Algorithm

Complexity 2021:1-8 (2021)
  Copy   BIBTEX

Abstract

In the field of natural language processing, the task of sensitive information detection refers to the procedure of identifying sensitive words for given documents. The majority of existing detection methods are based on the sensitive-word tree, which is usually constructed via the common prefixes of different sensitive words from the given corpus. Yet, these traditional methods suffer from a couple of drawbacks, such as poor generalization and low efficiency. For improvement purposes, this paper proposes a novel self-attention-based detection algorithm using the implementation of graph convolutional network. The main contribution is twofold. Firstly, we consider a weighted GCN to better encode word pairs from the given documents and corpus. Secondly, a simple, yet effective, attention mechanism is introduced to further integrate the interaction among candidate words and corpus. Experimental results from the benchmarking dataset of THUC news demonstrate a promising detection performance, compared to existing work.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 92,907

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Overlapping Community Detection in Dynamic Networks.Nathan Aston - 2014 - Journal of Software Engineering and Applications 7:872-882.
Human Skin Color Detection Using Neural Networks.Arvin Agah & Mohammadreza Hajiarbabi - 2015 - Journal of Intelligent Systems 24 (4):425-436.

Analytics

Added to PP
2021-03-25

Downloads
12 (#1,110,155)

6 months
10 (#306,562)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Ying Liu
University of Glasgow

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references