A reconstruction method for cross-cut shredded documents based on the extreme learning machine algorithm
dc.cclicence | CC BY | en |
dc.contributor.author | Zhang, Zhenghui | |
dc.contributor.author | Zou, Juan | |
dc.contributor.author | Yang, Shengxiang | |
dc.contributor.author | Zheng, Jinhua | |
dc.contributor.author | Gong, Dunwei | |
dc.contributor.author | Pei, Tingrui | |
dc.date.acceptance | 2022-01-16 | |
dc.date.accessioned | 2022-08-09T15:55:16Z | |
dc.date.available | 2022-08-09T15:55:16Z | |
dc.date.issued | 2022-07-24 | |
dc.description | The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link. | en |
dc.description.abstract | Reconstruction of cross-cut shredded text documents (RCCSTD) has important applications for information security and judicial evidence collection. The traditional method of manual construction is a very time-consuming task, so the use of computer-assisted efficient reconstruction is a crucial research topic. Fragment consensus information extraction and fragment pair compatibility measurement are two fundamental processes in RCCSTD. Due to the limitations of the existing classical methods of these two steps, only documents with specific structures or characteristics can be spliced, and pairing error is larger when the cutting is more fine-grained. In order to reconstruct the fragments more effectively, this paper improves the extraction method for consensus information and constructs a new global pairwise compatibility measurement model based on the extreme learning machine algorithm. The purpose of the algorithm’s design is to exploit all available information and computationally suggest matches to increase the algorithm’s ability to discriminate between data in various complex situations, then find the best neighbor of each fragment for splicing according to pairwise compatibility. The overall performance of our approach in several practical experiments is illustrated. The results indicate that the matching accuracy of the proposed algorithm is better than that of the previously published classical algorithms and still ensures a higher matching accuracy in the noisy datasets, which can provide a feasible method for RCCSTD intelligent systems in real scenarios. | en |
dc.funder | Other external funder (please detail below) | en |
dc.funder.other | National Natural Science Foundation of China | en |
dc.identifier.citation | Zhang, Z., Zou, J. Yang, S., Zheng, J., Gong, D. and Pei. T. (2022) A reconstruction method for cross-cut shredded documents based on the extreme learning machine algorithm. Soft Computing, 26, pp. 12851–12862 | en |
dc.identifier.doi | https://doi.org/10.1007/s00500-022-07311-5 | |
dc.identifier.issn | 1433-7479 | |
dc.identifier.uri | https://hdl.handle.net/2086/22096 | |
dc.language.iso | en_US | en |
dc.peerreviewed | Yes | en |
dc.projectid | 61876164, 61673331, 61772178 | en |
dc.publisher | Springer | en |
dc.researchinstitute | Institute of Artificial Intelligence (IAI) | en |
dc.subject | Reconstruction of cross-cut shredded text documents (RCCSTD) | en |
dc.subject | Extreme learning machine algorithm | en |
dc.subject | Consensus information | en |
dc.subject | Pairwise compatibility measurement model | en |
dc.title | A reconstruction method for cross-cut shredded documents based on the extreme learning machine algorithm | en |
dc.type | Article | en |