A reconstruction method for cross-cut shredded documents based on the extreme learning machine algorithm

dc.cclicenceCC BYen
dc.contributor.authorZhang, Zhenghui
dc.contributor.authorZou, Juan
dc.contributor.authorYang, Shengxiang
dc.contributor.authorZheng, Jinhua
dc.contributor.authorGong, Dunwei
dc.contributor.authorPei, Tingrui
dc.date.acceptance2022-01-16
dc.date.accessioned2022-08-09T15:55:16Z
dc.date.available2022-08-09T15:55:16Z
dc.date.issued2022-07-24
dc.descriptionThe file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.en
dc.description.abstractReconstruction of cross-cut shredded text documents (RCCSTD) has important applications for information security and judicial evidence collection. The traditional method of manual construction is a very time-consuming task, so the use of computer-assisted efficient reconstruction is a crucial research topic. Fragment consensus information extraction and fragment pair compatibility measurement are two fundamental processes in RCCSTD. Due to the limitations of the existing classical methods of these two steps, only documents with specific structures or characteristics can be spliced, and pairing error is larger when the cutting is more fine-grained. In order to reconstruct the fragments more effectively, this paper improves the extraction method for consensus information and constructs a new global pairwise compatibility measurement model based on the extreme learning machine algorithm. The purpose of the algorithm’s design is to exploit all available information and computationally suggest matches to increase the algorithm’s ability to discriminate between data in various complex situations, then find the best neighbor of each fragment for splicing according to pairwise compatibility. The overall performance of our approach in several practical experiments is illustrated. The results indicate that the matching accuracy of the proposed algorithm is better than that of the previously published classical algorithms and still ensures a higher matching accuracy in the noisy datasets, which can provide a feasible method for RCCSTD intelligent systems in real scenarios.en
dc.funderOther external funder (please detail below)en
dc.funder.otherNational Natural Science Foundation of Chinaen
dc.identifier.citationZhang, Z., Zou, J. Yang, S., Zheng, J., Gong, D. and Pei. T. (2022) A reconstruction method for cross-cut shredded documents based on the extreme learning machine algorithm. Soft Computing, 26, pp. 12851–12862en
dc.identifier.doihttps://doi.org/10.1007/s00500-022-07311-5
dc.identifier.issn1433-7479
dc.identifier.urihttps://hdl.handle.net/2086/22096
dc.language.isoen_USen
dc.peerreviewedYesen
dc.projectid61876164, 61673331, 61772178en
dc.publisherSpringeren
dc.researchinstituteInstitute of Artificial Intelligence (IAI)en
dc.subjectReconstruction of cross-cut shredded text documents (RCCSTD)en
dc.subjectExtreme learning machine algorithmen
dc.subjectConsensus informationen
dc.subjectPairwise compatibility measurement modelen
dc.titleA reconstruction method for cross-cut shredded documents based on the extreme learning machine algorithmen
dc.typeArticleen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SOCO22.pdf
Size:
3.77 MB
Format:
Adobe Portable Document Format
Description:
Main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
4.2 KB
Format:
Item-specific license agreed upon to submission
Description: