Abstract: This paper presents an experimental analysis of document image retrieval using a multi-domain intelligent system. Specifically, it discusses the combination of three different domains extracted from the same document image: layout, logo and signature. The proposed method analyzes every individual decision provided by the multi-domain system so that, in the training phase, a new sample classified with a confidence dissimilar to that of the previously trained samples is used to update the system. Dynamic Time Warping (DTW), Euclidean distance and cosine similarity are used for the analysis of layout, logo and signature, respectively. Finally, a weighted combination of the individual decisions is considered. The experimental results, carried out on 30 rotated forms belonging to 13 different companies, demonstrate the superiority of the proposed approach over single-domain retrieval systems in terms of the ANR performance index, which is used to evaluate the multi-domain system.
Keywords: Document Management System, Document Image Retrieval, Multi-Expert Intelligent System, Feedback-Based Strategy, Instance Selection
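
The weighted combination of per-domain decisions mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The feature representation (plain numeric lists), the distance-to-similarity mapping `1 / (1 + d)`, the equal default weights, and the names `fused_score` and `rank` are all assumptions made for illustration. Only the pairing of measures with domains (DTW for layout, Euclidean distance for logo, cosine similarity for signature) comes from the text.

```python
import math


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two 1-D sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three admissible warping steps.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def cosine_similarity(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def to_similarity(distance):
    """Assumed mapping from a distance in [0, inf) to a similarity in (0, 1]."""
    return 1.0 / (1.0 + distance)


def fused_score(query, candidate, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Weighted sum of the three per-domain similarities (weights are hypothetical)."""
    w_layout, w_logo, w_signature = weights
    s_layout = to_similarity(dtw_distance(query["layout"], candidate["layout"]))
    s_logo = to_similarity(euclidean_distance(query["logo"], candidate["logo"]))
    # Assumes non-negative signature features, so the cosine lies in [0, 1].
    s_signature = cosine_similarity(query["signature"], candidate["signature"])
    return w_layout * s_layout + w_logo * s_logo + w_signature * s_signature


def rank(query, database, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [(name, fused_score(query, doc, weights)) for name, doc in database.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)
```

Calling `rank(query, database)` with a dictionary of stored documents returns the candidates ordered by fused score. A single-domain system corresponds to a weight vector such as `(1, 0, 0)`, which makes it straightforward to compare single-domain and multi-domain rankings on the same data.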