In the current popular visual tasks, one single model is usually used to output the final results. But no model is perfect, in this paper, we propose a simple and general multi-model method. To combine the advantages of multiple models, we design a familiarity prediction network to output the model's familiarity of images, then select the optimal model based on the familiarity value. Since the loss value is a single value, the familiarity value of any task can be reflected in the loss value, so the output of the familiarity prediction network can be regarded as an estimate of the loss value. The accuracy of the multi-model exceeds any single model that composes the multi-model. By limiting the number of feature layers input into the familiarity network, the sacrifice of computation and detection speed is acceptable. Our method is general and task-agnostic, it not only performs well on classification tasks but also on object detection tasks and other vision tasks.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.