Abstract
This paper considers on a number of issues that arise when a trainable machine vision system learns directly from humans. We contrast this to the ldquonormalrdquo situation where machine learning (ML) techniques are applied to a ldquocleanedrdquo data set which is considered to be perfectly labeled with complete accuracy. This paper is done within the context of a generic system for the visual surface inspection of manufactured parts; however, the issues treated are relevant not only to wider computer vision applications such as medical image screening but also to classification more generally. Many of the issues we consider arise from the nature of humans themselves: They will be not only internally inconsistent but also will often not be completely confident about their decisions, particularly if they are making decisions rapidly. People will also often differ systematically from each other in the decisions they make. Other issues may arise from the nature of the process, which may require the ML to have the capacity for real-time online adaptation in response to users' input. Because of this, it may be that the users cannot always provide input to a consistent level of detail. We describe how all of these issues may be tackled within a coherent methodology. By using a range of classifiers trained on data sets from a compact disc imprint production process, we present results which demonstrate that training methods designed to take proper consideration of these issues may actually lead to improved performance.
Original language | English |
---|---|
Pages (from-to) | 960-971 |
Journal | IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans |
Volume | 39 |
Issue number | 5 |
DOIs | |
Publication status | Published - 2009 |
Keywords
- Human-machine interaction (HMI)
- image classification
- insight into classifier structures
- online adaptation
- partial confidence
- resolving contradictory inputs
- variable input levels