Abstract
Semantic attributes extracted from images could help to improve many interesting applications, including image classification, recommendation systems and online advertising. However, learning of such attributes requires a large well-labelled dataset which is usually difficult and expensive to collect and sometimes requires human domain experts to annotate. Partially labelled data, on the contrary, are relatively easy to obtain from social media websites or be annotated by less experienced people. However, a partially labelled dataset usually contains a lot of noisy data which are challenging for previous methods. In this paper, we propose a semi-supervised Random Forest algorithm that can handle a small well-labelled attribute dataset and large scale pairwise data at the same time for classifying grouped attributes. Results on two typical attribute datasets show that the proposed method outperforms the state-of-the-art attribute learner.
Original language | English |
---|---|
Pages (from-to) | 253-262 |
Number of pages | 10 |
Journal | Proceedings of the International Conference on Electronic Business (ICEB) |
Volume | 2019 |
Publication status | Published - 31 Dec 2019 |
Event | 19th International Conference on Electronic Business, ICEB 2019 - Newcastle upon Tyne, United Kingdom Duration: 8 Dec 2019 → 12 Dec 2019 |
Keywords
- Imperfect Data
- Machine Learning
- Pairwise data