At present, in the field of person reidentification (re-id), the commonly used supervised learning algorithms require a large amount of labeled samples, which is not conducive to the model promotion. On the other hand, the accuracy of unsupervised learning algorithms is lower than supervised algorithms due to the lack of discriminant information. To address these issues, we make use of a small amount of labeled samples to add discriminant information in the basic dictionary learning. Moreover, the sparse coefficients of dictionary learning are decomposed into a projection problem of the original features, and the projection matrix is trained by labeled samples, which is transformed into a metric learning problem. It thus integrates the advantages of the two methods through combining dictionary learning and metric learning. After the data are trained, a projection matrix is used to project the unlabeled features into a feature subspace and the labels of the samples are reconstructed. The semisupervised learning problem is then transformed to a supervised learning problem with a graph regularization term. Experiments on different public pedestrian datasets, such as VIPeR, PRID, iLIDS, and CUHK01, show that the recognition accuracy of our method is better than some other existing person re-id methods.
A discriminative deep transfer metric learning method called DDTML is proposed for cross-scenario person re-identification (Re-ID). To develop the Re-ID model in a new scenario, a large number of pairwise cross-camera-view person images are deemed necessary. However, this work is very expensive due to both monetary cost and labeling time. In order to solve this problem, a DDTML for cross-scenario Re-ID is proposed using the transferring data in other scenarios to help build a Re-ID model in a new scenario. Specifically, to measure distribution difference across scenarios, a maximum mean discrepancy based on class distribution called MMDCD is proposed by embedding the discriminative information of data into the concept of the maximum mean discrepancy. Unlike most metric learning methods, which usually learn a linear distance to project data into the feature space, DDTML uses a deep neural network to develop the multilayers nonlinear transformations for learning the nonlinear distance metric, while DDTML transfers discriminative information from the source domain to the target domain. By bedding the MMDCD criteria, DDTML minimizes the distribution divergence between the source domain and the target domain. Experimental results on widely used Re-ID datasets show the effectiveness of the proposed classifiers.
At present, the level of urbanization in China has exceeded 50% and the number of car ownership has reached 140 million. The consequent problem of traffic congestion has become increasingly prominent. It is increasingly important that how to get the basic vehicle information in real time and accurately so that the traffic department can timely manage the vehicles of the specific road sections and intersections. At present, some related methods and algorithms have high real-time performance, but the accuracy is not high or the contrary. Accordingly, this paper proposes a method of automatic vehicle detection based on YOLOV2 framework which has both real-time and accuracy. The method improves the YOLOv2 framework model, optimizes the important parameters in the model, expands the grid size, and improves the number and sizes of anchors in the model, which can automatically learn the vehicle features and realize real-time and high-precision vehicle automatic detection and vehicle class identification. The evaluation on home-made dataset shows that compared with YOLOv2 and Faster RCNN, the accuracy rate is raised to 91.80 %, the recall rate to 63.86 %.
Focused on the issue that the person re-identification across non-overlapping camera views and the high dimensional features extracted from the images, a novel person re-identification algorithm is proposed. The algorithm obtained the semantic information of each camera view by the sparse learning, and then the Canonical Correlation Analysis (CCA) is used to carry out the high-level feature projection transformation. The algorithm aims to avoid the curse of dimensionality caused by the high dimensional feature operation via improving the feature matching ability. To the end, the characteristic distance between different views can be compared. The advantages of this method is to learn the robust pedestrian image feature representation and it also builds person re-identification model with block structure feature of pedestrian dataset, and the associated optimization problem is solved by utilizing the alternating directions framework in order to improve the performance of person re-identification. At last, the experimental results show that the proposed method has higher recognition efficiency on three benchmark datasets of the PRID 2011, iLIDS-VID and VIPeR.
A novel discriminative deep transfer learning method called DDTML is proposed for Cross-scenario Person Reidentification( Re-ID). Using a deep neural network, DDTML learns a set of hierarchical nonlinear transformations for Cross-scenario Person Re-identification by transferring discriminative knowledge from the source domain to the target domain. Meanwhile, taking account of the inherent characteristics of Re-ID data sets, in order to reduce the distribution divergence between the source data and the target data, DDTML minimizes a new maximum mean discrepancy based on Class Distribution called MMDCD at the top layer of the network. Experimental results on widely used Re-identification datasets show the effectiveness of the proposed classifiers.
At present, in the field of person re-identification, the commonly used supervised learning algorithms require a large size of labelled sample, which is not conducive to the model promotion. On the other hand, the accuracy of unsupervised learning algorithms is lower than supervised algorithms due to the lack of discriminant information. To address these issues in this paper, we make use of a small size of labelled sample to add discriminant information in the basic dictionary learning. Moreover, the sparse coefficients of dictionary learning are decomposed into a projection problem of the original features, and the projection matrix is trained by labelled samples, which is transformed into a metric learning problem. It thus integrates the advantages of the two methods through combining dictionary learning and metric learning. After the data is trained, a new projection matrix is used to project the unlabeled features into a new feature subspace and the labels of the samples are reconstructed. The semi-supervised learning problem is then transformed to a supervised learning problem with a Laplace term. Experiments on different public pedestrian datasets, such as VIPeR, PRID, iLIDS and CUHK01, show that the recognition accuracy of our method is better than some other existing person reidentification methods.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.