Estimating the number of people in images is an important task in computer vision, and has a wide range of applications, such as video surveillance, traffic monitoring, public safety and urban planning. The task of crowd counting faces many challenges, e.g. extremely scale variations in extremely dense crowd scenarios, severe occlusion, and perspective distortion. We have studied a new type of multi-granularity full convolutional network as an effective solution for crowd counting. We not only designed the parallel multi-receptive field module to learn the mapping from the crowd images to the density maps, but also introduced the skip-connection mechanism to better train the mapping to improve the quality of the estimated density map, which is critical for accurate crowd counting. The experimental results upon ShanghaiTech public dataset showed that the proposed method can obtain more accurate and more robust results on crowd counting than the most advanced method.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.