Few shot text classification using adaptive cross capsule network

Bin Qin; Yumeng Yan; Hongyu Chen

doi:10.1117/12.2667207

1 March 2023 Few shot text classification using adaptive cross capsule network

Bin Qin, Yumeng Yan, Hongyu Chen

Proceedings Volume 12588, International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2022); 125880P (2023) https://doi.org/10.1117/12.2667207
Event: International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2022), 2022, Chongqing, China

Abstract

In recent years, meta-learning has become a mainstream technique for few-shot learning, and it has been widely used and achieved good results in computer vision and image processing. Based on this powerful empirical performance, we are interested in using Meta-learning frameworks in NLP to deal with the task of few-shot learning (FSL). However, due to the sparse sample size, sample-level comparisons based on other expressions are highly susceptible to interference, leading to serious overfitting problems. To achieve classification tasks, we suggest a novel Adaptive Cross-Capsule Network (ACCN) for learning generalized representations. A dynamic routing technique is utilized with the concept of a prototype network to train the support set to generalize the generalized representations of each category. The support set and the query set can fully interact dynamically to capture the essential semantic aspects of the query set following a successful non-parametric cross-attention method. Experimental results show that ACCN proposed in this paper is well adaptive to the intention classification task under additional categories, which obtain SOTA results on FewRel Datasets, which also can perform significantly better than the original classification system on Huffpost Datasets. This provides a crucial foundation for this study.

Citation Download Citation

Bin Qin, Yumeng Yan, and Hongyu Chen "Few shot text classification using adaptive cross capsule network", Proc. SPIE 12588, International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2022), 125880P (1 March 2023); https://doi.org/10.1117/12.2667207

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
9 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Semantics

Computer programming

Data modeling

Classification systems

Education and training

Performance modeling

Design and modelling

Show All Keywords

Keywords/Phrases

Search In:

Publication Years