Fake news has always been a problem for online communities. It can build mistrust, create boundaries between people, and lower productivity in many government departments. Distinguishing fake news by human is slow and expensive, therefore, machines were proposed by many scholars to do the job. There are mature machine learning algorithms available to use for distinguishing fake news and real news. Despite the high accuracy of most well designed machine learning models, they can still make mistakes on some news. In this paper, we compared three machine learning models for news classification and evaluate biases in these models. The first one is Embedding Bag, second one is CNN network, and then the Naive Bayes Classifier. After tidying and vectorizing the data, they were feed to these classifiers to get the model. As a result, all of the models achieved very high accuracy. All of them had an accuracy of higher than 90 percent. The biases for all of the approaches are primarily sampling biases. The less represented words have higher chances to be mis classified than those more represented words in the database.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.