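Banokin P.I., Efremov A.A., Luneva E.E., Kochegurova E.A. A study of the applicability of LSTM recurrent networks to the task of finding expert users in social networks // Software systems and computational methods. 2017. No. 4. P. 53-60. DOI: 10.7256/2454-0714.2017.4.24655 URL: https://nbpublish.com/library_read_article.php?id=24655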

A study of the applicability of LSTM recurrent networks to the task of finding expert users in social networks

Banokin Pavel Ivanovich

Assistant, Tomsk Polytechnic University

634028, Russia, Tomskaya oblast', g. Tomsk, ul. Lenina, 2, of. 103a

pavel805@gmail.com

Efremov Aleksandr Aleksandrovich

Assistant, Tomsk Polytechnic University

634028, Russia, Tomskaya oblast', g. Tomsk, ul. Lenina, 2, of. 115a

alexyefremov@tpu.ru

Luneva Elena Evgenevna

PhD in Technical Science

Associate Professor, Tomsk Polytechnic University

634028, Russia, Tomskaya oblast', g. Tomsk, ul. Lenina, 2, of. 115a

lee@tpu.ru

Kochegurova Elena Alekseevna

PhD in Technical Science

Associate Professor, Tomsk Polytechnic University

634028, Russia, Tomskaya oblast', g. Tomsk, ul. Lenina, 2, of. 112a

kocheg@mail.ru

DOI:

10.7256/2454-0714.2017.4.24655

Received:

08-11-2017

Published:

11-01-2018

Funding: grant No. 17-07-00034 a.

Abstract: The article examines the applicability of long short-term memory (LSTM) recurrent networks to the binary classification of text messages from the social network Twitter. A three-stage classification process is proposed that analyzes pictograms separately and checks the text for neutrality. The accuracy of classifying the emotional polarity of text messages with an LSTM network and vector representations of words was evaluated. The share of training-set words that must be covered by the word-vector vocabulary to obtain acceptable classification accuracy was determined. The training speed and memory use of the LSTM network were also estimated. The classification task is solved with natural language processing methods and supervised machine learning. The algorithmic basis for processing social-network text data with LSTM neural networks was optimized. The novelty of the proposed method lies in the preprocessing of messages, which improves classification accuracy, and in a neural network configuration that takes the specifics of social-network text data into account.

Keywords:

recurrent neural networks, natural language processing, sentiment analysis, LSTM networks, social networks, word embeddings, Twitter, text data preprocessing, recurrent network, binary classification

��. �� . �� , �� . �� , �� , �� . � �� -�� .

�� LSTM, �� , �� -�� Twitter. �� : �� . �� .

�� . �� . � �� «�� » (�� ), �� . ^[1].�� .

�� , �� , �� ^[1]. � �� , �� , �� , �� ^[2]. �� , �� Stanford NLP ^[3], �� . ��, �� , �� , �� , �� . �� .

�� , �� , �� , �� ^[4]. �� .

�� ^{[5, 6]}, �� . �� (convolution neural network, CNN) � �� LSTM-�� . �� . �� , � �� ^{[5, 6]}.

�� 76%-95% � �� ^[5]. � �� ^[16] �� LSTM � �� .

�� ^[5].

�� LSTM �� . �� , �� , � �� . �� (��) ��.

�� LSTM �� CNN �� ^[7]. �� k-�� `X=(x_(1),x_(2),...,x_(k))` �� (`x_(i)inR` ). ��, �� , �� , �� Glove ^[8], fastText ^[9] � Word2Vec ^[10]. �� Wikipedia, Twitter, IMDB � �� .

�� , �� ^[11]. �� , �� . �� , � �� , �� (��, �� .�.) �� , �� . �� .

�� , �� , �� . �� 300 � �� . �� (� �� ), �� .

�� . �� (��. 1).

I �� – �� . �� – �� . �� .

II �� – �� . �� . �� SentiWordNet ^[12]. �� `E_(max)` – �� .

III �� – �� . �� `alpha` , �.�. `E_(max)>alpha` , �� . �� `E_(max)<=alpha` , �� . �� , �� .
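The three stages can be illustrated with a short Python sketch. It is only a reading of the legible fragments (a separate pictogram stage, a lexicon-based value `E_(max)`, and a comparison with the threshold `alpha`), not the authors' implementation. The pictogram table, the function name `classify_message`, the plain dictionary standing in for SentiWordNet, and the rule that a text with `E_(max)<=alpha` is labeled neutral are all assumptions made for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical pictogram table (an assumption, not the article's list).
PICTOGRAMS: Dict[str, int] = {":)": 1, ":-)": 1, ":D": 1, ":(": -1, ":-(": -1}


def classify_message(
    tokens: List[str],
    lexicon: Dict[str, float],
    alpha: float,
    polarity_model: Callable[[List[str]], str],
) -> Tuple[int, str]:
    """Return (pictogram score, text label) for a tokenized message."""
    # Stage I: split pictograms from words and score them separately.
    pictogram_score = sum(PICTOGRAMS[t] for t in tokens if t in PICTOGRAMS)
    words = [t.lower() for t in tokens if t not in PICTOGRAMS]

    # Stage II: E_max is the largest emotional weight among the words.
    # Words missing from the lexicon are treated as neutral (weight 0).
    e_max = max((lexicon.get(w, 0.0) for w in words), default=0.0)

    # Stage III (assumed rule): only texts with E_max > alpha reach the
    # polarity classifier; the rest are labeled neutral.
    if e_max > alpha:
        return pictogram_score, polarity_model(words)
    return pictogram_score, "neutral"
```

Here `polarity_model` stands for any binary classifier, such as a trained LSTM network, wrapped as a function from a word list to a label. Calling the function with `alpha = 0.3` echoes the "> 0.3" in the header of Table 1, although tying that value to `alpha` is itself an assumption.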

��. 1 – �� Twitter

�� . � �� . �� : �� , �� . �� . �� (�� Twitter) �� `psi` �� `mxxn`, �� `m` – �� , `n` – �� . �� `psi_(ij)` �� j-�� i-� ��.
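A minimal sketch of the matrix `psi` follows. It assumes that `m` is the number of words in the message, that `n` is the dimension of the word vectors, and that `psi_(ij)` is the j-th component of the vector of the i-th word. The function name `message_matrix` and the choice to map out-of-vocabulary words to a zero vector are hypothetical.

```python
from typing import Dict, List, Sequence


def message_matrix(
    words: Sequence[str],
    embeddings: Dict[str, Sequence[float]],
    n: int,
) -> List[List[float]]:
    """Build psi: one row per word, one column per vector component."""
    zero = [0.0] * n  # assumed placeholder for out-of-vocabulary words
    psi: List[List[float]] = []
    for word in words:
        vector = embeddings.get(word, zero)
        if len(vector) != n:
            raise ValueError(f"vector for {word!r} has length {len(vector)}, expected {n}")
        psi.append(list(vector))
    return psi
```

For a message of three words and vectors of dimension 50, the result is a 3 x 50 nested list, and `psi[i][j]` corresponds to `psi_(ij)` with zero-based indices.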

��. 2 – ��

�� , �� .

�� . � �� LSTM �� :

1. �� .

2. �� .

3. �� . �� . �� – �� , �� .

� �� , �� Wikipedia ^[8]. �� 400000 �� , �� 50, 100 � 200 ��. �� , �� ^[13], �� 624077 �� Twitter �� . �� 13.19 ��. �� – 340077, �� – 74442 (21.89% �� ).
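The figures 340077, 74442 and 21.89% in the paragraph above are mutually consistent, since 74442 / 340077 ≈ 0.2189. The sketch below shows one way such a share could be computed. It assumes that the larger number is the count of distinct words in the corpus and the smaller one is the subset also present in the word-vector dictionary. The function name `embedding_coverage` is hypothetical.

```python
from typing import Iterable


def embedding_coverage(corpus_vocab: Iterable[str], embedding_vocab: Iterable[str]) -> float:
    """Share of distinct corpus words that also occur in the embedding vocabulary."""
    corpus = set(corpus_vocab)
    if not corpus:
        return 0.0
    return len(corpus & set(embedding_vocab)) / len(corpus)


# Arithmetic check of the percentage quoted in the text.
reported_share = round(100 * 74442 / 340077, 2)
```

Applied to the tokenized corpus and to the keys of the loaded word-vector file, the function returns a fraction between 0 and 1, and `reported_share` evaluates to 21.89.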

� �� LSTM �� , �� ^[14]. � �� (��. 3) �� :

1. �� Glove.

2. �� LSTM-�� . �� ^[15].

3. �� . �� -�� .

4. �� . �� , �� , �� .

��. 3 – �� , ��

� �� LSTM �� 2200 ��. �� 2.5 �� , �� 100000.

�� 50, 100 � 200 �� (±2.39%). �� 200 �� 4 � 5. � �� ^[16] �� .

��. 4 – �� LSTM

��. 5 – ��

�� , �� . �� , �� , �� , �� . �� Twitter, � �� (). �� 1.

�� 1. �� 

��	LSTM-��	LSTM �� > 0.3
�� , 100 ��.	96%	96%
�� Twitter, �� , 100 ��.	77%	87.8%

�� , �� , � � �� , �� . �� , �� . �� 21.89% �� .

��. �� , �� LSTM �� , �� Twitter. �� . ��, �� 50 �� .

�� LSTM � �� , �� . �� (�� 17-07-00034 �).

References

1. Perkins J. Python 3 Text Processing with NLTK 3 Cookbook. Birmingham, UK: Packt Publishing Ltd, 2014. 304 p.
2. Luneva E.E., Efremov A.A., Banokin P.I. Sposob otsenki emotsii pol'zovatelei s ispol'zovaniem nechetkoi logiki na primere sotsial'noi seti Twitter [A method for assessing user emotions using fuzzy logic, with the social network Twitter as an example] // Sistemy upravleniya i informatsionnye tekhnologii. Voronezh: Nauchnaya kniga, 2015. No. 1.1(59). P. 157-162.
3. The Stanford Parser: A statistical parser // The Stanford Natural Language Processing Group. URL: https://nlp.stanford.edu/software/lex-parser.shtml (accessed: 10.10.2017).
4. Mozetic I., Grcar M., Smailovic J. Multilingual Twitter Sentiment Classification: The Role of Human Annotators / ed. Perc M. // PLoS ONE. 2016. Vol. 11(5).
5. Kim Y. Convolutional Neural Networks for Sentence Classification // Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. Stroudsburg, USA: Association for Computational Linguistics, 2014. P. 1746-1752.
6. Dos Santos C. N., Gatti M. Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts // COLING. 2014. P. 69-78.
7. Zhang X., Zhao J., LeCun Y. Character-level Convolutional Networks for Text Classification // Advances in Neural Information Processing Systems. NY, USA: Curran Associates, 2015. P. 649-658.
8. GloVe: Global Vectors for Word Representation // Stanford NLP. URL: https://nlp.stanford.edu/projects/glove/ (accessed: 10.10.2017).
9. FastText: Library for fast text representation and classification // GitHub. URL: https://github.com/facebookresearch/fastText (accessed: 10.10.2017).
10. Mikolov T., Sutskever I., Chen K., Corrado G., Dean J. Distributed representations of words and phrases and their compositionality // Advances in Neural Information Processing Systems. 2013. Vol. 26. P. 3111-3119.
11. Johnson R., Zhang T. Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level // arXiv. URL: https://arxiv.org/abs/1609.00718 (accessed: 10.10.2017).
12. Baccianella S., Esuli A., Sebastiani F. SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining // Proceedings of the International Conference on Language Resources and Evaluation. Valletta, Malta: European Language Resources Association (ELRA), 2010.
13. Twitter Sentiment Analysis Training Corpus // Thinknook. URL: http://thinknook.com/twitter-sentiment-analysis-training-corpus-dataset-2012-09-22/ (accessed: 10.10.2017).
14. Perform sentiment analysis with LSTMs, using TensorFlow // O'Reilly Media (accessed: 16.10.2017).
15. Sak H., Senior A., Beaufays F. Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling // INTERSPEECH-2014. Singapore: ISC, 2014. P. 338-342.
16. Hong J., Fang M. Sentiment analysis with deeply learned distributed representations of variable length texts: technical report. Stanford, USA: Stanford University, 2015. 9 p.
