Lytvyn V. The method of formation of the status of personality understanding based on the content analysis / V. Lytvyn, P. Pukach, I. Bobyk, V. Vysotska // Eastern-European Journal of Enterprise Technologies. – 2016. – No. 5/2 (83). – P. 4-12.

UDC 004.89
DOI: 10.15587/1729-4061.2016.77174

V. Lytvyn - Doctor of Technical Sciences, Professor*
P. Pukach - Doctor of Technical Sciences, Associate Professor**
I. Bobyk - PhD, Associate Professor**
V. Vysotska - PhD, Associate Professor*

*Department of Information Systems and Networks***
**Department of Mathematics***
***Lviv Polytechnic National University
S. Bandery str., 12, Lviv, Ukraine, 79013


     An approach is proposed for developing an information system that determines the psychological state of a person from the five personality dispositions (extraversion/introversion, amiability or agreeableness, integrity or conscientiousness, neuroticism, openness to experience). The approach relies on content analysis of Internet resources where users leave traces (social networks, forums, chats, etc.).
     In general, forming the status of the psychological state of a person from content analysis requires solving four problems. First, content is collected from various Internet sources. Second, it undergoes initial processing: tags, auxiliary words, punctuation marks, special symbols, hyperlinks, pictures, etc. are removed from the text. Third, the content is filtered (spam identification, duplicate detection, formatting, etc.) and sorted (comments on comments, likes, posts) according to statistics over a specific period. Finally, content analysis is performed on the collected information, which is categorized by the stop-words (markers).
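The initial processing and duplicate filtering steps described above can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names `clean_text` and `filter_posts`, the regular expressions, and the tiny `AUXILIARY_WORDS` list are all assumptions made for the example. Spam detection and sorting by statistics are left out.

```python
import html
import re

# Hypothetical, deliberately tiny list of auxiliary words (assumption);
# a real list would be far longer and language-specific.
AUXILIARY_WORDS = {"a", "an", "the", "and", "or", "of", "to", "in", "is", "at"}

TAG_RE = re.compile(r"<[^>]+>")                  # HTML tags, including <img ...>
URL_RE = re.compile(r"https?://\S+|www\.\S+")    # hyperlinks in plain text
NON_LETTER_RE = re.compile(r"[^\w\s]|\d|_")      # punctuation, digits, underscores


def clean_text(raw: str) -> list[str]:
    """Initial processing: drop tags, links, symbols and auxiliary words."""
    text = TAG_RE.sub(" ", raw)
    text = html.unescape(text)
    text = URL_RE.sub(" ", text)
    text = NON_LETTER_RE.sub(" ", text)
    tokens = text.lower().split()
    return [t for t in tokens if t not in AUXILIARY_WORDS]


def filter_posts(posts: list[str]) -> list[list[str]]:
    """Filtering: skip posts that are empty after cleaning or duplicate an earlier post."""
    seen = set()
    kept = []
    for post in posts:
        tokens = clean_text(post)
        key = tuple(tokens)
        if not tokens or key in seen:
            continue
        seen.add(key)
        kept.append(tokens)
    return kept
```

Because `\w` is Unicode-aware for `str` patterns in Python, the same cleaning step keeps Cyrillic letters, so it can be applied to Ukrainian posts as well as English ones.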
     To determine the psychological dispositions of a person, we implemented the developed method for searching and analyzing marker words in English and Ukrainian. We used Porter stemming, lemmatization, and a modified Porter stemmer for Ukrainian texts designed by the authors. Tables of correlation between marker words and psychological dispositions were developed. An information system for determining the psychological state of a person was created on the basis of the developed approach and the content processing methods. The system analyzes the messages of social network users according to the traits of the “Big Five”. It is designed as a desktop program that also acts as an Internet service, and it allows analyzing the psychological state of a particular social network user from his/her messages. All collected results are stored in a database. The results are displayed as a percentage for each trait, the number of tweets, and the most frequently used words related to these traits.
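The idea of matching stemmed words against trait markers and reporting a percentage per trait can be illustrated with the sketch below. Everything in it is an assumption made for illustration: the marker words in `MARKERS` are invented and do not come from the authors' correlation tables, and `naive_stem` is a crude suffix stripper standing in for Porter stemming, not an implementation of it.

```python
from collections import Counter

# Hypothetical marker words per trait (assumption, not the authors' tables).
MARKERS = {
    "extraversion": {"party", "friend", "talk"},
    "amiability": {"kind", "help", "thank"},
    "integrity": {"plan", "duty", "careful"},
    "neuroticism": {"worry", "afraid", "panic"},
    "openness": {"idea", "art", "curious"},
}

SUFFIXES = ("ing", "ed", "es", "s")


def naive_stem(word: str) -> str:
    """Crude stand-in for Porter stemming: strip one common English suffix."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


# Stem the markers with the same function so both sides are comparable.
MARKER_STEMS = {
    trait: {naive_stem(w) for w in words} for trait, words in MARKERS.items()
}


def trait_profile(tokens: list[str]):
    """Return (percentage per trait, most frequent marker stems per trait)."""
    hits = Counter()
    words = {trait: Counter() for trait in MARKER_STEMS}
    for stem in (naive_stem(t) for t in tokens):
        for trait, stems in MARKER_STEMS.items():
            if stem in stems:
                hits[trait] += 1
                words[trait][stem] += 1
    total = sum(hits.values())
    percent = {
        trait: (100.0 * hits[trait] / total if total else 0.0)
        for trait in MARKER_STEMS
    }
    top = {t: [w for w, _ in words[t].most_common(3)] for t in MARKER_STEMS}
    return percent, top
```

In this sketch the percentages are shares of all marker hits, so they sum to 100 whenever at least one marker is found. The source does not state how the system normalizes its percent ratio, so this choice is an assumption as well.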
     Potential users of such systems are consulting and marketing companies. The collected and analyzed information on users may be used in hiring or in the promotion of products and services. Automated compilation of user personality models is helpful for social networks and Web services. It improves the quality and efficiency of contextual advertising, referral systems, recommendations and dating services.
     In-depth knowledge of the audience is crucial for business and recruiting. The constructed system was tested in operation, and its results were found satisfactory. Such an information system is recommended for use in searching for employees for certain positions.
     Automated analysis of the messages of social network users, used to form the status of the psychological state of a person from content analysis, significantly reduces the time needed to find a potentially promising employee among the applicants, taking into account his/her psychological portrait for a specific position.

Keywords: content, information resource, content analysis, linguistic analysis, morphological analysis, social network.

An approach is proposed for developing a personality understanding system through content analysis of information resources. The Big-Five model is used, based on user behavior in social networks. A method for analyzing English-language and Ukrainian-language posts is developed to determine psychological dispositions. The system is intended for recruiting, marketing, social networks and Web services. Audience analysis improves the effectiveness of contextual advertising, recommendation systems and dating services.

