INFORMATION SEARCH SYSTEMS BASED ON THE ANALYSIS OF WEAKLY STRUCTURED TEXT
Keywords:information search; weakly structured data; information search systems.
Since the middle of the XX century, the problem of information storage and access to it has attracted more and more attention. All major news outlets have long created their own web pages where they post their news. Social networks and forums allow millions of people to share billions of messages daily. Many corporate and medical documents are stored on dedicated servers. Each message or document is a block of unstructured or weakly structured text. There is a need to create information search systems based on the analysis of weakly structured text.
Nikolaev A.A. Variety of data structures in modern information // Young scientist. 2019. No. 23 (261). S. 21-23.
Ukuev B.T. Features of processing unstructured data in the information base of scientific research of the university // Natural and technical sciences. 2018.No. 3.P. 75-76.
Smirnov Yu.V. Information retrieval for cloud library systems: features of linguistic support: diss. Ph.D. - Moscow, 2019 .-- 228 p.
GOST R 7.0.91_2015. SIBID. Thesauri for information retrieval. - Introduction. 2016-07-01. -Moscow: Standartinform, 2016 .-- P. 4
Magomedov R.M. On the development of intelligent systems // Territory of Science. 2015. No. 6. S 39-44
Batura T.V. Semantic analysis and ways of representing the meaning of the text in computational linguistics // Software products and systems. 2016. No. 4. S. 45-57
Tsitulsky AM, Ivannikov AV, Rogov IS NLP - Natural Language Processing // Scientific and educational journal for students and teachers "StudNet". 2020. No. 6. C.467-475.
Panfilova O.A., Kryukova D.Yu., Davydova E.N. Informational resources.Search systems. Vologda. 2019.S. 81-88.
Leonov E.A., Sinitsin I.V., Sheptunov S.A. Application of thematic modeling methods for analyzing student performance in the framework of monitoring the educational process // Quality. Innovation. Education. Moscow. 2018.S. 15-19
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.
A work licensed in this way allows the following:
1. The freedom to use and perform the work: The licensee must be allowed to make any use, private or public, of the work.
2. The freedom to study the work and apply the information: The licensee must be allowed to examine the work and to use the knowledge gained from the work in any way. The license may not, for example, restrict "reverse engineering."
2. The freedom to redistribute copies: Copies may be sold, swapped or given away for free, in the same form as the original.