Organization and optimization of information space user

Daily performing their official and other functions, modern man is faced with the task of analyzing large amounts of information and search for necessary data. Over time, the accumulation of user data in the form of documents. These documents in the amount of some information space for the user. With each new document, all the more acutely raises the question of the organization of this space: with time of a pair of three folders hierarchically – arranged their files and get a huge pile of documents, which is difficult enough to lead to a hierarchical form with linear constraints. The challenge is concretization, categorization and visualization of the information space of the user.

Define some terminology: the information under the user space in this article will be understood to be a set of text (not tabular and graphical) and documents (files), distributed file system within a hierarchy of directories. For clarity, simplify the description of the conditions of belonging of the documents of the information space to one subject area, such as the economy. Text files can represent economic articles, scientific papers, academic literature, and other forms of presentation of economic information text.

At the initial stage of formation of the information space, the user can simply navigate because of its small size and, consequently, a fairly clear structure and relationships between its elements. With time and doing the official, academic and everyday functions, the power of the information space increases, the weight of individual links between nodes (files) and decreases to navigate it is becoming increasingly difficult. This increases the search time of the necessary information, decreases the quality and productivity of activity of the user within the framework of information space.
As a rule, it is connected not only with the increase of textual information, but also with low speed of its perception by the user. Search the desired scene in the entire array is also difficult: the user should properly make a search query to obtain adequate results, and sometimes it can be problematic due to, for example, low awareness of the user in the subject area or the presence of synonyms or the facts that describe two different things with similar wording. Also, the use of full-text search of documents forces the operating system does not provide personalization and relevancy of the SERPs, which also negatively affects the speed of the user experience and the quality of his information space.
Of the above shortcomings of the standard search and the organization of the information space, it follows that for optimization of the information activities of the user should:

the

to Divide the subject area into categories or "zones"
to Highlight the key components of the subject area
to Visualize the subject area to accelerate the perception of the person
to Determine the nodes inside each element of the subject area (ontology formation)
to Define properties of objects within the nodes of the subject area and their relationships (the completion of the ontologies)
Define the connection and interaction between the nodes of a domain (semantic network from the nodes of the ontology)
to Link together layers and functional description of the subject area (Map overlay Tags, ontologies and semantic networks on top of each other)
Implement the function of personalization of the subject area and the relevance of its presentation based on an iterative learning process of interaction with the user.

made

fault tolerance (objects of the ontology will be stored on the server with the data backup system)
scalability (the system fairly quickly will be possible to connect new users)
optimal use of computing
development of a group of users (ontology will be updated and optimized, not one, but several people that will accelerate its development will allow to build ontologies of high capacities in a relatively short time, and will avoid redundancy by providing the ability to search for duplicates and synonyms means of designing and supporting ontologies)

provided

scope Definition and scope of the ontology

Which area will be covered by the ontology?
To what will be used the ontology?
what types of questions should give the information in the ontology?

Consideration of options for reuse of existing ontologies

Enumerate important terms in the ontology

Defining classes and a class hierarchy

Process down development starts with defining the most General concepts of the subject area with the subsequent specialization of the concepts.
the rising Process of development begins with the definition of the most specific classes, the leaves of the hierarchy, with subsequent grouping of these classes into more General concepts.
the Process of combined development is a combination of top-down and bottom-up approaches: First, we define the concept more visible, and then appropriately summarize and limit them.

define the properties of classes (slots)

internal properties object
external object property
if the object has the structure (can be both physical and abstract parts)
relationships with other individnal concepts

definition of the facets of the slots

Power slot

the value type of the slot

Domain slot and a range of values

Creating instances

If the intersection is not empty, for each term from T(O) are plotted two sets T_s and T_q — terms that are related to each ontology in any relationship
For each term from T(O) is the intersection of the sets and T_s T_q.
analysis of the types of relations between terms from T(O) and crossing the T_s and sets T_q. (all relationships of the ontology are divided into three types – hierarchical, synonymous and other).
to build the ratio of similarities of ontologies, a numerical display of the similarity of semantics of two ontologies. This takes into account the following factors: the occurrence of the same term in both ontologies; the fact that two terms are in different ontologies in the same attitude; the fact that two terms are in different ontologies in the relations of the same type or different (e.g., in a hierarchical relationship, and the relation of synonymy); whether there is any relationship (direct or indirectly) between the same terms.

want

Literature

Article based on information from habrahabr.ru

Поиск по этому блогу

computer express

Organization and optimization of information space user

Комментарии

Отправить комментарий

Популярные сообщения из этого блога

Import iblock from 1C-Bitrix to MODx Revolution

Freelance vs. business

"Aliketo" — looking for similar things, and even sometimes, but only in English