Tovek Indexing

Tovek Indexing makes data accessible in our solutions. It unifies data across different sources, formats, and languages, and automatically identifies additional information in the processed data.

Targeted Indexing, together with the design of individual index structures, is usually the most demanding part of the Tovek platform implementation.

The following components perform the individual indexing tasks.


This component provides access to data repositories (file system, database, e-mail, DMS, etc.). It automatically detects new and altered data in the repository and passes it to Tovek. It also adopts the existing access rights. For structured storage, it supplies information about the structure of the data to the Tovek system. It can also supply additional metadata (e.g. the last edit date).
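As an illustration of how new and altered data can be detected in a file-system repository, here is a minimal sketch in Python. It is not Tovek's implementation: the function names `snapshot` and `diff_snapshots` are hypothetical, and comparing modification times is only one possible strategy, chosen here as an assumption.

```python
import os
from typing import Dict, List, Tuple


def snapshot(root: str) -> Dict[str, float]:
    """Map every file path under root to its last-modification time."""
    state = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            state[path] = os.stat(path).st_mtime
    return state


def diff_snapshots(old: Dict[str, float],
                   new: Dict[str, float]) -> Tuple[List[str], List[str]]:
    """Return (new files, altered files) between two snapshots."""
    added = sorted(p for p in new if p not in old)
    altered = sorted(p for p in new if p in old and new[p] != old[p])
    return added, altered
```

A connector built along these lines would keep the previous snapshot, take a fresh one on each run, and hand only the new and altered paths to the next processing step.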


This component extracts text and available metadata from various data formats (PDF, HTML, Word, Excel, ...).


A set of language-specific actions. It includes a number of steps, ranging from splitting text into words, through normalisation and lemmatisation, to extracting names.
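The first of these steps can be sketched in Python as follows. This is an illustrative example only: the tiny `LEMMAS` table and all function names are assumptions made for the sketch, a real system would use a full language-specific dictionary or morphological analyser, and name extraction is not covered here.

```python
import re
import unicodedata
from typing import List

# Hypothetical toy lemma table; stands in for a real morphological dictionary.
LEMMAS = {"indexes": "index", "documents": "document", "running": "run"}


def tokenize(text: str) -> List[str]:
    """Split text into word tokens (runs of letters, digits, underscores)."""
    return re.findall(r"\w+", text)


def normalize(token: str) -> str:
    """Lowercase the token and strip diacritics."""
    decomposed = unicodedata.normalize("NFKD", token.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def lemmatize(token: str) -> str:
    """Map a normalised token to its base form, if the table knows it."""
    return LEMMAS.get(token, token)


def process(text: str) -> List[str]:
    """Run tokenisation, normalisation, and lemmatisation in order."""
    return [lemmatize(normalize(t)) for t in tokenize(text)]
```

For example, `process("Indexes of Plzeň documents")` yields `["index", "of", "plzen", "document"]`.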

Metadata extraction

It searches for other structured information according to specified rules.
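A rule-based search of this kind can be sketched with regular expressions. The two rules below (e-mail addresses and ISO dates) and the name `extract_metadata` are hypothetical examples chosen for illustration; they do not describe the rule format Tovek actually uses.

```python
import re
from typing import Dict, List

# Hypothetical rules: each name maps to a pattern describing one kind of metadata.
RULES = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "iso_date": re.compile(r"\b\d{4}-\d{2}-\d{2}\b"),
}


def extract_metadata(text: str) -> Dict[str, List[str]]:
    """Apply every rule to the text and collect all matches per rule."""
    return {name: pattern.findall(text) for name, pattern in RULES.items()}
```

Keeping the rules in a table separate from the matching code means new kinds of metadata can be added without touching the extraction logic.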

Creating an index

A custom process that builds the inverted index and writes it to disk. At this stage, the data becomes available to other Tovek platforms.
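To show what an inverted index is, here is a minimal Python sketch that maps each term to the documents containing it and writes the result to disk. The whitespace tokenisation, the JSON file format, and the function names are simplifying assumptions made for the example; Tovek's own index format is not described here.

```python
import json
from collections import defaultdict
from typing import Dict, List


def build_inverted_index(docs: Dict[str, str]) -> Dict[str, List[str]]:
    """Map each lowercased term to the sorted list of document IDs containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in sorted(index.items())}


def write_index(index: Dict[str, List[str]], path: str) -> None:
    """Persist the index as JSON so that other tools can load it."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(index, handle, ensure_ascii=False, indent=2)
```

With the index in place, finding every document that contains a term is a single dictionary lookup rather than a scan of all documents.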


Processing information is written to logs, which are used primarily to detect potential problems during data processing.


We do not need to interfere with existing information systems. The original data remains unchanged in its original location. From the data, we create the internal indexes that we work with. Access rights are retained. Users can also create and store additional information directly in our system using remote indexing, which is available through Tovek Server.

Contact us to schedule a demonstration today.


Telephone +420 606 688 775