Catalogue of TDM Components
Find TDM and NLP components and mix them together
The catalogue of Text and Data Mining (TDM) components brings together pieces of software that perform basic TDM and Natural Language Processing (NLP) tasks and can be mixed together to build sophisticated applications. All components have been adapted to the OpenMinTeD interoperability specifications which aim at enabling their integration irrespective of their original implementation framework and ensure that they can be executed without problems in the OpenMinTeD platform.
The catalogue targets mainly TDM developers, i.e. expert users who know how to combine them appropriately in the same workflow and build ready-to-run applications.
The service is empowered with browse and search (faceted and google-like free text) functionalities based on the harmonised metadata descriptions of the resources. For each component, the user can view:
- technical information, such as the task it performs, the requirements on the input resource (e.g. whether it runs on a corpus of English, on texts with a specific data format, etc.), details on the output it produces, links to documents that may help the users (e.g. user manuals, video tutorials, publications about the component, etc.), etc.
Developers of Text and Data Mining applications; researchers and other users with an advanced technical know-how on creating workflows
The main asset of this catalogue is that it brings together basic TDM and NLP software components conforming to the OpenMinTeD interoperability specifications and are, thus, guaranteed to be executed without problems in the platform and easy to be combined with other components.
EC funds (H2020 grant 654021 for the OpenMinTeD project) & National funds for the GRNET cloud infrastructure on which the platform operates