Blog

Join the OpenMinTeD call for content

We are happy to announce that the OpenMinTeD platform for text and data mining is now ready to accept content. We invite publishers, repositories, libraries and other holders of scholarly publications to join the open call for content, by submitting a proposal by 29 October 2017 at the latest.

Why you should join the call for content

By opening your content up for text and data mining through the OpenMinTeD platform, you will increase your content’s visibility and impact. Winners of this call will be awarded € 7.000 (small bids) or € 17.500 (larger bids) to implement the connection of their content to the OpenMinTeD infrastructure.

How to join the call

You can join the call by submitting a CV and proposal of maximum 10 pages that describes your work plan, method, risk assessment, people and experience etc. A review committee will judge the proposals based on among other things alignment with the OpenMinTeD objectives, coverage, feasibility of integration methodology and experience.

More information

All relevant information, specifications and the submission form can be found on the open tenders webpage: https://openminted.bsc.es/

About OpenMinTeD

OpenMinTeD is a European project of 16 partners that sets out to develop a sustainable infrastructure for text and data mining. We are working on a platform that brings together  text and data, TDM tools and services, training, support and guidelines. Rather than developing new tools, we are adapting existing tools and services and repositories in order to make them interoperable.

 

 

Read more

21 September: Join free webinar on Text Mining in PoolParty Semantic Suite

We are pleased to invite you to attend an upcoming Webinar on the Text and Data mining functionalities of the PoolParty Semantic Suite.

Read more

Textmining in the vineyard at Open Harvest 2017

In 2016, 30 people from important institutions all over the world came together for the first Open Harvest gathering. The goal was to set the stage for a global data infrastructure for agriculture and food. One year later,  Agroknow presented the OpenMinTeD application VITIS at Open Harvest 2017.  

Read more

FutureTDM and OpenMinTeD organise TDM workshop for research libraries

Many university and national libraries are exploring the best way to support researchers with text and data mining.  That’s why on July 5th 2017, OpenMinTeD and FutureTDM organised a workshop about text and data mining at the LIBER conference in Patras. 4 different speakers guided 16 participants through  the various aspects of TDM.

Read more

The amount of information out there is staggering

Tom Potok works at the Oak Ridge National Laboratory in Tenessee. He has been in the field of text and data mining for twenty years and worked on a wide variety of things.  Some of the biggest challenges are the amounts of information out there, and trying to figure out how the mind works with text.

Read more

Helping researchers to find new articles and opportunities

Benj Pettit works at Mendeley and works on text and data mining tools that help researchers to find new articles, collaborators etc. One of the special things about the Mendeley catalogue is that it is formed in a crowdsourced way.

Read more

Upcoming: Open Science Fair in Athens in September

 Open Science is a new research paradigm that is facing many challenges. In order to improve the uptake of Open Science, four EU-projects join forces and organise an event that will showcase critical elements, from infrastructures to policies and  new types of activities. Join us for the Open Science FAIR, September 6-8 in Athens, and get inspired.

Read more

Proceedings of the BioCreative V.5 Challenge Evaluation Workshop

Last April 26-27 the BioCreative V.5 Challenge Evaluation Workshop took place in Barcelona. The goal of BioCreative V.5 was to address some of the major barriers to the adoption and use of text mining tools, related to assessment, accessibility, interoperability, robustness and integration.

Read more

A start-up’s perspective on TDM

Mads Rydahl has a small start-up that applies machine learning to scientific publishing. Thanks to their deep partnership with Springer Nature, they can build value added services inside their platform.

Read more

On the role of a university library in the TDM landscape

Leiden city25 years ago, when Laurents Sesink was still a history student, his thesis on political internal relations included a lot of reading and tally marks. Back then he already thought “There must be a better way to do this”, so he built a database and started to get into informatics and digitisation. Now he is the head of the Centre for Digital Scholarship at the library of Leiden University.

Read more

We look at recommending articles to users

Daniel Kershaw works at Mendeley, where he uses text and data mining to recommend relevant articles to users. A lot of the issues in this work have to do with inconsistencies between data sources from different journals.

Read more

OpenMinTeD Partner presents VITIS pilot application at RDA’s IGAD pre-meeting

Conference and presentationThe 9th Plenary Meeting of the Research Data Alliance (RDA) took place in Barcelona, Spain, from 5 to 7 April 2017. The RDA Plenary Meetings constitute a major event where more than 4000 members from 100 countries come together to discuss, develop and promote data-sharing and data-driven research infrastructure through Working and Interest Groups. The Interest Group on Agricultural Data (IGAD) pre-meeting took place just a couple of days before the 9th RDA plenary meeting, from 3 to 4 April 2017 and attracted more than 100 participants from all over the world.

Read more

A two-fold approach to measuring impact

Mike Lauruhn works at Elsevier and uses text and data mining to help researchers measure their impact factor. More specifically, he wants to know if there is a link between using a database for Arabidopsis data and the likeliness of being cited.

Read more

New publication: A Framework for Collaborative Curation of Neuroscientific Literature

Women looking at computer screenFrontiers in Neuroinformatics has just released a new paper by O’Reilly, Iavarone and Hill. It describes a systematic framework to curate neuroscientific literature. This framework provides an easier and more reliable way to integrate published data into neuronal models. The work was done in the context of the OpenMinTeD and Blue Brain projects. 

Read more

OpenMinTeD partner presents VITIS pilot application at Agricultural University of Athens

presentation of VITISOn February 20th 2017, Agroknow had the pleasure to host a workshop at the premises of the Agricultural University of Athens (AUA). The workshop was organized together with colleagues from the Laboratory of Viticulture.

Read more

TDM and the reading revolution

Library

You will not catch Steven Claeyssens carrying a smartphone and he will always prefer a paper book to an e-reader. Yet he is the curator of digital collections at the National Library of the Netherlands. I interviewed him about his job, text and data mining (TDM) in the humanities and the role of libraries in the research landscape.

 

Read more

Providing insight into the structure of scientific papers

How is a scientific paper structured and how related is it to other papers? These are some of the things that Iana Atanassova of the University of Bourgogne Franche-Comte (Besancon, France) focuses on in her research. She uses text and data mining (TDM) to study full-text scientific articles. Studying these papers can be a challenge, as they are usually in a format that is hard to process.

Read more

From information society to knowledge society

Marc Bertin is assistant professor at the University of Toulouse uses text and data mining to study scientific papers. Text and data mining can help us move from an information society to a knowledge society, but not without open access to research papers.

 

 

 

Read more

Text mining for the discovery of small molecules

When scientists need information about the structure, name or properties of small molecules, they often turn to a high quality database called ChEBI. This database is largely curated manually and this process takes a lot of time. OpenMinTeD is working on a textmining application that can help to speed up the process, while maintaining the quality of the database. 

 

 

 

 

Read more

Text and data mining in history

Joris van Eijnatten is professor of cultural history at Utrecht University, The Netherlands. He has a fascination for numbers that not many historians have. Last year he was the research fellow for digital humanities at the National Library of The Netherlands, where he applied text and data mining to study the image people have of Europe based on newspapers. I interviewed him about text and data mining in humanities, his work and his personal romance with numbers.

 

Read more

Using textmining to spot innovation in biomedical sciences

shubhanshuWhat is the real novelty of a research paper? How do different researchers contribute to innovation? And does this change throughout their career? Shubhanshu Mishra of the University of Illionois uses textmining techniques to study the novelty of biomedical articles.

 

Read more

Learning software to systematically review articles

 Systematic review of medical research papers can lead to new knowledge and treatments of diseases. The existing software tools however, are very limited and often a lot of manual work is involved. Stephen Gilbert of Iowa State University uses artificial intelligence and machine learning to automate the process of systematic review.

Read more

Future TDM’s policy recommendations

logo-futuretdm-h150While discussions at the EU on copyright reform and an exception for text and data mining (TDM) are very much live, FutureTDM, a Commission funded project of TDM experts has, for the past year, already been gathering information, mapping the TDM landscape and listening to the wide variety of individuals and organisations involved in data analytics. The project has just produced the first in a series of reports, providing a range of stakeholders with recommendations to improve TDM uptake in the EU. This FutureTDM policy framework document sets out high-level principles and recommendations.

Read more

Studying interdisciplinarity

frederico-nanniFrederico Nanni was not always a text miner. He actually started out as a historian and then switched to digital humanities. During his PhD, he developed a method to detect interdisciplinary research, based on scientific abstracts. Now, he finds text mining fascinating and thinks more historians should learn how to do it.

Read more

Evaluating the impact of research

drahomira-hermannovaIt took some time for Drahomira Hermannova to see the value of her research topic, but now she thinks it is the best topic she could ever choose: using text and data mining to evaluate which research can change the world. Not only can this help scientists, it may change the way research is done altogether.

Read more

Text Mining for social sciences – tackling the challenges to make search systems smarter

9hi8ujmsdza-braden-collumIn the OpenMinTeD project, partners from different scientific communities are involved to make sure the OpenMinTeD infrastructure will address their needs. As regards the social sciences, a useful application for text mining is the improvement of literature search and information interlinking. To this end, three main challenges were identified: named entity recognition, automatic keyword assignment to texts and automatic detection of mentions of survey variables. This post gives an overview of these tasks and the progress of work so far.

 

Read more

I’m trying to help scientists do science

duma2Would you like to get more insight in the world of text and data miners? Daniel Duma is a PhD student at the Alan Turing Institute and the University of Edinburgh and he shares his story in a short movie. He is working on software that will recommend relevant papers to scientists writing papers.

Read more

Webinar on Text and Data Mining interoperability at the legal level

tdm-legal-level-webinarIf you want to do text and data mining in the EU, you run into a complex legal framework of copyright rules. During the OpenMinTeD webinar of November 23rd , this legal  framework, limits and opportunities were discussed with legal as well as non-legal TDM experts. Recordings of the webinar and the discussion are available online.

Read more

Things aren’t always what they seem: The PDF challenge (accepted)

Image CC-BY

There are situations where text miners might struggle with getting the textual data to perform the mining on in the first place. One problem for us is that most of scientific publications – especially in social sciences and humanities – are only available in PDF format, which is not suitable to be read and processed by computers. The OpenMinTeD social sciences work group accepted the challenge to work on this problem.

Read more

Sneak preview: the OpenMinTeD knowledge base for text and data mining

CCO license

CCO license

Are you looking for support or training for text and data mining? Then you’re at the right place! OpenMinTeD recently released a Knowledge Base, that will host open access support and training material. At the moment we are still in the process of uploading content, but you can already have a look.

Read more

White paper on community requirements for text and data mining

Image CCO

Text and data mining is important to different scientific communities, but what do these different user communities need to mine succesfully?  One of the aims of workpackage 4 of the OpenMinTeD project is to collect these requirements. This was done using  a combination of methods, including online surveys and focus groups. The results are summarized in the  ‘White paper on OpenMinTed Community Requirements’ that was finished last week.

Read more

Text mine millions of research papers with the CORE dataset

CORE dataset CORE is an aggregation service that harvests open access journals and repositories, institutional and disciplinary, from around the world. It offers one of the largest collections of scientific content via its Datasets, ready to be text-mined. We encourage everyone to use it as part of OpenMinTeD and beyond.

 

Read more

How the Future TDM workshop highlighted the draft exception must be improved for TDM to have a future in Europe

AcademicCircle-760x428For the legal geeks among us, it is now old news that the European Commission, after promising to modernise copyright, issued a rather unhinged and disappointing copyright review proposal aimed at creating what it claims to be a ‘well-functioning marketplace’.

Read more

Can Europe lead a data revolution in agriculture and food?

europeleadLet’s take a step to the near future.

A shared global data space for agriculture and food will propel the industry forward. Information will become available to all actors producing innovation.

Read more

Why Text Mining is often not Legal, but how it could be in the Future

ApprovedHi there, I’m Lucie Guibault, Associate Professor at the Institute for Information Law of the University of Amsterdam.

Over the past few years, I became increasingly aware of TDM as a research method in all fields of science and humanities. With the increase of computational capacity, of digital born information and the digitisation of collections, the use of TDM in research is on its way towards achieving tremendous societal and economic benefits. Think about all the new insights and cost savings that would otherwise not be possible. This means more scientific breakthroughs and a greater understanding of society.

Read more

Text and Data Mining Researchers present Studies at WOSP2016 workshop

IMG_7939_1On 22-23 June 2016, OpenMinTeD organised its third stakeholder workshop at the Joint Conference on Digital Libraries in Newark, just outside of New York City. The workshop, called “the International Workshop on Mining Scientific Publications,” was organised by the Open University for the fifth time (almost everytime in conjunction with JCDL) and featured speakers from OpenMinTeD, as well as speakers who presented their text and data mining research results.

Read more

LREC Workshop on Cross-Platform Text Mining and Natural Language Processing Interoperability

IMG_0729Our efforts towards improving interoperability in the communities of Text Mining (TM) and Natural Language (NLP) processing continue. OpenMinTeD organised a workshop on this subject at the International Conference on Language Resources and Evaluation (LREC) on 23 May 2016. Alessandro Di Bari (IBM) opened the workshop with a keynote on transferring ideas from the model driven approaches of software engineering to enhance interoperability in TM and NLP.  

Read more

Envisaging a Broader TDM Exception to Overcome the Pitfalls of Current Copyright Law in the EU

Copyright exception for TDMConducting TDM activities in the current legal context is very difficult. This is due to the unclear and uncoherent legal framework for copyright licences and to the highly fragmented landscape of copyright exceptions and limitations in the EU. In this blogpost, we’ll discuss the current legal context and what needs to be changed to open the paths for TDM in the EU. 

Read more

Mining Repositories: Assisting Researchers in their Text and Data mining Needs

IMG_1070

On 13 June 2016, the OpenMinTeD project organised its third stakeholder  workshop titled “Mining Repositories: How to assist the research and academic community in their text and data mining needs”. The workshop took place in Trinity College Dublin as part of the OpenRepositories Conference, and brought together repository managers from all over the world who are interested in text and data mining.

Read more

Berlin Buzzwords 2016: what was hot and what was not?

CkQvbQ-WEAApMGGThe seventh Berlin Buzzwords 2016, Germany‘s leading Conference on Open Source Big Data technologies, was held from 5-7 June at the Kulturbrauerei in Berlin. A very interesting venue for cultural events, under national trust protection, Kulturbrauerei is a spacious former brewery with a lot of courtyards and buildings.

Read more

e-Infrastructures in the language technology community get together

IMG_0738On 22 May 2016, OpenMinTeD held its second stakeholder workshop at the LREC conference in Portoroz, Slovenia. The workshop took place in the form of a roundtable, and brought together strategic players and stakeholders from the language technology community and neighboring areas. Stelios Piperidis (Athena Research Center / ILSP) led the discussion. Among the attendees were representatives from CLARIN-CZ, CLARIN-ERIC, OpenAire, ELDA and LAPPS Grid. 

 

 

Read more

Text mining in Agriculture: The AgroTagger Keyword Extractor

agrotaggerThe use of keywords is crucial for the description, organization, indexing, retrieval and sharing of research in every scientific field and agriculture is not excluded. However, manual annotation of research outcomes is time-consuming and error-prone so automatic methods for metadata annotation are always explored. AgroTagger is one of the tools facilitating the work of information and knowledge managers (among others) in the agri-food sector, by applying text-mining on top of agri-food research outcomes.

Read more

Text Mining projects in the Agri-Food sector

Can you text mine agricultural content?

tuscany-428041_1280“Absolutely!” is the answer that AgroKnow will give you. And they can prove it! AgroKnow is one of the partners in the OpenMinTeD projects who are already very active in projects which apply text mining technologies to the agricultural sector.

Read more

We are looking for researchers in frequent need of searching and accessing textual content

searchingAre you a researcher in frequent need of searching and accessing textual content? Does your research involve looking for information in repositories of publications, reports, patents, and other textual content archives?

Then we are looking for your input!

Read more

We are looking for developers of TDM-powered applications

developerDoes your company develop text-mining powered applications? Would you benefit from a platform that provides access to a variety of text mining tools and components, along with the possibility to examine their specifications and performance? Are you an application developer in need of integrating text-mining services in your software? Then we are looking for your input! 

Read more

We are looking for organisations that want to make their data available for text mining

data-1Does your organisation have tons of data that you want to make available for text and data mining? Would you benefit from an infrastructure that brings your data together with text and data mining tools? Are you a repository manager, a publisher, or do you represent any other type of content collection?

Then we are looking for your input!

Read more

We are looking for text and data miners

researcher in tdmAre you a researcher in text and data mining? Would you benefit from making your mining software widely discoverable and interoperable, and would you like to easily explore and evaluate the work of other researchers in your field?

Then we are looking for your input!

Read more

Join us at the 5th International Workshop on Mining Scientific Publications

In association with the OpenMinTeD project, The Open University organises the 5th International Workshop on Mining Scientific Publications (WOSP) at JCDL 2016

The workshop is organised by Open University and aims to give a useful overview of Text and Data Mining (TDM). The topics of the workshop are organised around the following themes:

Read more

Training Course: Mining Social Media Content with GATE

The 9th GATE training course will be taught this June, at The University of Sheffield, and we are looking for you to join us! GATE, or the General Architecture for Text Engineering, is a mature, comprehensive suite of tools for information extraction, natural language processing and related tasks that has been developed continuously since 1995 at the University of Sheffield. The course is open to industrial and academic participants of any ability or experience level.

Read more

Open Science: What does it mean and how do text and data miners benefit?

tokyo1

On February 29th  researchers from around the world gathered in Tokyo for the data sharing symposium “Data-driven Science – The trigger of Scientific development”. It’s been a place of vibrant discussion of opportunities and challenges brought by current trends, such as open science, data-driven research and big data. OpenMinTeD, which perceives openness as one of its basic principles, participated in this event.

Read more

Leave a comment

You must be logged in to post a comment.