In an interview about the international ELIXIR project, we learn how research works across national borders.
More and more research involves teams that are spread out all over the world. Cross-border access to documents is thus essential, meaning that authentication and authorisation need to work internationally.
We spoke to Tommi Nyrönen of ELIXIR Finland (see box), who explained just how crucial it is for international research to be able to access files across national borders.
What sort of things are the life scientists using the ELIXIR infrastructure working on?
Tommi Nyrönen: In Finland, our node hosted at the CSC-IT Center for Science is working with local organisations that do, for example, molecular-level studies of cancer, genetic diagnosis of dogs and horses, and basic research into resolving the birch tree genome.
What are the biggest challenges you face in the ELIXIR project?
The amount of data produced by research of this type is one challenge, but it is also demanding in terms of expertise. The data created have to be annotated and structured according to international standards and shared collectively with other countries.
There are some rare diseases caused by genetic variation that must be put into the larger reference data context before they are properly diagnosed.
What is special about working on a pan-European project?
It is about challenges that no country can manage in isolation. The life science ecosystem is global. The application of genetics in healthcare is emerging at an accelerated speed as the public starts to understand the possibilities in light of real-world examples. There are some rare diseases caused by genetic variation that must be put into the larger reference data context before they are properly diagnosed. Doctors are seeking to diagnose the root of these diseases, and they will be able to optimise treatments when provided with molecular-level data. Now, this is a real challenge. How do we make data available online for research to create services that pull intelligence for those who need it when they need it? Once a service has been created relying on the data, how do we keep the whole distributed infrastructure together so that critical operators like healthcare providers can build on it? These types of challenges outline what ELIXIR nodes like CSC need to make happen within ELIXIR in collaboration with the European e-Infrastructure.
How are you working together with GÉANT at the CSC-IT Center for Science to overcome these challenges?
Research data volumes have to move seamlessly around the continent. The reference data for human genetic variation alone is hundreds of terabytes. The data created and published need to be shared and channelled back into research effectively. On top of that, ELIXIR provides data sets describing other animals, insects, plants, bacteria, viruses and all other life forms on the planet. Volumes more than double every year. This is where GÉANT and national research and education network providers play a key infrastructure role. The goal is to bring online processing of biological data to researchers where they need it and when they need it. This way, we are eventually able to create biotechnology applications. When the discovery of existing research data stored in ELIXIR data centres is made easier, savings are also possible as duplication of basic research reduces.
The goal is to bring online processing of biological data to researchers where they need it and when they need it.
What role does eduGAIN play in ELIXIR's efforts to support life scientists?
In your view, what role should eduGAIN play for research going forward?
I think eduGAIN and GÉANT enabling user actions have been instrumental in moving forward together to work on life science data infrastructure challenges. Especially for open but sensitive data, we must offer a level of security to ensure that users with access to data are recognised researchers. Scientific communities stand at different starting points in the implementation of federated identity management technologies. Proactivity from the eduGAIN provider to understand the needs of the life science community is highly desirable, and vice versa; the life science community needs to learn what it can expect eduGAIN to deliver.