%1 services
Name | Description | ELIXIR Node |
---|---|---|
A microbial metabolism resource for Systems Biology
|
The microbial world provides an abundant source of biological catalysts for chemicals of medical and economic interest such as pharmaceuticals and biofuels. However, a sustainable resource for systems biology, that integrates experimental and predicted data on microbial metabolism, is still lacking. This Implementation Study defines the requirements for a European Bioinformatics resource for microbial metabolism as well as providing a practical demonstration of how existing funded resources from ELIXIR partners could be integrated to meet those needs. This Study brings together complimentary data types:
These connections via RDF standards have many underpinning implications in medicine and economy such as pharmaceuticals and biofuels. This study has now been completed, the end report is available here. An article in the ELIXIR F1000R gateway will be made available shortly. Webinar summarising the outcomes |
ELIXIR Switzerland, ELIXIR France, EMBL-EBI |
A microbial metabolism resource for Systems Biology
|
The microbial world provides an abundant source of biological catalysts for chemicals of medical and economic interest such as pharmaceuticals and biofuels. However, a sustainable resource for systems biology, that integrates experimental and predicted data on microbial metabolism, is still lacking. This Implementation Study defines the requirements for a European Bioinformatics resource for microbial metabolism as well as providing a practical demonstration of how existing funded resources from ELIXIR partners could be integrated to meet those needs. This Study brings together complimentary data types:
These connections via RDF standards have many underpinning implications in medicine and economy such as pharmaceuticals and biofuels. This study has now been completed, the end report is available here. An article in the ELIXIR F1000R gateway will be made available shortly. Webinar summarising the outcomes |
ELIXIR Switzerland, ELIXIR France, EMBL-EBI |
A microbial metabolism resource for Systems Biology
|
The microbial world provides an abundant source of biological catalysts for chemicals of medical and economic interest such as pharmaceuticals and biofuels. However, a sustainable resource for systems biology, that integrates experimental and predicted data on microbial metabolism, is still lacking. This Implementation Study defines the requirements for a European Bioinformatics resource for microbial metabolism as well as providing a practical demonstration of how existing funded resources from ELIXIR partners could be integrated to meet those needs. This Study brings together complimentary data types:
These connections via RDF standards have many underpinning implications in medicine and economy such as pharmaceuticals and biofuels. This study has now been completed, the end report is available here. An article in the ELIXIR F1000R gateway will be made available shortly. Webinar summarising the outcomes |
ELIXIR Switzerland, ELIXIR France, EMBL-EBI |
Administration and support for Core Data Resource (CDR), Deposition Database (EDD) portfolio and Community Data Resources (2022-23)
|
Supporting the establishment, and continuing the monitoring of global partnerships in the Aim: Global partnerships for sustainable core data resources.
|
EMBL-EBI, ELIXIR Switzerland |
Administration and support for Core Data Resource (CDR), Deposition Database (EDD) portfolio and Community Data Resources (2022-23)
|
Supporting the establishment, and continuing the monitoring of global partnerships in the Aim: Global partnerships for sustainable core data resources.
|
EMBL-EBI, ELIXIR Switzerland |
Annotation and curation of human genomic variations (2018-Variations)
|
This implementation study aims to understand the existing infrastructure, resources and protocols for human genome variation annotation and curation. Work focuses on processes that can be automated to support interpretation of high-throughput genome sequencing results. The outcome will be a report that describes the current status within ELIXIR member states, identified requirements and potential solutions. The report will be part of the ELIXIR Human Genomics and Translational Data Services strategy and roadmap. This project coordinates with ELIXIR Data Platform on surveys regarding data archives and other resources. It also consults with Compute and Tools Platforms on potential models for resourcing, scaling and providing portable tools based on the identified requirements for running data analysis workflows. The aim is also to work in close collaboration with the ELIXIR Interoperability Platform to understand the future requirements on managing variation annotation and their interpretation. This implementation study will also aim to support the coordination between ELIXIR Human Genomics and Translational Data use case and the relevant GA4GH technical work streams. The expected outcome is a better alignment of ELIXIR activities with those in the GA4GH and direct communication with relevant resources outside of ELIXIR such as ClinVar. |
ELIXIR Finland, EMBL-EBI, ELIXIR Switzerland, ELIXIR UK, ELIXIR Norway, ELIXIR Italy |
Annotation and curation of human genomic variations (2018-Variations)
|
This implementation study aims to understand the existing infrastructure, resources and protocols for human genome variation annotation and curation. Work focuses on processes that can be automated to support interpretation of high-throughput genome sequencing results. The outcome will be a report that describes the current status within ELIXIR member states, identified requirements and potential solutions. The report will be part of the ELIXIR Human Genomics and Translational Data Services strategy and roadmap. This project coordinates with ELIXIR Data Platform on surveys regarding data archives and other resources. It also consults with Compute and Tools Platforms on potential models for resourcing, scaling and providing portable tools based on the identified requirements for running data analysis workflows. The aim is also to work in close collaboration with the ELIXIR Interoperability Platform to understand the future requirements on managing variation annotation and their interpretation. This implementation study will also aim to support the coordination between ELIXIR Human Genomics and Translational Data use case and the relevant GA4GH technical work streams. The expected outcome is a better alignment of ELIXIR activities with those in the GA4GH and direct communication with relevant resources outside of ELIXIR such as ClinVar. |
ELIXIR Finland, EMBL-EBI, ELIXIR Switzerland, ELIXIR UK, ELIXIR Norway, ELIXIR Italy |
Annotation and curation of human genomic variations (2018-Variations)
|
This implementation study aims to understand the existing infrastructure, resources and protocols for human genome variation annotation and curation. Work focuses on processes that can be automated to support interpretation of high-throughput genome sequencing results. The outcome will be a report that describes the current status within ELIXIR member states, identified requirements and potential solutions. The report will be part of the ELIXIR Human Genomics and Translational Data Services strategy and roadmap. This project coordinates with ELIXIR Data Platform on surveys regarding data archives and other resources. It also consults with Compute and Tools Platforms on potential models for resourcing, scaling and providing portable tools based on the identified requirements for running data analysis workflows. The aim is also to work in close collaboration with the ELIXIR Interoperability Platform to understand the future requirements on managing variation annotation and their interpretation. This implementation study will also aim to support the coordination between ELIXIR Human Genomics and Translational Data use case and the relevant GA4GH technical work streams. The expected outcome is a better alignment of ELIXIR activities with those in the GA4GH and direct communication with relevant resources outside of ELIXIR such as ClinVar. |
ELIXIR Finland, EMBL-EBI, ELIXIR Switzerland, ELIXIR UK, ELIXIR Norway, ELIXIR Italy |
Annotation and curation of human genomic variations (2018-Variations)
|
This implementation study aims to understand the existing infrastructure, resources and protocols for human genome variation annotation and curation. Work focuses on processes that can be automated to support interpretation of high-throughput genome sequencing results. The outcome will be a report that describes the current status within ELIXIR member states, identified requirements and potential solutions. The report will be part of the ELIXIR Human Genomics and Translational Data Services strategy and roadmap. This project coordinates with ELIXIR Data Platform on surveys regarding data archives and other resources. It also consults with Compute and Tools Platforms on potential models for resourcing, scaling and providing portable tools based on the identified requirements for running data analysis workflows. The aim is also to work in close collaboration with the ELIXIR Interoperability Platform to understand the future requirements on managing variation annotation and their interpretation. This implementation study will also aim to support the coordination between ELIXIR Human Genomics and Translational Data use case and the relevant GA4GH technical work streams. The expected outcome is a better alignment of ELIXIR activities with those in the GA4GH and direct communication with relevant resources outside of ELIXIR such as ClinVar. |
ELIXIR Finland, EMBL-EBI, ELIXIR Switzerland, ELIXIR UK, ELIXIR Norway, ELIXIR Italy |
Annotation and curation of human genomic variations (2018-Variations)
|
This implementation study aims to understand the existing infrastructure, resources and protocols for human genome variation annotation and curation. Work focuses on processes that can be automated to support interpretation of high-throughput genome sequencing results. The outcome will be a report that describes the current status within ELIXIR member states, identified requirements and potential solutions. The report will be part of the ELIXIR Human Genomics and Translational Data Services strategy and roadmap. This project coordinates with ELIXIR Data Platform on surveys regarding data archives and other resources. It also consults with Compute and Tools Platforms on potential models for resourcing, scaling and providing portable tools based on the identified requirements for running data analysis workflows. The aim is also to work in close collaboration with the ELIXIR Interoperability Platform to understand the future requirements on managing variation annotation and their interpretation. This implementation study will also aim to support the coordination between ELIXIR Human Genomics and Translational Data use case and the relevant GA4GH technical work streams. The expected outcome is a better alignment of ELIXIR activities with those in the GA4GH and direct communication with relevant resources outside of ELIXIR such as ClinVar. |
ELIXIR Finland, EMBL-EBI, ELIXIR Switzerland, ELIXIR UK, ELIXIR Norway, ELIXIR Italy |
Annotation and curation of human genomic variations (2018-Variations)
|
This implementation study aims to understand the existing infrastructure, resources and protocols for human genome variation annotation and curation. Work focuses on processes that can be automated to support interpretation of high-throughput genome sequencing results. The outcome will be a report that describes the current status within ELIXIR member states, identified requirements and potential solutions. The report will be part of the ELIXIR Human Genomics and Translational Data Services strategy and roadmap. This project coordinates with ELIXIR Data Platform on surveys regarding data archives and other resources. It also consults with Compute and Tools Platforms on potential models for resourcing, scaling and providing portable tools based on the identified requirements for running data analysis workflows. The aim is also to work in close collaboration with the ELIXIR Interoperability Platform to understand the future requirements on managing variation annotation and their interpretation. This implementation study will also aim to support the coordination between ELIXIR Human Genomics and Translational Data use case and the relevant GA4GH technical work streams. The expected outcome is a better alignment of ELIXIR activities with those in the GA4GH and direct communication with relevant resources outside of ELIXIR such as ClinVar. |
ELIXIR Finland, EMBL-EBI, ELIXIR Switzerland, ELIXIR UK, ELIXIR Norway, ELIXIR Italy |
APICURON integration with curation databases (2022-23)
|
Biocuration plays a key role in making research data available to the scientific community in a |
ELIXIR Italy, EMBL-EBI, ELIXIR UK, ELIXIR Germany |
APICURON integration with curation databases (2022-23)
|
Biocuration plays a key role in making research data available to the scientific community in a |
ELIXIR Italy, EMBL-EBI, ELIXIR UK, ELIXIR Germany |
APICURON integration with curation databases (2022-23)
|
Biocuration plays a key role in making research data available to the scientific community in a |
ELIXIR Italy, EMBL-EBI, ELIXIR UK, ELIXIR Germany |
APICURON integration with curation databases (2022-23)
|
Biocuration plays a key role in making research data available to the scientific community in a |
ELIXIR Italy, EMBL-EBI, ELIXIR UK, ELIXIR Germany |
Apple as a Model for Genomic Information Exchange
|
Apple is one of the most famous fruits globally and occupies a central position in folklore, culture, and art. Apple cultivars have retained high genetic and phenotypic diversity, evidenced by the high number of apple varieties cultivated today. The economic and cultural importance of apple has driven efforts to catalogue and exploit this genetic diversity, but few of these data are currently integrated into ELIXIR resources. We propose a data implementation study to integrate the high quality apple reference genome and its associated catalogue of genetic diversity, representing the most widely cultivated apple varieties around the world. We will use apple as a case study for managing the growing number of ‘multi-genome’ fruit projects, testing and where necessary, improving tools to streamline data import and exchange between ELIXIR supported resources, specifically BioSamples, ENA, EVA, ORCAE and Ensembl Plants. |
ELIXIR Italy, ELIXIR Belgium , EMBL-EBI |
Apple as a Model for Genomic Information Exchange
|
Apple is one of the most famous fruits globally and occupies a central position in folklore, culture, and art. Apple cultivars have retained high genetic and phenotypic diversity, evidenced by the high number of apple varieties cultivated today. The economic and cultural importance of apple has driven efforts to catalogue and exploit this genetic diversity, but few of these data are currently integrated into ELIXIR resources. We propose a data implementation study to integrate the high quality apple reference genome and its associated catalogue of genetic diversity, representing the most widely cultivated apple varieties around the world. We will use apple as a case study for managing the growing number of ‘multi-genome’ fruit projects, testing and where necessary, improving tools to streamline data import and exchange between ELIXIR supported resources, specifically BioSamples, ENA, EVA, ORCAE and Ensembl Plants. |
ELIXIR Italy, ELIXIR Belgium , EMBL-EBI |
Apple as a Model for Genomic Information Exchange
|
Apple is one of the most famous fruits globally and occupies a central position in folklore, culture, and art. Apple cultivars have retained high genetic and phenotypic diversity, evidenced by the high number of apple varieties cultivated today. The economic and cultural importance of apple has driven efforts to catalogue and exploit this genetic diversity, but few of these data are currently integrated into ELIXIR resources. We propose a data implementation study to integrate the high quality apple reference genome and its associated catalogue of genetic diversity, representing the most widely cultivated apple varieties around the world. We will use apple as a case study for managing the growing number of ‘multi-genome’ fruit projects, testing and where necessary, improving tools to streamline data import and exchange between ELIXIR supported resources, specifically BioSamples, ENA, EVA, ORCAE and Ensembl Plants. |
ELIXIR Italy, ELIXIR Belgium , EMBL-EBI |
BILS-ProteomeXchange integration using EUDAT resources
|
While the current data deluge creates a need for distributed data storage and replication, it is essential to enable data access through a single access interface. This project aimed to integrate the raw data repositories for mass spectrometry (MS) proteomics data run by BILS (Sweden) and ProteomeXchange (via the PRIDE database, EMBL-EBI, UK), using the European infrastructure EUDAT, and served as an example to connect national data storage services and international repositories through ELIXIR. It also showed the potential of collaboration among research infrastructures and e-infrastructures to better manage the data deluge, and helped to evaluate the requirements of such federated systems. Other Implementation Studies:
The study is now complete, the end report is available here. Webinar summarising the outcomesThe webinar was recorded in May 2015). See the slides. |
ELIXIR Sweden, EMBL-EBI |
BILS-ProteomeXchange integration using EUDAT resources
|
While the current data deluge creates a need for distributed data storage and replication, it is essential to enable data access through a single access interface. This project aimed to integrate the raw data repositories for mass spectrometry (MS) proteomics data run by BILS (Sweden) and ProteomeXchange (via the PRIDE database, EMBL-EBI, UK), using the European infrastructure EUDAT, and served as an example to connect national data storage services and international repositories through ELIXIR. It also showed the potential of collaboration among research infrastructures and e-infrastructures to better manage the data deluge, and helped to evaluate the requirements of such federated systems. Other Implementation Studies:
The study is now complete, the end report is available here. Webinar summarising the outcomesThe webinar was recorded in May 2015). See the slides. |
ELIXIR Sweden, EMBL-EBI |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Community and Data Management network engagement (2022-23)
|
To engage the Communities and increase the uptake of services and to align the Data Platform with other activities (such as FAIRplus, CONVERGE, EIP and the work of the Registries Focus Group) it is proposed to:
|
ELIXIR Italy, ELIXIR France, ELIXIR Netherlands, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Sweden, ELIXIR Greece, ELIXIR Belgium |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Comparison, benchmarking and dissemination of proteomics data analysis pipelines
|
This project will be led by the ELIXIR Proteomics Community in collaboration with members of the Metabolomics Community and three ELIXIR platforms. High-throughput proteomics has become a popular choice in biological, biomedical and clinical studies and led to the development of hundreds of bioinformatics tools and data analysis pipelines. Given their large diversity, there is a urgent need to compare and benchmark different software pipelines over a large data spectrum. This study aims to create the framework to benchmark proteomics data analysis workflows, to be built upon and improve resources from ELIXIR Tool, Data and Compute platforms by creating an interface between them linked with public proteomics data and open source stand-alone software and pipelines. The involved data will be annotated with at least EOSC minimum information according to ELIXIR metadata standards. Our benchmarking will identify robust workflows and therefore nurture the proteomics community with high quality standards required for reproducible research and clinical applications. |
ELIXIR Denmark, EMBL-EBI, ELIXIR Netherlands, ELIXIR Spain, ELIXIR France, ELIXIR Sweden, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR Germany |
Curation of Lipid Pathways by Domain Experts to Generate Open Access Biology Resources (2022-23)
|
The primary objective is to curate high-quality biochemical knowledge (reactions/enzymes/ genes) on lipid metabolism, working with lipid experts worldwide. The data will be housed in a shared resource created in partnership between two ELIXIR data services, LIPID MAPS (ELIXIR-UK), and WikiPathways (ELIXIR-NL).
|
ELIXIR UK, ELIXIR Netherlands |
Curation of Lipid Pathways by Domain Experts to Generate Open Access Biology Resources (2022-23)
|
The primary objective is to curate high-quality biochemical knowledge (reactions/enzymes/ genes) on lipid metabolism, working with lipid experts worldwide. The data will be housed in a shared resource created in partnership between two ELIXIR data services, LIPID MAPS (ELIXIR-UK), and WikiPathways (ELIXIR-NL).
|
ELIXIR UK, ELIXIR Netherlands |
Data Integration (2022-23)
|
The goal of this task is to explore extending the connected ecosystem to any ELIXIR data resource, incorporating and aggregating more orphan data and human data, and providing connectivity with other elements of the ELIXIR infrastructure.
|
ELIXIR Switzerland, ELIXIR Italy, EMBL-EBI, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR UK |
Data Integration (2022-23)
|
The goal of this task is to explore extending the connected ecosystem to any ELIXIR data resource, incorporating and aggregating more orphan data and human data, and providing connectivity with other elements of the ELIXIR infrastructure.
|
ELIXIR Switzerland, ELIXIR Italy, EMBL-EBI, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR UK |
Data Integration (2022-23)
|
The goal of this task is to explore extending the connected ecosystem to any ELIXIR data resource, incorporating and aggregating more orphan data and human data, and providing connectivity with other elements of the ELIXIR infrastructure.
|
ELIXIR Switzerland, ELIXIR Italy, EMBL-EBI, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR UK |
Data Integration (2022-23)
|
The goal of this task is to explore extending the connected ecosystem to any ELIXIR data resource, incorporating and aggregating more orphan data and human data, and providing connectivity with other elements of the ELIXIR infrastructure.
|
ELIXIR Switzerland, ELIXIR Italy, EMBL-EBI, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR UK |
Data Integration (2022-23)
|
The goal of this task is to explore extending the connected ecosystem to any ELIXIR data resource, incorporating and aggregating more orphan data and human data, and providing connectivity with other elements of the ELIXIR infrastructure.
|
ELIXIR Switzerland, ELIXIR Italy, EMBL-EBI, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR UK |
Data Integration (2022-23)
|
The goal of this task is to explore extending the connected ecosystem to any ELIXIR data resource, incorporating and aggregating more orphan data and human data, and providing connectivity with other elements of the ELIXIR infrastructure.
|
ELIXIR Switzerland, ELIXIR Italy, EMBL-EBI, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR UK |
Development of the PROCOGNATE database
|
The functional annotation of enzymes is an interesting but nontrivial task requiring experimental data and scientists' manual revision for optimal results. Due to the increasing amount of structural and sequence data, it is more difficult to do the case- by-case analysis, and there is a high demand for automated solutions. One of the first attempts to collect such data was the PROCOGNATE database (Bashton et al., 2008, DOI: 10.1093/nar/gkm611) followed by the development of the Transform- MinER tool (Tyzack et al., 2018, DOI: 10.1093/bioinformatics/bty394) which searches the reactants and products in KEGG database and matches them with ligand-protein complexes structures from PDB database. The current dataset has around 150,000 cases in nearly 13,000 unique PDBs. The current dataset's usefulness for researchers is limited mainly through two factors: 1) the database contains only basic information about the mapping, 2) it is available only as a CSV file. The first limitation will be solved by enriching the original dataset with multiple structural features, such as pockets, tunnels, and interactions, directly related to the binding and unbinding of the ligands. The calculations are already ongoing and will be finished in the following months. The second limitation will be solved by developing the web user interface, which will present the data in a complete form using 3-D structure feature visualizations. The main aim of this project is to kick-off the database development by: 1) acquisition of the pipeline used to construct the PROCOGNATE dataset, its merge with the pipeline for structure features assessment and preparation for regular automated updates; 2) design the database structure and import all the data; and 3) design of the user interface of the database. Once these stages are finished, the user interface development will begin and will continue till approx. Q2 2022. |
ELIXIR Czech Republic, EMBL-EBI |
Development of the PROCOGNATE database
|
The functional annotation of enzymes is an interesting but nontrivial task requiring experimental data and scientists' manual revision for optimal results. Due to the increasing amount of structural and sequence data, it is more difficult to do the case- by-case analysis, and there is a high demand for automated solutions. One of the first attempts to collect such data was the PROCOGNATE database (Bashton et al., 2008, DOI: 10.1093/nar/gkm611) followed by the development of the Transform- MinER tool (Tyzack et al., 2018, DOI: 10.1093/bioinformatics/bty394) which searches the reactants and products in KEGG database and matches them with ligand-protein complexes structures from PDB database. The current dataset has around 150,000 cases in nearly 13,000 unique PDBs. The current dataset's usefulness for researchers is limited mainly through two factors: 1) the database contains only basic information about the mapping, 2) it is available only as a CSV file. The first limitation will be solved by enriching the original dataset with multiple structural features, such as pockets, tunnels, and interactions, directly related to the binding and unbinding of the ligands. The calculations are already ongoing and will be finished in the following months. The second limitation will be solved by developing the web user interface, which will present the data in a complete form using 3-D structure feature visualizations. The main aim of this project is to kick-off the database development by: 1) acquisition of the pipeline used to construct the PROCOGNATE dataset, its merge with the pipeline for structure features assessment and preparation for regular automated updates; 2) design the database structure and import all the data; and 3) design of the user interface of the database. Once these stages are finished, the user interface development will begin and will continue till approx. Q2 2022. |
ELIXIR Czech Republic, EMBL-EBI |
Establishment of an ELIXIR Contextual Data Clearinghouse
|
The objective is to develop and deploy an “ELIXIR Contextual Data Clearinghouse (clearinghouse)” for extending, correcting and improving publicly available annotations on records in sample and sequencing data resources. Contextual data is fundamental for FAIR data in ELIXIR. So far, little attention has been paid to connect and exchange curated contextual data to improve the quality of primary and secondary and data resources within the metagenomics domain. In this proposal, we will build a “clearinghouse” to allow seamless exchange of contextual data between ELIXIR data resources. The project will strengthen the collaborations between these ELIXIR resources, build synergies to improve the quality and impact of the content and, not least, build more sustainable data resources. The proposed project will be an excellent showcase on how the outcomes of the EXCELERATE Marine Metagenomics Use Case, together with established and new ELIXIR data resources, can improve the quality and impact of publicly available data, especially towards the marine domain. |
ELIXIR Norway, EMBL-EBI, ELIXIR Germany, ELIXIR Italy |
Establishment of an ELIXIR Contextual Data Clearinghouse
|
The objective is to develop and deploy an “ELIXIR Contextual Data Clearinghouse (clearinghouse)” for extending, correcting and improving publicly available annotations on records in sample and sequencing data resources. Contextual data is fundamental for FAIR data in ELIXIR. So far, little attention has been paid to connect and exchange curated contextual data to improve the quality of primary and secondary and data resources within the metagenomics domain. In this proposal, we will build a “clearinghouse” to allow seamless exchange of contextual data between ELIXIR data resources. The project will strengthen the collaborations between these ELIXIR resources, build synergies to improve the quality and impact of the content and, not least, build more sustainable data resources. The proposed project will be an excellent showcase on how the outcomes of the EXCELERATE Marine Metagenomics Use Case, together with established and new ELIXIR data resources, can improve the quality and impact of publicly available data, especially towards the marine domain. |
ELIXIR Norway, EMBL-EBI, ELIXIR Germany, ELIXIR Italy |
Establishment of an ELIXIR Contextual Data Clearinghouse
|
The objective is to develop and deploy an “ELIXIR Contextual Data Clearinghouse (clearinghouse)” for extending, correcting and improving publicly available annotations on records in sample and sequencing data resources. Contextual data is fundamental for FAIR data in ELIXIR. So far, little attention has been paid to connect and exchange curated contextual data to improve the quality of primary and secondary and data resources within the metagenomics domain. In this proposal, we will build a “clearinghouse” to allow seamless exchange of contextual data between ELIXIR data resources. The project will strengthen the collaborations between these ELIXIR resources, build synergies to improve the quality and impact of the content and, not least, build more sustainable data resources. The proposed project will be an excellent showcase on how the outcomes of the EXCELERATE Marine Metagenomics Use Case, together with established and new ELIXIR data resources, can improve the quality and impact of publicly available data, especially towards the marine domain. |
ELIXIR Norway, EMBL-EBI, ELIXIR Germany, ELIXIR Italy |
Establishment of an ELIXIR Contextual Data Clearinghouse
|
The objective is to develop and deploy an “ELIXIR Contextual Data Clearinghouse (clearinghouse)” for extending, correcting and improving publicly available annotations on records in sample and sequencing data resources. Contextual data is fundamental for FAIR data in ELIXIR. So far, little attention has been paid to connect and exchange curated contextual data to improve the quality of primary and secondary and data resources within the metagenomics domain. In this proposal, we will build a “clearinghouse” to allow seamless exchange of contextual data between ELIXIR data resources. The project will strengthen the collaborations between these ELIXIR resources, build synergies to improve the quality and impact of the content and, not least, build more sustainable data resources. The proposed project will be an excellent showcase on how the outcomes of the EXCELERATE Marine Metagenomics Use Case, together with established and new ELIXIR data resources, can improve the quality and impact of publicly available data, especially towards the marine domain. |
ELIXIR Norway, EMBL-EBI, ELIXIR Germany, ELIXIR Italy |
Extending open proteomics data analysis pipelines in the cloud: Additional tools and focus on scalability, supporting the dramatic growth of public proteomics data
|
An ELIXIR implementation study started in February 2017, as a collaboration between EMBL-EBI and ELIXIR-DE. Its main objective is to develop open, robust, scalable and reproducible proteomics data analysis workflows based on OpenMS, directly connected to the PRIDE database (an ELIXIR core data resource) and to deploy these pipelines in the EMBL-EBI "Embassy Cloud" as a proof of concept. Building on this work, we here propose a follow-up project that has three objectives:
The overarching goal is that these tools can be deployed in other cloud infrastructures, and can be easily reused by anyone in the community, thus bringing the users closer to the tools, and the tools closer to the data. Impact of the studyThe outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures, including new quality control features based on OpenMS. Impact – increased facility for proteomics analysis across multiple cloud platforms – all with increased degree of quality control. |
ELIXIR Belgium , EMBL-EBI, ELIXIR Germany, ELIXIR France, ELIXIR Spain |
Extending open proteomics data analysis pipelines in the cloud: Additional tools and focus on scalability, supporting the dramatic growth of public proteomics data
|
An ELIXIR implementation study started in February 2017, as a collaboration between EMBL-EBI and ELIXIR-DE. Its main objective is to develop open, robust, scalable and reproducible proteomics data analysis workflows based on OpenMS, directly connected to the PRIDE database (an ELIXIR core data resource) and to deploy these pipelines in the EMBL-EBI "Embassy Cloud" as a proof of concept. Building on this work, we here propose a follow-up project that has three objectives:
The overarching goal is that these tools can be deployed in other cloud infrastructures, and can be easily reused by anyone in the community, thus bringing the users closer to the tools, and the tools closer to the data. Impact of the studyThe outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures, including new quality control features based on OpenMS. Impact – increased facility for proteomics analysis across multiple cloud platforms – all with increased degree of quality control. |
ELIXIR Belgium , EMBL-EBI, ELIXIR Germany, ELIXIR France, ELIXIR Spain |
Extending open proteomics data analysis pipelines in the cloud: Additional tools and focus on scalability, supporting the dramatic growth of public proteomics data
|
An ELIXIR implementation study started in February 2017, as a collaboration between EMBL-EBI and ELIXIR-DE. Its main objective is to develop open, robust, scalable and reproducible proteomics data analysis workflows based on OpenMS, directly connected to the PRIDE database (an ELIXIR core data resource) and to deploy these pipelines in the EMBL-EBI "Embassy Cloud" as a proof of concept. Building on this work, we here propose a follow-up project that has three objectives:
The overarching goal is that these tools can be deployed in other cloud infrastructures, and can be easily reused by anyone in the community, thus bringing the users closer to the tools, and the tools closer to the data. Impact of the studyThe outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures, including new quality control features based on OpenMS. Impact – increased facility for proteomics analysis across multiple cloud platforms – all with increased degree of quality control. |
ELIXIR Belgium , EMBL-EBI, ELIXIR Germany, ELIXIR France, ELIXIR Spain |
Extending open proteomics data analysis pipelines in the cloud: Additional tools and focus on scalability, supporting the dramatic growth of public proteomics data
|
An ELIXIR implementation study started in February 2017, as a collaboration between EMBL-EBI and ELIXIR-DE. Its main objective is to develop open, robust, scalable and reproducible proteomics data analysis workflows based on OpenMS, directly connected to the PRIDE database (an ELIXIR core data resource) and to deploy these pipelines in the EMBL-EBI "Embassy Cloud" as a proof of concept. Building on this work, we here propose a follow-up project that has three objectives:
The overarching goal is that these tools can be deployed in other cloud infrastructures, and can be easily reused by anyone in the community, thus bringing the users closer to the tools, and the tools closer to the data. Impact of the studyThe outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures, including new quality control features based on OpenMS. Impact – increased facility for proteomics analysis across multiple cloud platforms – all with increased degree of quality control. |
ELIXIR Belgium , EMBL-EBI, ELIXIR Germany, ELIXIR France, ELIXIR Spain |
Extending open proteomics data analysis pipelines in the cloud: Additional tools and focus on scalability, supporting the dramatic growth of public proteomics data
|
An ELIXIR implementation study started in February 2017, as a collaboration between EMBL-EBI and ELIXIR-DE. Its main objective is to develop open, robust, scalable and reproducible proteomics data analysis workflows based on OpenMS, directly connected to the PRIDE database (an ELIXIR core data resource) and to deploy these pipelines in the EMBL-EBI "Embassy Cloud" as a proof of concept. Building on this work, we here propose a follow-up project that has three objectives:
The overarching goal is that these tools can be deployed in other cloud infrastructures, and can be easily reused by anyone in the community, thus bringing the users closer to the tools, and the tools closer to the data. Impact of the studyThe outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures, including new quality control features based on OpenMS. Impact – increased facility for proteomics analysis across multiple cloud platforms – all with increased degree of quality control. |
ELIXIR Belgium , EMBL-EBI, ELIXIR Germany, ELIXIR France, ELIXIR Spain |
FAIRness of the current ELIXIR Core resources: Application (and test) of newly available FAIR metrics, and identification of steps to increase interoperability (2018-FAIRCDR)
|
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reusability of digital resources. While the principles have enjoyed rapid uptake across communities (ELIXIR, G20, EOSC, H2020, NIH), the implementation details remain unclear. Recently, we developed a prototype software infrastructure and a set of metrics to assess the FAIRness of digital resources (http://fairmetrics.org/). In this ELIXIR Implementation Study we will put these into practice for the ELIXIR community by starting to FAIRify ELIXIR Core Data Resources ArrayExpress, ENA, PDBe, PRIDE, CatH, CHEMBL, ChEBI, UNIPROT, HPA, INTERPRO, MINT, and STRING-db. Our study will first establish effective guidelines for implementation, then involve hands-on FAIRification workshops, in which FAIRness will be assessed before and after the work done. Our work will raise awareness around what it takes to be FAIR, and to help drive interoperability between core ELIXIR resources and with efforts outside of ELIXIR. |
ELIXIR Netherlands, ELIXIR UK, EMBL-EBI, ELIXIR Italy, ELIXIR Sweden |
FAIRness of the current ELIXIR Core resources: Application (and test) of newly available FAIR metrics, and identification of steps to increase interoperability (2018-FAIRCDR)
|
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reusability of digital resources. While the principles have enjoyed rapid uptake across communities (ELIXIR, G20, EOSC, H2020, NIH), the implementation details remain unclear. Recently, we developed a prototype software infrastructure and a set of metrics to assess the FAIRness of digital resources (http://fairmetrics.org/). In this ELIXIR Implementation Study we will put these into practice for the ELIXIR community by starting to FAIRify ELIXIR Core Data Resources ArrayExpress, ENA, PDBe, PRIDE, CatH, CHEMBL, ChEBI, UNIPROT, HPA, INTERPRO, MINT, and STRING-db. Our study will first establish effective guidelines for implementation, then involve hands-on FAIRification workshops, in which FAIRness will be assessed before and after the work done. Our work will raise awareness around what it takes to be FAIR, and to help drive interoperability between core ELIXIR resources and with efforts outside of ELIXIR. |
ELIXIR Netherlands, ELIXIR UK, EMBL-EBI, ELIXIR Italy, ELIXIR Sweden |
FAIRness of the current ELIXIR Core resources: Application (and test) of newly available FAIR metrics, and identification of steps to increase interoperability (2018-FAIRCDR)
|
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reusability of digital resources. While the principles have enjoyed rapid uptake across communities (ELIXIR, G20, EOSC, H2020, NIH), the implementation details remain unclear. Recently, we developed a prototype software infrastructure and a set of metrics to assess the FAIRness of digital resources (http://fairmetrics.org/). In this ELIXIR Implementation Study we will put these into practice for the ELIXIR community by starting to FAIRify ELIXIR Core Data Resources ArrayExpress, ENA, PDBe, PRIDE, CatH, CHEMBL, ChEBI, UNIPROT, HPA, INTERPRO, MINT, and STRING-db. Our study will first establish effective guidelines for implementation, then involve hands-on FAIRification workshops, in which FAIRness will be assessed before and after the work done. Our work will raise awareness around what it takes to be FAIR, and to help drive interoperability between core ELIXIR resources and with efforts outside of ELIXIR. |
ELIXIR Netherlands, ELIXIR UK, EMBL-EBI, ELIXIR Italy, ELIXIR Sweden |
FAIRness of the current ELIXIR Core resources: Application (and test) of newly available FAIR metrics, and identification of steps to increase interoperability (2018-FAIRCDR)
|
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reusability of digital resources. While the principles have enjoyed rapid uptake across communities (ELIXIR, G20, EOSC, H2020, NIH), the implementation details remain unclear. Recently, we developed a prototype software infrastructure and a set of metrics to assess the FAIRness of digital resources (http://fairmetrics.org/). In this ELIXIR Implementation Study we will put these into practice for the ELIXIR community by starting to FAIRify ELIXIR Core Data Resources ArrayExpress, ENA, PDBe, PRIDE, CatH, CHEMBL, ChEBI, UNIPROT, HPA, INTERPRO, MINT, and STRING-db. Our study will first establish effective guidelines for implementation, then involve hands-on FAIRification workshops, in which FAIRness will be assessed before and after the work done. Our work will raise awareness around what it takes to be FAIR, and to help drive interoperability between core ELIXIR resources and with efforts outside of ELIXIR. |
ELIXIR Netherlands, ELIXIR UK, EMBL-EBI, ELIXIR Italy, ELIXIR Sweden |
FAIRness of the current ELIXIR Core resources: Application (and test) of newly available FAIR metrics, and identification of steps to increase interoperability (2018-FAIRCDR)
|
The FAIR (Findable, Accessible, Interoperable and Reusable) principles aim to maximize the discovery and reusability of digital resources. While the principles have enjoyed rapid uptake across communities (ELIXIR, G20, EOSC, H2020, NIH), the implementation details remain unclear. Recently, we developed a prototype software infrastructure and a set of metrics to assess the FAIRness of digital resources (http://fairmetrics.org/). In this ELIXIR Implementation Study we will put these into practice for the ELIXIR community by starting to FAIRify ELIXIR Core Data Resources ArrayExpress, ENA, PDBe, PRIDE, CatH, CHEMBL, ChEBI, UNIPROT, HPA, INTERPRO, MINT, and STRING-db. Our study will first establish effective guidelines for implementation, then involve hands-on FAIRification workshops, in which FAIRness will be assessed before and after the work done. Our work will raise awareness around what it takes to be FAIR, and to help drive interoperability between core ELIXIR resources and with efforts outside of ELIXIR. |
ELIXIR Netherlands, ELIXIR UK, EMBL-EBI, ELIXIR Italy, ELIXIR Sweden |
FONDUE - FAIR-ification of Plant Genotyping Data and its linking to Phenotyping using ELIXIR Platforms
|
Recent progress in sequencing technologies has produced several large scale genotyping data sets for crops. The insights afforded by this data have been published in high profile scientific articles, but the underlying raw genotype data and the associated sample and population metadata have not been routinely submitted to appropriate archives. The aim of this implementation study, led by the ELIXIR Plant Community and in coordination with the ELIXIR Interoperability Platform and Data Platform, is to provide this wealth of data according to FAIR principles. It will ensure an interoperable link with the phenotypic data that is stored in distributed institutional repositories which is crucial for excelerated crop breeding. We propose to create a sustainable toolbox to submit data to the ELIXIR Deposition Database “European Variation Archive” (EVA) and enrich the data with interoperable metadata regarding plant data standards like “Multi-Crop Passport Descriptor” (MCPD) and “Minimum Information About a Plant Phenotyping Experiment” (MIAPPE). |
ELIXIR France, ELIXIR Germany, ELIXIR Belgium , ELIXIR Netherlands, EMBL-EBI |
FONDUE - FAIR-ification of Plant Genotyping Data and its linking to Phenotyping using ELIXIR Platforms
|
Recent progress in sequencing technologies has produced several large scale genotyping data sets for crops. The insights afforded by this data have been published in high profile scientific articles, but the underlying raw genotype data and the associated sample and population metadata have not been routinely submitted to appropriate archives. The aim of this implementation study, led by the ELIXIR Plant Community and in coordination with the ELIXIR Interoperability Platform and Data Platform, is to provide this wealth of data according to FAIR principles. It will ensure an interoperable link with the phenotypic data that is stored in distributed institutional repositories which is crucial for excelerated crop breeding. We propose to create a sustainable toolbox to submit data to the ELIXIR Deposition Database “European Variation Archive” (EVA) and enrich the data with interoperable metadata regarding plant data standards like “Multi-Crop Passport Descriptor” (MCPD) and “Minimum Information About a Plant Phenotyping Experiment” (MIAPPE). |
ELIXIR France, ELIXIR Germany, ELIXIR Belgium , ELIXIR Netherlands, EMBL-EBI |
FONDUE - FAIR-ification of Plant Genotyping Data and its linking to Phenotyping using ELIXIR Platforms
|
Recent progress in sequencing technologies has produced several large scale genotyping data sets for crops. The insights afforded by this data have been published in high profile scientific articles, but the underlying raw genotype data and the associated sample and population metadata have not been routinely submitted to appropriate archives. The aim of this implementation study, led by the ELIXIR Plant Community and in coordination with the ELIXIR Interoperability Platform and Data Platform, is to provide this wealth of data according to FAIR principles. It will ensure an interoperable link with the phenotypic data that is stored in distributed institutional repositories which is crucial for excelerated crop breeding. We propose to create a sustainable toolbox to submit data to the ELIXIR Deposition Database “European Variation Archive” (EVA) and enrich the data with interoperable metadata regarding plant data standards like “Multi-Crop Passport Descriptor” (MCPD) and “Minimum Information About a Plant Phenotyping Experiment” (MIAPPE). |
ELIXIR France, ELIXIR Germany, ELIXIR Belgium , ELIXIR Netherlands, EMBL-EBI |
FONDUE - FAIR-ification of Plant Genotyping Data and its linking to Phenotyping using ELIXIR Platforms
|
Recent progress in sequencing technologies has produced several large scale genotyping data sets for crops. The insights afforded by this data have been published in high profile scientific articles, but the underlying raw genotype data and the associated sample and population metadata have not been routinely submitted to appropriate archives. The aim of this implementation study, led by the ELIXIR Plant Community and in coordination with the ELIXIR Interoperability Platform and Data Platform, is to provide this wealth of data according to FAIR principles. It will ensure an interoperable link with the phenotypic data that is stored in distributed institutional repositories which is crucial for excelerated crop breeding. We propose to create a sustainable toolbox to submit data to the ELIXIR Deposition Database “European Variation Archive” (EVA) and enrich the data with interoperable metadata regarding plant data standards like “Multi-Crop Passport Descriptor” (MCPD) and “Minimum Information About a Plant Phenotyping Experiment” (MIAPPE). |
ELIXIR France, ELIXIR Germany, ELIXIR Belgium , ELIXIR Netherlands, EMBL-EBI |
FONDUE - FAIR-ification of Plant Genotyping Data and its linking to Phenotyping using ELIXIR Platforms
|
Recent progress in sequencing technologies has produced several large scale genotyping data sets for crops. The insights afforded by this data have been published in high profile scientific articles, but the underlying raw genotype data and the associated sample and population metadata have not been routinely submitted to appropriate archives. The aim of this implementation study, led by the ELIXIR Plant Community and in coordination with the ELIXIR Interoperability Platform and Data Platform, is to provide this wealth of data according to FAIR principles. It will ensure an interoperable link with the phenotypic data that is stored in distributed institutional repositories which is crucial for excelerated crop breeding. We propose to create a sustainable toolbox to submit data to the ELIXIR Deposition Database “European Variation Archive” (EVA) and enrich the data with interoperable metadata regarding plant data standards like “Multi-Crop Passport Descriptor” (MCPD) and “Minimum Information About a Plant Phenotyping Experiment” (MIAPPE). |
ELIXIR France, ELIXIR Germany, ELIXIR Belgium , ELIXIR Netherlands, EMBL-EBI |
Funding models for knowledge bases – Towards a sustainable funding model for the UniprotKB/Swiss-Prot use case
|
The long-term sustainability of the databases within ELIXIR is a constant concern for those who are managing them, with only a very small minority having secured funding over 5 years or more. The NIH grant that is currently funding a large proportion of UniprotKB/Swiss-Prot is guaranteed until 2018. At the same time, millions of life science users in Europe and beyond rely on these resources for their everyday research. The aim of this Implementation Study is to review sustainable funding models for knowledge bases (Academic, Commercial or third party), with UniprotKB/Swiss-Prot as a specific use case. Consideration was given to a number of approaches, all of which would need to meet certain criteria:
This study has now been completed, the work is presented in a webinar and summarised comprehensive paper: Gabella C, Durinx C and Appel R. Funding knowledgebases: Towards a sustainable funding model for the UniProt use case [version 1; referees: awaiting peer review]. F1000Research 2017, 6(ELIXIR):2051 (doi: 10.12688/f1000research.12989.1) Webinar summarising the outcome |
ELIXIR Switzerland |
Increasing Interoperability between ELIXIR Protein Structure and Sequence Resources and Expanding these Resources with 3D-Models of CATH Domains, built by SWISS-MODEL
|
This project will increase interoperability between four ELIXIR resources (CATH, SWISS-MODEL, InterPro and PDBe), three of which are Core Resources, by building APIs that facilitate the import and export of data between them. The ultimate goal is to improve provision of 3D-Models for protein domain sequences via CATH, SWISS-MODEL and InterPro. Less than 10% of known sequences have experimentally characterised 3D structural information and yet this data is often essential for understanding the protein’s molecular function and biological role and for determining whether residue mutations could damage the protein and lead to disease. So this integration is very timely as it will enhance links between sequence and structure data. APIs will be built using well-established protocols and as well as promoting interoperability, and therefore sustainability, we will expand the data in each resource to ensure they serve a wider community of biologists. |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI |
Increasing Interoperability between ELIXIR Protein Structure and Sequence Resources and Expanding these Resources with 3D-Models of CATH Domains, built by SWISS-MODEL
|
This project will increase interoperability between four ELIXIR resources (CATH, SWISS-MODEL, InterPro and PDBe), three of which are Core Resources, by building APIs that facilitate the import and export of data between them. The ultimate goal is to improve provision of 3D-Models for protein domain sequences via CATH, SWISS-MODEL and InterPro. Less than 10% of known sequences have experimentally characterised 3D structural information and yet this data is often essential for understanding the protein’s molecular function and biological role and for determining whether residue mutations could damage the protein and lead to disease. So this integration is very timely as it will enhance links between sequence and structure data. APIs will be built using well-established protocols and as well as promoting interoperability, and therefore sustainability, we will expand the data in each resource to ensure they serve a wider community of biologists. |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI |
Increasing Interoperability between ELIXIR Protein Structure and Sequence Resources and Expanding these Resources with 3D-Models of CATH Domains, built by SWISS-MODEL
|
This project will increase interoperability between four ELIXIR resources (CATH, SWISS-MODEL, InterPro and PDBe), three of which are Core Resources, by building APIs that facilitate the import and export of data between them. The ultimate goal is to improve provision of 3D-Models for protein domain sequences via CATH, SWISS-MODEL and InterPro. Less than 10% of known sequences have experimentally characterised 3D structural information and yet this data is often essential for understanding the protein’s molecular function and biological role and for determining whether residue mutations could damage the protein and lead to disease. So this integration is very timely as it will enhance links between sequence and structure data. APIs will be built using well-established protocols and as well as promoting interoperability, and therefore sustainability, we will expand the data in each resource to ensure they serve a wider community of biologists. |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI |
Integrating ELIXIR Italy into ELIXIR activities
|
The implementation study project plan of ELIXIR Italy consists of six activities that aim to boost the cooperation with existing ELIXIR activities and are expected to deepen the interaction between ELIXIR-IIB, the Joint Research Unit embodying the Italian Node, and ELIXIR. The partners involved have already established contacts with other ELIXIR Nodes and the relevant ELIXIR Platforms and Services in order to ensure an advantageous outcome for all the involved parties. The goal of the proposed activities is to create and/or reinforce collaborations based on concrete measures. With this implementation study the Italian ELIXIR Node will achieve greater integration within ELIXIR service infrastructures and data interoperability policies. The topics of the selected activities and an additional coordination task are summarized below:
|
ELIXIR Italy |
Integrating epitranscriptomic data into the ELIXIR ecosystem (2022-23)
|
Epitranscriptome modifications are now emerging as important factors to fine tune gene expression and regulation. Among them, A-to-I RNA editing by ADAR enzymes plays relevant biological roles and has been linked to several human diseases. Thanks to deep transcriptome sequencing data, A-to-I events have been characterized at single nucleotide level and collected in the REDIportal database, a unique and specialized resource comprising about 16 millions of changes detected in more than 9000 human GTEx RNAseq data. Here we plan to upgrade REDIportal providing researchers an accurate, sustainable and accessible epitranscriptome resource through its integration into the ELIXIR ecosystem. Such integration will be established through a standardization, curation and “FAIRification” of data in combination with interconnections to existing ELIXIR resources such as Ensembl, UniProt, RNAcentral and PRIDE. Our proposal will facilitate data interoperability and the study of epitranscriptome, a very relevant research topic yet under-represented in the ELIXIR community. |
ELIXIR Italy, EMBL-EBI, ELIXIR Israel |
Integrating epitranscriptomic data into the ELIXIR ecosystem (2022-23)
|
Epitranscriptome modifications are now emerging as important factors to fine tune gene expression and regulation. Among them, A-to-I RNA editing by ADAR enzymes plays relevant biological roles and has been linked to several human diseases. Thanks to deep transcriptome sequencing data, A-to-I events have been characterized at single nucleotide level and collected in the REDIportal database, a unique and specialized resource comprising about 16 millions of changes detected in more than 9000 human GTEx RNAseq data. Here we plan to upgrade REDIportal providing researchers an accurate, sustainable and accessible epitranscriptome resource through its integration into the ELIXIR ecosystem. Such integration will be established through a standardization, curation and “FAIRification” of data in combination with interconnections to existing ELIXIR resources such as Ensembl, UniProt, RNAcentral and PRIDE. Our proposal will facilitate data interoperability and the study of epitranscriptome, a very relevant research topic yet under-represented in the ELIXIR community. |
ELIXIR Italy, EMBL-EBI, ELIXIR Israel |
Integrating epitranscriptomic data into the ELIXIR ecosystem (2022-23)
|
Epitranscriptome modifications are now emerging as important factors to fine tune gene expression and regulation. Among them, A-to-I RNA editing by ADAR enzymes plays relevant biological roles and has been linked to several human diseases. Thanks to deep transcriptome sequencing data, A-to-I events have been characterized at single nucleotide level and collected in the REDIportal database, a unique and specialized resource comprising about 16 millions of changes detected in more than 9000 human GTEx RNAseq data. Here we plan to upgrade REDIportal providing researchers an accurate, sustainable and accessible epitranscriptome resource through its integration into the ELIXIR ecosystem. Such integration will be established through a standardization, curation and “FAIRification” of data in combination with interconnections to existing ELIXIR resources such as Ensembl, UniProt, RNAcentral and PRIDE. Our proposal will facilitate data interoperability and the study of epitranscriptome, a very relevant research topic yet under-represented in the ELIXIR community. |
ELIXIR Italy, EMBL-EBI, ELIXIR Israel |
Integrating reference taxonomic databases for metabarcoding and metagenomics identification
|
Comparison of environmental sequences to reference sets from curated marker loci provides a mainstay for taxonomic analysis of microbial communities. Microbial eukaryotic sequencing requires many distinct reference sets to cover diversity adequately. Those producing reference sets follow different curation workflows, but share the need to provide their data onwards to a common set of tools and services, such as EMG, Megan, MetaPIPE and BioMaS. There are multiple inefficiencies:
Led by the ITSoneDB team, who provide the leading fungi and other eukaryotes ITS1 reference set, we will develop a new data type within ENA that will capture systematically these reference sets and serve them to dependent resources, eliminating inefficiencies, leveraging this core ELIXIR resource and building sustainability into reference set generation workflows. Currently, taxonomic analysis of microbial communities relies on multiple dispersed reference data sets. The impact of this study will be that ENA will be enriched with a new structured data type to accommodate these taxonomic reference datasets, beginning with ITS1 from rRNA, from the ITSoneDB team. By enhancing the connectivity and coordination between the various reference datasets and ENA a stable system to systematically capture their data and serve them to the consumer services from one place will be made available. This will increase both the sustainability and exposure of the data and facilitate/promote their use and re-use. |
ELIXIR Italy, EMBL-EBI |
Integrating reference taxonomic databases for metabarcoding and metagenomics identification
|
Comparison of environmental sequences to reference sets from curated marker loci provides a mainstay for taxonomic analysis of microbial communities. Microbial eukaryotic sequencing requires many distinct reference sets to cover diversity adequately. Those producing reference sets follow different curation workflows, but share the need to provide their data onwards to a common set of tools and services, such as EMG, Megan, MetaPIPE and BioMaS. There are multiple inefficiencies:
Led by the ITSoneDB team, who provide the leading fungi and other eukaryotes ITS1 reference set, we will develop a new data type within ENA that will capture systematically these reference sets and serve them to dependent resources, eliminating inefficiencies, leveraging this core ELIXIR resource and building sustainability into reference set generation workflows. Currently, taxonomic analysis of microbial communities relies on multiple dispersed reference data sets. The impact of this study will be that ENA will be enriched with a new structured data type to accommodate these taxonomic reference datasets, beginning with ITS1 from rRNA, from the ITSoneDB team. By enhancing the connectivity and coordination between the various reference datasets and ENA a stable system to systematically capture their data and serve them to the consumer services from one place will be made available. This will increase both the sustainability and exposure of the data and facilitate/promote their use and re-use. |
ELIXIR Italy, EMBL-EBI |
Integration and standardization of intrinsically disordered protein data (2018-IDPs)
|
Intrinsically disordered proteins (IDPs), characterized by high conformational variability, cover almost a third of the residues in Eukaryotic proteomes. As major players in cellular regulation, IDPs are involved in numerous diseases. Specialized IDP databases provide a starting point for analysis, yet their integration into core databases remains very limited. Here, we propose to start integrating IDP information into ELIXIR Core Data Resources. This will be achieved with a three pronged approach:
|
ELIXIR Italy, EMBL-EBI, ELIXIR Switzerland, ELIXIR Hungary, ELIXIR Ireland |
Integration and standardization of intrinsically disordered protein data (2018-IDPs)
|
Intrinsically disordered proteins (IDPs), characterized by high conformational variability, cover almost a third of the residues in Eukaryotic proteomes. As major players in cellular regulation, IDPs are involved in numerous diseases. Specialized IDP databases provide a starting point for analysis, yet their integration into core databases remains very limited. Here, we propose to start integrating IDP information into ELIXIR Core Data Resources. This will be achieved with a three pronged approach:
|
ELIXIR Italy, EMBL-EBI, ELIXIR Switzerland, ELIXIR Hungary, ELIXIR Ireland |
Integration and standardization of intrinsically disordered protein data (2018-IDPs)
|
Intrinsically disordered proteins (IDPs), characterized by high conformational variability, cover almost a third of the residues in Eukaryotic proteomes. As major players in cellular regulation, IDPs are involved in numerous diseases. Specialized IDP databases provide a starting point for analysis, yet their integration into core databases remains very limited. Here, we propose to start integrating IDP information into ELIXIR Core Data Resources. This will be achieved with a three pronged approach:
|
ELIXIR Italy, EMBL-EBI, ELIXIR Switzerland, ELIXIR Hungary, ELIXIR Ireland |
Integration and standardization of intrinsically disordered protein data (2018-IDPs)
|
Intrinsically disordered proteins (IDPs), characterized by high conformational variability, cover almost a third of the residues in Eukaryotic proteomes. As major players in cellular regulation, IDPs are involved in numerous diseases. Specialized IDP databases provide a starting point for analysis, yet their integration into core databases remains very limited. Here, we propose to start integrating IDP information into ELIXIR Core Data Resources. This will be achieved with a three pronged approach:
|
ELIXIR Italy, EMBL-EBI, ELIXIR Switzerland, ELIXIR Hungary, ELIXIR Ireland |
Integration and standardization of intrinsically disordered protein data (2018-IDPs)
|
Intrinsically disordered proteins (IDPs), characterized by high conformational variability, cover almost a third of the residues in Eukaryotic proteomes. As major players in cellular regulation, IDPs are involved in numerous diseases. Specialized IDP databases provide a starting point for analysis, yet their integration into core databases remains very limited. Here, we propose to start integrating IDP information into ELIXIR Core Data Resources. This will be achieved with a three pronged approach:
|
ELIXIR Italy, EMBL-EBI, ELIXIR Switzerland, ELIXIR Hungary, ELIXIR Ireland |
LEAP - Linking Expertise and Analysis in Pathways
|
Reactome is a world-leading, curated resource for biomolecular pathways, with >1,200 citations in 2019 and >80,000 distinct users monthly. It is developed in international collaboration with EMBL-EBI as one of four partners. Reactome’s content is presented through a multi-scale visualisation system, complemented by advanced analysis tools. Beyond scientific analysis, Reactome is uniquely suited for teaching and training in molecular biology as well as in bioinformatics, through its open source, open data policy. For its content, Reactome critically depends on external domain experts who, in collaboration with professional Reactome curators ensure consistent, high quality of the curated pathways. At University of Ljubljana Faculty of Medicine within the broader group of the Slovenian ELIXIR Node, there is ample domain expertise in a variety of multifactorial disorders and pathways beyond that. For example, we are experts in cholesterol homeostasis, particularly in sterols of the cholesterol synthesis pathway, which were recently found as ligands of nuclear receptor RORC. The expertise is also in the cholesterol metabolism and connections to liver pathologies, the circadian clock, Epo receptor and signalling, molecular aspects of cancer, epigenetics of cancer and brain disorders, in cytoskeleton abnormalities, etc. The aim of the LEAP project is to establish staff exchange and regular interaction between ELIXIR Slovenia and EMBL-EBI, supporting expert curation and user experience testing of Reactome by ELIXIR Slovenia domain experts, and establish routine use of Reactome for data analysis, teaching, and training in Slovenia. This is expected to lead to improved Reactome user interface and content, and advanced pathway analysis for high throughput biomolecular data in the Slovenian research community. |
ELIXIR Slovenia, EMBL-EBI |
LEAP - Linking Expertise and Analysis in Pathways
|
Reactome is a world-leading, curated resource for biomolecular pathways, with >1,200 citations in 2019 and >80,000 distinct users monthly. It is developed in international collaboration with EMBL-EBI as one of four partners. Reactome’s content is presented through a multi-scale visualisation system, complemented by advanced analysis tools. Beyond scientific analysis, Reactome is uniquely suited for teaching and training in molecular biology as well as in bioinformatics, through its open source, open data policy. For its content, Reactome critically depends on external domain experts who, in collaboration with professional Reactome curators ensure consistent, high quality of the curated pathways. At University of Ljubljana Faculty of Medicine within the broader group of the Slovenian ELIXIR Node, there is ample domain expertise in a variety of multifactorial disorders and pathways beyond that. For example, we are experts in cholesterol homeostasis, particularly in sterols of the cholesterol synthesis pathway, which were recently found as ligands of nuclear receptor RORC. The expertise is also in the cholesterol metabolism and connections to liver pathologies, the circadian clock, Epo receptor and signalling, molecular aspects of cancer, epigenetics of cancer and brain disorders, in cytoskeleton abnormalities, etc. The aim of the LEAP project is to establish staff exchange and regular interaction between ELIXIR Slovenia and EMBL-EBI, supporting expert curation and user experience testing of Reactome by ELIXIR Slovenia domain experts, and establish routine use of Reactome for data analysis, teaching, and training in Slovenia. This is expected to lead to improved Reactome user interface and content, and advanced pathway analysis for high throughput biomolecular data in the Slovenian research community. |
ELIXIR Slovenia, EMBL-EBI |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Literature-Data Integration
|
The goal of this group is to explore extending the connected ecosystem to any ELIXIR data resource with the scientific literature, with a view to incorporating more orphan data and human data, and to providing connectivity with other elements of the ELIXIR infrastructure. Aim: To increase understanding of the potential for benefits that would arise from increasing the number of ELIXIR data resources linked to each other, Europe PMC, and integrated with orphan data, where appropriate. Through presentations, webinars, hackathons and staff exchange we will explore:
|
EMBL-EBI, ELIXIR Italy, ELIXIR Czech Republic, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany, ELIXIR Switzerland |
Long Term Sustainability
|
This study will support the establishment of global partnerships and business cases for the long term financial sustainability of Core Data Resources. Aim: To establish global partnerships for sustainable core data resources.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR Norway |
Long Term Sustainability
|
This study will support the establishment of global partnerships and business cases for the long term financial sustainability of Core Data Resources. Aim: To establish global partnerships for sustainable core data resources.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR Norway |
Long Term Sustainability
|
This study will support the establishment of global partnerships and business cases for the long term financial sustainability of Core Data Resources. Aim: To establish global partnerships for sustainable core data resources.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR Norway |
Mapping the landscape of Biocuration in ELIXIR: Practice, capability and training requirements
|
This implementation study is designed to
The outcomes of this study have been published in F1000Research and were presented in a webinar. Part of this work has been presented in the ISB Biocuration 2019 meeting (Cambridge, UK, April 2019):
and also in the ELIXIR All-Hands Meeting 2019 (Lisbon, Portugal, June 2019):
This work is carried out in collaboration with |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI, ELIXIR Luxembourg, ELIXIR Slovenia |
Mapping the landscape of Biocuration in ELIXIR: Practice, capability and training requirements
|
This implementation study is designed to
The outcomes of this study have been published in F1000Research and were presented in a webinar. Part of this work has been presented in the ISB Biocuration 2019 meeting (Cambridge, UK, April 2019):
and also in the ELIXIR All-Hands Meeting 2019 (Lisbon, Portugal, June 2019):
This work is carried out in collaboration with |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI, ELIXIR Luxembourg, ELIXIR Slovenia |
Mapping the landscape of Biocuration in ELIXIR: Practice, capability and training requirements
|
This implementation study is designed to
The outcomes of this study have been published in F1000Research and were presented in a webinar. Part of this work has been presented in the ISB Biocuration 2019 meeting (Cambridge, UK, April 2019):
and also in the ELIXIR All-Hands Meeting 2019 (Lisbon, Portugal, June 2019):
This work is carried out in collaboration with |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI, ELIXIR Luxembourg, ELIXIR Slovenia |
Mapping the landscape of Biocuration in ELIXIR: Practice, capability and training requirements
|
This implementation study is designed to
The outcomes of this study have been published in F1000Research and were presented in a webinar. Part of this work has been presented in the ISB Biocuration 2019 meeting (Cambridge, UK, April 2019):
and also in the ELIXIR All-Hands Meeting 2019 (Lisbon, Portugal, June 2019):
This work is carried out in collaboration with |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI, ELIXIR Luxembourg, ELIXIR Slovenia |
Mapping the landscape of Biocuration in ELIXIR: Practice, capability and training requirements
|
This implementation study is designed to
The outcomes of this study have been published in F1000Research and were presented in a webinar. Part of this work has been presented in the ISB Biocuration 2019 meeting (Cambridge, UK, April 2019):
and also in the ELIXIR All-Hands Meeting 2019 (Lisbon, Portugal, June 2019):
This work is carried out in collaboration with |
ELIXIR UK, ELIXIR Switzerland, EMBL-EBI, ELIXIR Luxembourg, ELIXIR Slovenia |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Metabolite Identification
|
Metabolomics aims to provide novel insights into the biochemical reactions of organisms by characterising the presence and concentrations of low molecular weight compounds from biological samples. The primary analytical tools for such high-throughput data collection are mass spectrometry (MS), often preceded by chromatographic or electrophoretic separation technologies, and nuclear magnetic resonance spectroscopy (NMR). These technologies produce relatively large and complex data sets that require bioinformaticians, cheminformaticians, biostatisticians, data scientists and computer scientists. Together they develop and apply a wide range of algorithms, software tools, repositories and computational resources to process, analyse, report and store the data and metadata. Increasingly, insights from genomics, epigenomics, transcriptomics, proteomics/protein interactomics and metabolomics are combined, to gain insights into the dynamics of biological processes. Metabolomics activities are well represented within Europe and ELIXIR nodes. Metabolite identification is the area that the community believes will have maximal impact of computational metabolomics and metabolomics data management and will benefit most from interactions with the existing five ELIXIR platforms and where progress will contribute most to other ELIXIR communities. The progress through this integrative Implementation Study will benefit industry and academia alike as metabolite identification is one of the major bottlenecks in metabolomics and resolving this challenge requires a community effort. |
ELIXIR Netherlands, EMBL-EBI, ELIXIR France, ELIXIR UK, ELIXIR Germany, ELIXIR Spain, ELIXIR Sweden, ELIXIR Italy, ELIXIR Estonia, ELIXIR Switzerland, ELIXIR Belgium |
Mining the proteome: Enabling automated processing and analysis of large-scale proteomics data
|
The project developed robust and automated open analysis pipelines for MS/MS proteomics data (based on the OpenMS framework, including new quality control features) that can be deployed in a cloud environment and reused openly by the scientific community in the future. A feature of this project was the building of a Proteomics Community, bringing people together for a face-to-face meeting (March 2017) and a hackathon (Jan 2018). This is rapidly developing field with limited or non-existent standards in the technical platforms leading to a dgeree of instability (See F1000R paper). Details as to how these concerns might be addressed are set out in the end report. The outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures. Proteomics data from the PRIDE Archive database was used in a pilot project to demonstrate the usefulness of the resource. The result being an increased facility for proteomics analysis using pipelines deployed across multiple cloud platforms. Other Implementation Studies:
Webinar summarising the outcome(Rec. April 2018); see also the slides . |
EMBL-EBI, ELIXIR Germany |
Mining the proteome: Enabling automated processing and analysis of large-scale proteomics data
|
The project developed robust and automated open analysis pipelines for MS/MS proteomics data (based on the OpenMS framework, including new quality control features) that can be deployed in a cloud environment and reused openly by the scientific community in the future. A feature of this project was the building of a Proteomics Community, bringing people together for a face-to-face meeting (March 2017) and a hackathon (Jan 2018). This is rapidly developing field with limited or non-existent standards in the technical platforms leading to a dgeree of instability (See F1000R paper). Details as to how these concerns might be addressed are set out in the end report. The outcome will be that an increased range of open proteomics tools will be included in an extended range of cloud infrastructures. Proteomics data from the PRIDE Archive database was used in a pilot project to demonstrate the usefulness of the resource. The result being an increased facility for proteomics analysis using pipelines deployed across multiple cloud platforms. Other Implementation Studies:
Webinar summarising the outcome(Rec. April 2018); see also the slides . |
EMBL-EBI, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation
|
In the future, the research literature will be increasingly open access, with new communication mechanisms such as preprints requiring versions management and new peer review mechanisms. Managing full text article corpora for text mining will be much more challenging than managing just abstracts, and it is unlikely that each and every text mining group will want to invest the necessary time and effort when there are public resources already available. Bringing the compute to the data is commonplace in most informatics workflows, and there is no reason why text mining operations will be different in the long term. The process of curation, performed by expert biologists, is the life-blood of knowledgebases. Curators need to identify key papers, read the full text of the articles to weigh up the evidence, then extract the most pertinent information. A growing corpus of open access full text articles provides new opportunities to enhance article triage and browsing systems. At the same time, many text mining workflows are mature enough to support curation activities. This group of tasks aims to build community and infrastructure based on the open full-text research literature. By providing a platform for doing text mining and sharing the outputs, developing standards, and then combining the semantic enrichment with rich article metadata and software tools, we expect to provide scalable support for curation across multiple knowledgebases. Aim: Maximise support for human curation. This group will develop the infrastructure around full text article resources to support curator workflows. This will be done by semantically enriching research articles and exploring the development of article triage systems as infrastructure. For example, daily text mining of biological concepts from full text research articles and sharing the annotations for use in search, triage, and crosslinking. The opportunities and role for community curation will also be explored. |
EMBL-EBI, ELIXIR Switzerland, ELIXIR Italy, ELIXIR Norway, ELIXIR Portugal, ELIXIR Czech Republic, ELIXIR Luxembourg, ELIXIR France, ELIXIR UK, ELIXIR Spain, ELIXIR Sweden, ELIXIR Germany |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation (2022-23)
|
Most data and literature curation processes are initiated via some entity-centric query (e.g. gene or gene products, a disease, a chemical compound). However, most databases are also interested in accessing and curating contents using other types of modalities: some biological phenomena (e.g. Intrinsically Disordered Proteins) or some domain-specific aspects of biology (e.g. lipidomics, glycomics, rare diseases). These are not easily expressed via a combination of Further, literature curation will expand beyond abstracts to include full-text, supplementary data and pre-prints (in several versions). This will be the focus of Task.3 in 22-23.
|
ELIXIR Switzerland, EMBL-EBI, ELIXIR France, ELIXIR Italy, ELIXIR Netherlands, ELIXIR Portugal, ELIXIR Belgium , ELIXIR Norway |
Scalable Curation of TF-TG interactions via Integration of NLP and VSM (2022-23)
|
Documented regulatory interactions between Transcription Factors (TFs) and target genes (TGs) form a crucial resource for biological network building. The ExTRI text mining project yielded about 54,000 sentences from literature that proposedly describe a TF-TG interaction, linked to database identifiers. This corpus is now a compelling target for a community curation effort, which will be performed by dedicated curators in this project, and modelers potentially contributing via Cytoscape. To make this large-scale effort possible, we will combine the intuitive and flexible curation tool VSM (Visual Syntax Method), with automated NLP (Natural Language Processing) assistance, into a community curation app. Premarked ExTRI sentences will be shown in their abstract, along with NLP annotations provided by Europe-PMC's API, to help curators formalize knowledge into a form suitable for computational analysis. The new web app will be designed for reusability, guided by professional curation partners, and evaluated for effect on curator performance. |
ELIXIR Norway, ELIXIR Luxembourg, ELIXIR Switzerland |
Scalable Curation of TF-TG interactions via Integration of NLP and VSM (2022-23)
|
Documented regulatory interactions between Transcription Factors (TFs) and target genes (TGs) form a crucial resource for biological network building. The ExTRI text mining project yielded about 54,000 sentences from literature that proposedly describe a TF-TG interaction, linked to database identifiers. This corpus is now a compelling target for a community curation effort, which will be performed by dedicated curators in this project, and modelers potentially contributing via Cytoscape. To make this large-scale effort possible, we will combine the intuitive and flexible curation tool VSM (Visual Syntax Method), with automated NLP (Natural Language Processing) assistance, into a community curation app. Premarked ExTRI sentences will be shown in their abstract, along with NLP annotations provided by Europe-PMC's API, to help curators formalize knowledge into a form suitable for computational analysis. The new web app will be designed for reusability, guided by professional curation partners, and evaluated for effect on curator performance. |
ELIXIR Norway, ELIXIR Luxembourg, ELIXIR Switzerland |
Scalable Curation of TF-TG interactions via Integration of NLP and VSM (2022-23)
|
Documented regulatory interactions between Transcription Factors (TFs) and target genes (TGs) form a crucial resource for biological network building. The ExTRI text mining project yielded about 54,000 sentences from literature that proposedly describe a TF-TG interaction, linked to database identifiers. This corpus is now a compelling target for a community curation effort, which will be performed by dedicated curators in this project, and modelers potentially contributing via Cytoscape. To make this large-scale effort possible, we will combine the intuitive and flexible curation tool VSM (Visual Syntax Method), with automated NLP (Natural Language Processing) assistance, into a community curation app. Premarked ExTRI sentences will be shown in their abstract, along with NLP annotations provided by Europe-PMC's API, to help curators formalize knowledge into a form suitable for computational analysis. The new web app will be designed for reusability, guided by professional curation partners, and evaluated for effect on curator performance. |
ELIXIR Norway, ELIXIR Luxembourg, ELIXIR Switzerland |
Scalable extraction of human genetic and phenotypic data from peer-reviewed literature (2022-23)
|
Health research is advanced through a deeper understanding of disease aetiology provided by detecting associations between genetic variants and disease traits in population samples. The GWAS Central and DisGeNET data services provide extensive gene/variant-phenotype/disease associations. However, an absence of tools and resources to support the text mining of comprehensive data sources prevents scalable import of association data, which is currently limited to text mining abstracts or requires manual curation. This project will extend and integrate the participants’ existing text mining tools to provide a reusable workflow to extract human genotype-phenotype associations from scientific literature full-texts, tables and supplementary materials. These data will be imported into GWAS Central and DisGeNET, accelerating FAIR access to pioneering findings such as COVID-19 GWAS. The development of an annotated GWAS corpus based on full-text articles will enable the evaluation of existing and future text mining methodologies for extracting genotype-phenotype associations and metadata. |
ELIXIR UK, ELIXIR Spain |
Scalable extraction of human genetic and phenotypic data from peer-reviewed literature (2022-23)
|
Health research is advanced through a deeper understanding of disease aetiology provided by detecting associations between genetic variants and disease traits in population samples. The GWAS Central and DisGeNET data services provide extensive gene/variant-phenotype/disease associations. However, an absence of tools and resources to support the text mining of comprehensive data sources prevents scalable import of association data, which is currently limited to text mining abstracts or requires manual curation. This project will extend and integrate the participants’ existing text mining tools to provide a reusable workflow to extract human genotype-phenotype associations from scientific literature full-texts, tables and supplementary materials. These data will be imported into GWAS Central and DisGeNET, accelerating FAIR access to pioneering findings such as COVID-19 GWAS. The development of an annotated GWAS corpus based on full-text articles will enable the evaluation of existing and future text mining methodologies for extracting genotype-phenotype associations and metadata. |
ELIXIR UK, ELIXIR Spain |
Towards a distributed Ensembl
|
This Implementation Study shaped the development of a distributed model for Ensembl, whereby multiple ELIXIR Nodes use common infrastructure to provide an integrated service to users, each focused on their own areas of interest and expertise. This approach will increase the quality of available data, and simplify data access for users by allowing the participation of many Nodes in a single service. The study is a natural complement to Task 10.3 "Capacity Building in Genome Assembly and Annotation" in the EXCELERATE project. The work led to the development of a global species registry employing non-overlapping identifier spaces which will increased facility for genome analysis and customisation of Ensembl to target organisms of relevance. Following this project, the software is more sustainable through improved structures and documentation. This will benefit Ensembl internally, as well as external parties (ELIXIR Nodes, Life Science researchers) who want to create their own Ensembl instance. This study is now complete, see the end report. Webinar summarising the outcomes |
ELIXIR Norway, EMBL-EBI, ELIXIR Sweden |
Towards a distributed Ensembl
|
This Implementation Study shaped the development of a distributed model for Ensembl, whereby multiple ELIXIR Nodes use common infrastructure to provide an integrated service to users, each focused on their own areas of interest and expertise. This approach will increase the quality of available data, and simplify data access for users by allowing the participation of many Nodes in a single service. The study is a natural complement to Task 10.3 "Capacity Building in Genome Assembly and Annotation" in the EXCELERATE project. The work led to the development of a global species registry employing non-overlapping identifier spaces which will increased facility for genome analysis and customisation of Ensembl to target organisms of relevance. Following this project, the software is more sustainable through improved structures and documentation. This will benefit Ensembl internally, as well as external parties (ELIXIR Nodes, Life Science researchers) who want to create their own Ensembl instance. This study is now complete, see the end report. Webinar summarising the outcomes |
ELIXIR Norway, EMBL-EBI, ELIXIR Sweden |
Towards a distributed Ensembl
|
This Implementation Study shaped the development of a distributed model for Ensembl, whereby multiple ELIXIR Nodes use common infrastructure to provide an integrated service to users, each focused on their own areas of interest and expertise. This approach will increase the quality of available data, and simplify data access for users by allowing the participation of many Nodes in a single service. The study is a natural complement to Task 10.3 "Capacity Building in Genome Assembly and Annotation" in the EXCELERATE project. The work led to the development of a global species registry employing non-overlapping identifier spaces which will increased facility for genome analysis and customisation of Ensembl to target organisms of relevance. Following this project, the software is more sustainable through improved structures and documentation. This will benefit Ensembl internally, as well as external parties (ELIXIR Nodes, Life Science researchers) who want to create their own Ensembl instance. This study is now complete, see the end report. Webinar summarising the outcomes |
ELIXIR Norway, EMBL-EBI, ELIXIR Sweden |
Unification of alternative forms of chemical entities with collaborative effort
|
The ELIXIR Core Data Resource ChEBI is a dictionary of molecular entities and, currently, is able to handle all possible forms of a given chemical structure (E.g. neutral, tautomeric, protonated, isotopic, zwitterionic forms). Each form is assigned its own unique ChEBI identifier, which enables inter-relations within ChEBI via the ontology. Within the biological community, different database resources represent chemical structures in different ways. The ELIXIR resource, Rhea, uses only the physiological pH 7.3 form of a given molecule, whereas Reactome requires both neutral and protonated forms of a given molecule. This inconsistency between resources in the mapping of chemical structures was highlighted in a recent ChEBI user workshop held in May 2019 (EMBL-EBI, UK), where it was agreed that a unification of the methodology, and consistency in the mapping of chemical structures would allow easier mapping across the different ELIXIR resources. Here we propose a set of meetings and working groups between the resources ChEBI (EMBL-EBI) and Rhea (SIB), with the aim of streamlining the link between chemicals (ChEBI), reactions (Rhea) and pathways (Reactome) and to provide guidelines on the representation of chemical entities to the metabolic modelling community. We anticipate that the development of closer interactions and working relationships between core staff from the different resources during the course of this project will enable further streamlining and increased levels of interoperability between these ELIXIR resources. It will lead to a greater degree of understanding of the |
EMBL-EBI, ELIXIR Switzerland |
Unification of alternative forms of chemical entities with collaborative effort
|
The ELIXIR Core Data Resource ChEBI is a dictionary of molecular entities and, currently, is able to handle all possible forms of a given chemical structure (E.g. neutral, tautomeric, protonated, isotopic, zwitterionic forms). Each form is assigned its own unique ChEBI identifier, which enables inter-relations within ChEBI via the ontology. Within the biological community, different database resources represent chemical structures in different ways. The ELIXIR resource, Rhea, uses only the physiological pH 7.3 form of a given molecule, whereas Reactome requires both neutral and protonated forms of a given molecule. This inconsistency between resources in the mapping of chemical structures was highlighted in a recent ChEBI user workshop held in May 2019 (EMBL-EBI, UK), where it was agreed that a unification of the methodology, and consistency in the mapping of chemical structures would allow easier mapping across the different ELIXIR resources. Here we propose a set of meetings and working groups between the resources ChEBI (EMBL-EBI) and Rhea (SIB), with the aim of streamlining the link between chemicals (ChEBI), reactions (Rhea) and pathways (Reactome) and to provide guidelines on the representation of chemical entities to the metabolic modelling community. We anticipate that the development of closer interactions and working relationships between core staff from the different resources during the course of this project will enable further streamlining and increased levels of interoperability between these ELIXIR resources. It will lead to a greater degree of understanding of the |
EMBL-EBI, ELIXIR Switzerland |
Administration and support for Core Data Resource (CDR) and Deposition Database (EDD) portfolio
|
The goal of this project is to manage the Core Data Resource and Deposition Database portfolio.
|
ELIXIR Switzerland, EMBL-EBI |
Administration and support for Core Data Resource (CDR) and Deposition Database (EDD) portfolio
|
The goal of this project is to manage the Core Data Resource and Deposition Database portfolio.
|
ELIXIR Switzerland, EMBL-EBI |