Services: data resources

Name of service Tag Related links* Key Collection
GlobalFungi

User interface to data from high-throughput sequencing studies of fungal communities across terrestrial biomes. Includes sequencing data, sample locations, sample metadata.

Glyco@Expasy

Glyco@Expasy centralizes web-based glycoinformatics resources developed within an international network of glycoscientists. The philosophy is that it should be {glycoscientist AND protein scientist}–friendly with the aim of (1) popularizing the use of bioinformatics in glycobiology and (2) emphasizing the relationship between glycobiology and protein-oriented bioinformatics resources. Glyco@Expasy was designed with glycoscientists to meet the growing needs of the community for glycoinformatics.

Glyconnect

GlyConnect is a platform integrating several sources of information to

characterise the molecular actors of glycosylation, mainly glycoproteins and N- and O-linked glycans. The purpose of GlyConnect is to bring out in a single resource the relationships between glycans, the proteins that carry them, the enzymes that synthesise or degrade them and the proteins that bind them.

GnpIS

GnpIS is an interoperable Information System for plant and pest genomics. It is a powerful multispecies centralized information system with seven linked relational databases. 

GO annotation (GOA)

The UniProt  GO annotation program (GOA) aims to provide high-quality Gene Ontology (GO) annotations to proteins in the UniProt Knowledgebase (UniProtKB).

GWAS Catalog

The NHGRI-EBI GWAS Catalog is a quality-controlled, manually curated, literature-derived collection of all published genome-wide association studies.
 

GWAS Central

Providing a centralized compilation of summary level findings from genetic association studies, both large and small

HERVd

Human Endogenous RetroViruses Database.

HLA Ligand Atlas

The HLA Ligand Atlas is a comprehensive collection of tissue and HLA allele specific HLA ligands that are naturally presented. The data was generated in standardized mass spectrometry experiments and analyzed using our reproducible computational analysis workflow. (Data analysed with the OpenMS MHCquant workflow) 

HmtDB

Database of human mitochondrial genomes from primary INSDC databases, personal submissions and application of MtoolBox to NGS data.

Human Protein Atlas (HPA)

Database with millions of high-resolution images.

CDD
Hungarian Cancer Registry

A population-based database that collects all the cases with malignancies in Hungary by reports of oncology care hospitals. Registration is mandatory. The Registry is regulated by the order of the Hungarian Government and maintained by the  National Institute of Oncology.

IDSM

Integrated Database of Small Molecules.
Aggregating many different sources of information about small molecules into a single, logically coherent and semantically interconnected information source.

iHUB

ionomicHUB provides curated ionomic data on many thousands of plant samples freely available to the public. It is an international collaborative cyber research environment to help identify and understand the genes and gene networks that function to control the ionome, the mineral nutrient and trace element composition of an organism or tissue. 

IMGT

An integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), and major histocompatibility (MH) of human and other vertebrate species. 

IntAct

IntAct provides a freely available, open source database system and analysis tools for molecular interaction data.

IntEnz

IntEnz is a database of enzyme nomenclature that also provides enzyme classifications based on the nature of catalysed reactions. IntEnz is produced in collaboration with the SIB Swiss Institute of Bioinformatics.

InterPro

InterPro classifies proteins into families and predicts the presence of important domains and sites.

CDD
iPtgxDBs

Integrated Proteogenomics DataBases an open source database that provides integrated annotations, predictions and a six-frame translation for one respective genome sequence in an easily usable format, both as a search DB (FASTA format) with informative identifiers and a GFF file that integrates all annotations and identifiers.

iRefIndex

Consolidated resource of public binary protein-protein interaction data assembled from more than 15 individual binary protein-protein interactions datasets

Italian COVID-19 Data Portal

The Italian COVID-19 Data Portal provides information, guidelines, tools and services to support researchers in creating and sharing research data on COVID-19.

ITSoneDB

ITSoneDB is a comprehensive collection of eukaryotic ribosomal RNA Internal Transcribed Spacer 1 (ITS1) sequences.

ITSoneWB

A Galaxy-based workbench providing established tools (e.g BioMaS, Mothur and qiime2) targeted at global taxonomic analysis of eukaryotic communities based on Internal Transcribed Spacer 1 variants high-throughput sequencing.

IUPHAR/BPS Guide to PHARMACOLOGY

A curated database of drug targets, prescription medicines and experimental drugs.

JASPAR

An open access database of manually curated, non-redundant transcription factor (TF) binding profiles

LiceBase

Species focused genome data resource for sea lice, provides a genome browser, access to high-throughput data and LIMS.

LIPID Maps

Provides access to lipid nomenclature, databases, tools, protocols, standards, tutorials, meetings, publications, and other resources and serving the international lipid research community.

Marine Metagenomics Portal (MMP)

Freely accessible microbial genomics and metagenomics reference databases that include MarRef, MarDB, and MarCat. The MMP also includes a search engine that you can use to query all three databases at once.

MATRIXDB

MatrixDB is a database focused on molecular interactions of extracellular matrix proteins and glycosaminoglycans. The advanced query interface allows users to browse MatrixDB data, to build and to visualize specific interaction networks based on transcriptomic or quantitative proteomic data, diseases, biological processes or molecular functions. MatrixDB hosts the GAG builder tool to build on line 3D models of GAG sequences binding to proteins. 

Mentha

A comprehensive collection of data from manually curated protein-protein interaction databases adhering to the IMEx consortium.