Services: data resources

Name of service		Tag	Related links*	Key Collection
GlobalFungi	User interface to data from high-throughput sequencing studies of fungal communities across terrestrial biomes. Includes sequencing data, sample locations, sample metadata.		bio.tools
Glyco@Expasy	Glyco@Expasy centralizes web-based glycoinformatics resources developed within an international network of glycoscientists. The philosophy is that it should be {glycoscientist AND protein scientist}–friendly with the aim of (1) popularizing the use of bioinformatics in glycobiology and (2) emphasizing the relationship between glycobiology and protein-oriented bioinformatics resources. Glyco@Expasy was designed with glycoscientists to meet the growing needs of the community for glycoinformatics.	Proteins and proteomes	bio.tools
Glyconnect	GlyConnect is a platform integrating several sources of information to characterise the molecular actors of glycosylation, mainly glycoproteins and N- and O-linked glycans. The purpose of GlyConnect is to bring out in a single resource the relationships between glycans, the proteins that carry them, the enzymes that synthesise or degrade them and the proteins that bind them.	Proteins and proteomes	bio.tools
GnpIS	GnpIS is an interoperable Information System for plant and pest genomics. It is a powerful multispecies centralized information system with seven linked relational databases.	Genes and genomes Proteins and proteomes	bio.tools FAIRsharing
GO annotation (GOA)	The UniProt GO annotation program (GOA) aims to provide high-quality Gene Ontology (GO) annotations to proteins in the UniProt Knowledgebase (UniProtKB).	Genes and genomes Proteins and proteomes	bio.tools FAIRsharing
GWAS Catalog	The NHGRI-EBI GWAS Catalog is a quality-controlled, manually curated, literature-derived collection of all published genome-wide association studies.	Genes and genomes Literature	bio.tools FAIRsharing TeSS
GWAS Central	Providing a centralized compilation of summary level findings from genetic association studies, both large and small		bio.tools FAIRsharing
HERVd	Human Endogenous RetroViruses Database.	Genes and genomes	bio.tools FAIRsharing
HLA Ligand Atlas	The HLA Ligand Atlas is a comprehensive collection of tissue and HLA allele specific HLA ligands that are naturally presented. The data was generated in standardized mass spectrometry experiments and analyzed using our reproducible computational analysis workflow. (Data analysed with the OpenMS MHCquant workflow)
HmtDB	Database of human mitochondrial genomes from primary INSDC databases, personal submissions and application of MtoolBox to NGS data.	Evolution and phylogeny Genes and genomes	bio.tools FAIRsharing
Human Protein Atlas (HPA)	Database with millions of high-resolution images.	Molecular and cellular structures Proteins and proteomes	bio.tools FAIRsharing	CDD
Hungarian Cancer Registry	A population-based database that collects all the cases with malignancies in Hungary by reports of oncology care hospitals. Registration is mandatory. The Registry is regulated by the order of the Hungarian Government and maintained by the National Institute of Oncology.	Genes and genomes
IDSM	Integrated Database of Small Molecules. Aggregating many different sources of information about small molecules into a single, logically coherent and semantically interconnected information source.	Chemical biology	bio.tools
iHUB	ionomicHUB provides curated ionomic data on many thousands of plant samples freely available to the public. It is an international collaborative cyber research environment to help identify and understand the genes and gene networks that function to control the ionome, the mineral nutrient and trace element composition of an organism or tissue.	Genes and genomes Proteins and proteomes	FAIRsharing
IMGT	An integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), and major histocompatibility (MH) of human and other vertebrate species.	Genes and genomes Proteins and proteomes	bio.tools FAIRsharing
IntAct	IntAct provides a freely available, open source database system and analysis tools for molecular interaction data.	Enzymes, interactions and pathways Proteins and proteomes	bio.tools FAIRsharing TeSS
IntEnz	IntEnz is a database of enzyme nomenclature that also provides enzyme classifications based on the nature of catalysed reactions. IntEnz is produced in collaboration with the SIB Swiss Institute of Bioinformatics.	Enzymes, interactions and pathways Proteins and proteomes	bio.tools FAIRsharing
InterPro	InterPro classifies proteins into families and predicts the presence of important domains and sites.	Molecular and cellular structures Proteins and proteomes	bio.tools FAIRsharing TeSS	CDD
iPtgxDBs	Integrated Proteogenomics DataBases an open source database that provides integrated annotations, predictions and a six-frame translation for one respective genome sequence in an easily usable format, both as a search DB (FASTA format) with informative identifiers and a GFF file that integrates all annotations and identifiers.	Proteins and proteomes
iRefIndex	Consolidated resource of public binary protein-protein interaction data assembled from more than 15 individual binary protein-protein interactions datasets	Proteins and proteomes	bio.tools FAIRsharing
Italian COVID-19 Data Portal	The Italian COVID-19 Data Portal provides information, guidelines, tools and services to support researchers in creating and sharing research data on COVID-19.
ITSoneDB	ITSoneDB is a comprehensive collection of eukaryotic ribosomal RNA Internal Transcribed Spacer 1 (ITS1) sequences.	Genes and genomes	bio.tools
ITSoneWB	A Galaxy-based workbench providing established tools (e.g BioMaS, Mothur and qiime2) targeted at global taxonomic analysis of eukaryotic communities based on Internal Transcribed Spacer 1 variants high-throughput sequencing.		bio.tools
IUPHAR/BPS Guide to PHARMACOLOGY	A curated database of drug targets, prescription medicines and experimental drugs.	Chemical biology	bio.tools FAIRsharing TeSS
JASPAR	An open access database of manually curated, non-redundant transcription factor (TF) binding profiles	Proteins and proteomes	bio.tools
LiceBase	Species focused genome data resource for sea lice, provides a genome browser, access to high-throughput data and LIMS.	Genes and genomes	bio.tools FAIRsharing
LIPID Maps	Provides access to lipid nomenclature, databases, tools, protocols, standards, tutorials, meetings, publications, and other resources and serving the international lipid research community.		bio.tools FAIRsharing
Marine Metagenomics Portal (MMP)	Freely accessible microbial genomics and metagenomics reference databases that include MarRef, MarDB, and MarCat. The MMP also includes a search engine that you can use to query all three databases at once.	Genes and genomes	bio.tools TeSS
MATRIXDB	MatrixDB is a database focused on molecular interactions of extracellular matrix proteins and glycosaminoglycans. The advanced query interface allows users to browse MatrixDB data, to build and to visualize specific interaction networks based on transcriptomic or quantitative proteomic data, diseases, biological processes or molecular functions. MatrixDB hosts the GAG builder tool to build on line 3D models of GAG sequences binding to proteins.	Enzymes, interactions and pathways	FAIRsharing
Mentha	A comprehensive collection of data from manually curated protein-protein interaction databases adhering to the IMEx consortium.	Enzymes, interactions and pathways Proteins and proteomes	bio.tools FAIRsharing