The UniProt Knowledgebase (UniProtKB) is the central hub for the collection of functional information on proteins, with accurate, consistent and rich annotation. In addition to capturing the core data mandatory for each UniProtKB entry (mainly, the amino acid sequence, protein name or description, taxonomic data and citation information), as much annotation information as possible is added. This includes widely accepted biological ontologies, classifications and cross-references, and clear indications of the quality of annotation in the form of evidence attribution of experimental and computational data. The Universal Protein Resource (UniProt) is a comprehensive resource for protein sequence and annotation data. The UniProt databases are the UniProt Knowledgebase (UniProtKB), the UniProt Reference Clusters (UniRef), and the UniProt Archive (UniParc). The UniProt Metagenomic and Environmental Sequences (UniMES) database is a repository specifically developed for metagenomic and environmental data. The UniProt Knowledgebase,is an expertly and richly curated protein database, consisting of two sections called UniProtKB/Swiss-Prot and UniProtKB/TrEMBL.
UniProtKB is covered by Thomson Reuters Data Citation Index. UniProtKB is covered by SCOPUS. Different databases for different uses: We have built four different databases: The UniProt Knowledgebase, and in particular UniProtKB/Swiss-Prot,
is used to access functional information on proteins. Every UniProtKB entry contains the amino acid sequence, protein name or description,taxonomic data and citation information but in addition to this, we add
as much annotation as possible. This includes widely accepted biological ontologies, classifications and cross-references, as well as clear indications on the quality of annotation in the form of evidence attribution to
experimental and computational data. - The UniRef databases provide clustered sets of sequences from
UniProtKB and selected UniParc records to provide complete coverage of sequence space at several resolutions. UniRef90 and UniRef50 yield a database size reduction of approximately 40% and 65%, respectively, providing significantly faster sequence searches. - UniParc is the most comprehensive publicly accessible non-redundant
protein sequence database available, providing links to all underlying
sources and versions of these sequences. You can instantly find out whether a sequence of interest is already in the public domain and, if not, identify its closest relatives. - UniMES is a repository specifically for metagenomic and environmental