Genomic databases

A database is a structured collection of records stored in a computer system. In genomics and bioinformatics, there are a few key kinds of databases that house genomic data. A "sequence" database is a collection of DNA and/or protein sequences. An "annotation" database is a collection of information about particular sequences. Some databases include sequence information AND annotation information. How do you know which databases to access to get the information that you want? The answer, of course, is that it all depends on the question you are asking! It takes a lot of practice to be able to guess which databases will be good ones to try and use. On this page is a collection of databases that you will DEFINITELY be using for your research.

ncbi logo : A set of databases of sequences for everything under the sun

JGI logo : A database of the genome sequence and annotations for Nematostella

AiptasiaBase logo : A database of the partial transcriptome sequence and annotations for Aiptasia

pfam logo : A database of evolutionarily conserved protein families, and annotations about the functions of those families