This help document describes the mouse, rat, rodent, and human sequence databases you can select from the MouseBLAST query form when searching for sequences that may be associated with mouse genes.
Nucleotide Sequence Databases
- GenBank Mouse - Subset of GenBank containing mouse sequences from the rodent, high-throughput cDNA (HTC), and patent divisions. Mouse sequences are parsed from these divisions by examining the phylogeny lines for entries containing "mus." Does not include ESTs, genome survey sequences (GSS), or high-throughput genomic sequences (HTGS).
- GenBank Mouse associated with genes in MGI - Subset of GenBank containing mouse sequences from all divisions (including EST) associated with MGI mouse genes. Mouse sequences are parsed from these divisions by examining the phylogeny lines for entries containing "mus."
- GenBank GSS Mouse - Subset of GenBank containing mouse genome survey sequences (GSS). All mouse sequences are parsed from the GSS division by examining the phylogeny lines for those entries containing "mus."
- Mouse Gene Traps from GSS in MGI - Subset of mouse genome survey sequences (GSS) identified by parsing entries for the class "Gene Trap" and for the provider's contact information.
- GenBank HTG Mouse - Subset of GenBank containing mouse high-throughput genomic sequences (HTG). All mouse sequences are parsed from the HTG division by examining the phylogeny lines for those entries containing "mus."
- GenBank Human - Subset of GenBank containing human sequences from the primate, high-throughput cDNA (HTC), and patent divisions. Human sequences are parsed from these divisions by examining the phylogeny lines for entries containing "sapiens." Does not include ESTs, genome survey sequences (GSS), or high-throughput genomic sequences (HTGS).
- GenBank Rat - Subset of GenBank containing rat sequences from the rodent, high-throughput cDNA (HTC), and patent divisions. Rat sequences are parsed from these divisions by examining the phylogeny lines for entries containing "rattus." Does not include ESTs, genome survey sequences (GSS), or high-throughput genomic sequences (HTGS).
- GenBank Rodent - Subset of GenBank containing rodent sequences from the rodent, high-throughput cDNA (HTC), and patent divisions. Rodent sequences are parsed from these divisions by examining the phylogeny lines for entries containing "rodentia." Does not include ESTs, genome survey sequences (GSS), or high-throughput genomic sequences (HTGS).
- RefSeq Mouse Transcripts - All NCBI RefSeq Mouse transcripts (Mouse RefSeqs of type NM_, NR_, XM_, and XR_).
- NCBI Mouse Genome Assembly Build 37 - NCBI Build 37.1 represents over 90% of the mouse genome in finished form. See NCBI's Mouse Genome Overview for details.
- NCBI Mouse Genome Assembly Build 36 - The Build 36 assembly from the Mouse Genome Sequencing Consortium (MGSC) is composed largely of finished sequence and includes approximately 2.6 Gb of sequences on chromosomes 1-19, X, Y, M (mitochondrial DNA) and Un (unmapped clone contigs).
- NCBI Mouse Genome Assembly Build 34 - NCBI Mouse Build 34 represents a fourth-generation composite assembly. In this build, chromosomes 1, 3, 5, 6, 7, 8, 9, 10, and 12-19 were automatically assembled.
- DFCI Mouse Gene Index - Tentative consensus transcript sequences from DFCI's Mouse Gene Index (MGI).
- DFCI Human Gene Index - Tentative consensus transcript sequences from DFCI's Human Gene Index (HGI).
- DFCI Rat Gene Index - Tentative consensus transcript sequences from DFCI's Rat Gene Index (RGI).
- RIKEN FANTOM cDNAs - Mouse cDNAs from the RIKEN FANTOM project.
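The species filtering described for the GenBank subsets above can be sketched roughly as follows. This is only an illustration, not the code MouseBLAST actually runs. It assumes that the "phylogeny lines" are the ORGANISM line of a GenBank flat-file record plus the lineage lines that follow it, and that terms are matched as whole words without regard to case; neither detail is stated in this document. The function names and the two sample records are made up for the example.

```python
import re

# Two made-up, heavily trimmed GenBank-style records (illustration only).
SAMPLE = """\
LOCUS       MOUSE1       120 bp    mRNA    linear   ROD 01-JAN-2000
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Mammalia; Rodentia; Muridae;
            Murinae; Mus; Mus.
REFERENCE   1
//
LOCUS       HUMAN1       200 bp    mRNA    linear   PRI 01-JAN-2000
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Mammalia; Primates; Hominidae;
            Homo.
REFERENCE   1
//
"""


def split_records(text):
    """Yield each record as a list of lines; a record ends at a '//' line."""
    record = []
    for line in text.splitlines():
        if line.strip() == "//":
            if record:
                yield record
            record = []
        else:
            record.append(line)
    if any(line.strip() for line in record):
        yield record


def phylogeny_text(record):
    """Join the ORGANISM line and its indented lineage continuation lines."""
    collected = []
    in_block = False
    for line in record:
        if line.startswith("  ORGANISM"):
            in_block = True
            collected.append(line[len("  ORGANISM"):].strip())
        elif in_block:
            # Continuation lines are indented 12 columns; anything else ends the block.
            if line.strip() and line[:12].strip() == "":
                collected.append(line.strip())
            else:
                break
    return " ".join(collected)


def matches_taxon(record, term):
    """True if `term` occurs as a whole word in the phylogeny text (assumed rule)."""
    pattern = rf"\b{re.escape(term)}\b"
    return re.search(pattern, phylogeny_text(record), re.IGNORECASE) is not None


def locus_name(record):
    """Return the second field of the LOCUS line (the first line of the record)."""
    return record[0].split()[1]


records = list(split_records(SAMPLE))
mouse_records = [locus_name(r) for r in records if matches_taxon(r, "mus")]
print(mouse_records)
```

Passing "sapiens", "rattus", or "rodentia" as the term would select the human, rat, or rodent subsets in the same way. Whole-word matching is a choice made for this sketch so that "mus" does not also match longer words such as "musculus".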
Protein Sequence Databases
- UniProt - All protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI).
- UniProt Mouse - All mouse protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Mouse sequences are defined as those entries containing "mus" in the phylogeny lines or "mouse" in the species lines.
- UniProt Human - All human protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Human sequences are defined as those entries containing "sapiens" in the phylogeny lines or "human" in the species lines.
- UniProt Rat - All rat protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Rat sequences are defined as those entries containing "rattus" in the phylogeny lines or "rat" in the species lines.
- UniProt Rodent - All rodent protein sequences from the "non-redundant" set of sequences from SwissProt and TrEMBL (SP_TR_NRDB from SIB/EBI). Rodent sequences are defined as those entries containing "rodentia" in the phylogeny lines or "mus" or "rattus" in the species lines.
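The species definitions used for the UniProt subsets above can be sketched in the same spirit. This illustration assumes that the "phylogeny lines" and "species lines" correspond to the OC and OS lines of the UniProt flat-file format, and that terms are matched as whole words without regard to case; the document does not spell out either point. The RULES table, the function names, and the three sample entries are hypothetical and are not taken from the MouseBLAST implementation.

```python
import re

# Three made-up, heavily trimmed UniProt-style entries (illustration only).
SAMPLE = """\
ID   EX1_MOUSE   Reviewed;   100 AA.
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Mammalia; Rodentia; Muridae; Murinae;
OC   Mus; Mus.
//
ID   EX1_HUMAN   Reviewed;   100 AA.
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Mammalia; Primates; Hominidae; Homo.
//
ID   EX1_RAT     Reviewed;   100 AA.
OS   Rattus norvegicus (Rat).
OC   Eukaryota; Metazoa; Chordata; Mammalia; Rodentia; Muridae; Murinae;
OC   Rattus.
//
"""

# Subset name -> (terms looked for in phylogeny lines, terms looked for in species lines),
# transcribed from the definitions in the list above.
RULES = {
    "mouse": (["mus"], ["mouse"]),
    "human": (["sapiens"], ["human"]),
    "rat": (["rattus"], ["rat"]),
    "rodent": (["rodentia"], ["mus", "rattus"]),
}


def split_entries(text):
    """Yield each entry as a list of lines; an entry ends at a '//' line."""
    entry = []
    for line in text.splitlines():
        if line.strip() == "//":
            if entry:
                yield entry
            entry = []
        else:
            entry.append(line)
    if any(line.strip() for line in entry):
        yield entry


def lines_with_code(entry, code):
    """Join the content of all lines that start with the given two-letter line code."""
    prefix = code + "   "
    return " ".join(line[len(prefix):].strip() for line in entry if line.startswith(prefix))


def has_any_word(text, words):
    """True if any of `words` occurs in `text` as a whole word, ignoring case."""
    return any(re.search(rf"\b{re.escape(w)}\b", text, re.IGNORECASE) for w in words)


def in_subset(entry, subset):
    """Apply the phylogeny-or-species rule for one subset name from RULES."""
    phylo_terms, species_terms = RULES[subset]
    return (has_any_word(lines_with_code(entry, "OC"), phylo_terms)
            or has_any_word(lines_with_code(entry, "OS"), species_terms))


def entry_id(entry):
    """Return the entry name from the ID line."""
    for line in entry:
        if line.startswith("ID   "):
            return line.split()[1]
    return None


entries = list(split_entries(SAMPLE))
for subset in RULES:
    print(subset, [entry_id(e) for e in entries if in_subset(e, subset)])
```

Whole-word matching is again a choice made for this sketch: a plain substring test for a short term such as "rat" would also hit words like "Vertebrata" if it were applied to lineage text.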