Gene Information |
Locus tag | Sare_4476 |
Symbol | |
ID | 5706916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5057490 |
End bp | 5058644 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273892 |
Product | hypothetical protein |
Protein accession | YP_001539241 |
Protein GI | 159039988 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02570] CRISPR-associated protein, GSU0053 family, N-terminal domain |
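The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and are not part of the database.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 5057490
END_BP = 5058644


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count for a CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(f"{GENE_LEN} bp, {PROTEIN_LEN} aa")
```

With the values in this record the sketch gives 1155 bp and 384 aa, matching the Gene Length and Protein Length fields.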
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGGCG ACGAGTTGGC GGCACGGCTG ATTGCTGCCG TTGGCGAGCA GAGGCGTGAG AGCGGCGTGG TGGTGGAGGC GGTGTATCAG CCGGTCGGGG GTGCTGGTGG CAAGGTGATG CCTCCGACCT TCCCGGTGGT GGAGCGCGGT GGCTCGCCCT ATCTGCTGGA GGAACGGTGG GTTGATGGCG ACCGGGTGGG CACGGTGGTG ATCGACCAGG TGCCGAGCCA GGCCAACCGG GTCGAGGAGG CGCTGCTGGC GGCTCGGGAC ACGGGGCGAC TGTCGGTGCC GATCTTCGAG ATGATGGTGG ACGGGCTGCG GCTGACGTCG CTGCAGTTCC CGCACCGCTA TGCGGATGCG TACCTGCGTG ACAGCGAGGT CGACGGCGTA CGTTTCGATG ACAGCACGGC CGGTAAGGCG CTGCGGTCGG TGACGACACG TGACGTTCGT CCGCTGTACG CCCGGGAGCC GTACTCGCTG TTGTTCGGGG CGTGGGATTC GCATCGCAGG GGCCGGTGGC CACGGTTCGC GCGGCTGTAC CAGTCGATGA TGTACGGCCT GGATCCGATC GTTGGCGATC GACGCAGCGG GCGGTTCGAC CCGTTGAATC TCACGGGCGG TGTGGACAAC AAGAACAAGG CTGAGACGGA CTGGCGGTTC CTCCCGGAGG GGCAGAAGGC CAAGGGCGGC CGGCTGAGCG AGATCGGCCA CGGCCACATC GCCCCCAACC CCGCTCATGG TGGGGTGACG GTCCGGGAGG TACGCCGGTC GGCGTGGATC TCCTTCGCCG GCCTGGAGCG GCTGCGGTTC GGGGAGGTCT CCGAGGAGGC TGCTGGGCTC GCGCGAGCGG CGCTGGCGGC GTTGGCGCTG GTCGGGGATC GGTTGGCTTT CGGGCGGCCG TCGCTGTCGC TGCGGTCCGG CTGCGAGTTG ACCCGGATCA CCGAGACGGT GGCGTTCGAA GTCGCCGGCG GGGAGAAGGA GCCGGTCGAG GTGTCGGTCG GTGACGCTGT CGCGGCGTTC GTCCAGCTGC GGGCACAGGC GGGGGCGGCG GGTGTGCCGA TGGCAGACGA TGTGGTGGCT GTGACGCCGA TTCGGCAGTT GCGCGAGGCG ATGGTGTACG CGCGCACCCA GGCTGTCCCA GACTCCGAGG AGTAG
|
Protein sequence | MTGDELAARL IAAVGEQRRE SGVVVEAVYQ PVGGAGGKVM PPTFPVVERG GSPYLLEERW VDGDRVGTVV IDQVPSQANR VEEALLAARD TGRLSVPIFE MMVDGLRLTS LQFPHRYADA YLRDSEVDGV RFDDSTAGKA LRSVTTRDVR PLYAREPYSL LFGAWDSHRR GRWPRFARLY QSMMYGLDPI VGDRRSGRFD PLNLTGGVDN KNKAETDWRF LPEGQKAKGG RLSEIGHGHI APNPAHGGVT VREVRRSAWI SFAGLERLRF GEVSEEAAGL ARAALAALAL VGDRLAFGRP SLSLRSGCEL TRITETVAFE VAGGEKEPVE VSVGDAVAAF VQLRAQAGAA GVPMADDVVA VTPIRQLREA MVYARTQAVP DSEE
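The sequences above are printed in space-separated blocks of 10. A minimal sketch, assuming the gene sequence is pasted in as a plain string, of how the blocks might be joined and how the GC content field and the terminal stop codon could be checked. The helper names are hypothetical, and the stop codon set is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Join the space-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(raw: str) -> bool:
    """True if the last three bases form a stop codon."""
    return clean_sequence(raw)[-3:] in STOP_CODONS


# Only the first two and the last two blocks of the gene sequence are used
# here; paste the full sequence to compare against the GC content field.
first_blocks = "ATGACAGGCG ACGAGTTGGC"
last_blocks = "GACTCCGAGG AGTAG"
print(round(gc_content(first_blocks), 2), ends_with_stop(last_blocks))
```

Applied to the full gene sequence, `gc_content` returns a fraction that can be multiplied by 100 and rounded for comparison with the reported percentage.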