Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Spea_3598 |
Symbol | |
ID | 5663984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella pealeana ATCC 700345 |
Kingdom | Bacteria |
Replicon accession | NC_009901 |
Strand | + |
Start bp | 4390475 |
End bp | 4391830 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641238260 |
Product | protease Do |
Protein accession | YP_001503446 |
Protein GI | 157963412 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.133395 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACTA AATTATCCTT ACTTTCAGCT GCGATATTGA GCGCAACCCT GACACTAGCC CCTGTTGCTT CTCAAGCTGC AATTCCTTTA GCCGTCGACG GACAAGAGCT ACCAAGCTTA GCGCCCATGC TGCAAGAAAC AACGCCAGCC GTAGTAGCAG TTGCAGTCTC TGGAACCCAT GTTTCTAAAC AGAAGGTCCC CGATGCTTTC CGTTATTTCT TCGGCCCTAA CGCTCCACGT GAACAAGTGC AAGAACGTCC ATTTAAAGGG CTAGGTTCAG GGGTCATTAT CGATGCCAAA GAAGGTTATA TCGTCACTAA CAACCATGTG ATTGAAGGTG CCGATGAGAT CCTTATCGGG CTACATGACG GCCGAGAAGT CGAAGCGAAA CTAATTGGCA CTGACGCAGA GTCTGATATC GCTTTATTGC AGATAAAAGC CAAGAATCTG ACCGCATTGA AACGCGCCGA CTCAGATAAA CTCCAAGTCG GTGACTTTGC TGTAGCCATT GGAAATCCAT TCGGTCTAGG TCAAACAGTT ACTTCAGGAA TAGTCAGTGC CATGGGCCGA AGCGGTCTTG GCATCGAGAT GCTAGAAAAC TTTATTCAAA CCGATGCCGC AATCAATAGC GGTAATTCCG GTGGCGCCTT GGTTAACCTC AATGGTGAGC TTATCGGTAT CAATACCGCG ATTGTAGCTC CTGGTGGCGG CAACGTCGGT ATTGGTTTCG CTATCCCTGC CAATATGGTT AATAACCTCG TTAAGCAGAT TATTGAACAC GGTGAGGTTC GCCGCGGCGT GCTTGGCGTG ATGGGACAAG ATCTCACCAG TGAACTCGCC AAAGGTTTCG GTATCGAAAC TCAGCACGGC GGCTTTATCA ACGAAGTGAT GCCTGACAGC GCCGCTGCCA AAGCGGGTAT AAAAGTCGGT GATATTATTG TCAGCGTTAA TGGACGCAGC ATTAAGAGCT TCCAAGAGCT ACGTGCAAAA GTCGCGACTA TGGGCGCGGG CACTAAAGTG GAATTTGGTT TAATCCGCGA TGGCGATGAA GAAACGGTTA CCGCAGTACT CGGCGAGTCA ACTCAGGCGG CAGAAGCCGC TGCGGGCGCA GTGCATCCTA TGCTACAAGG GGCGAAGCTG GAAACGGCAA GTTCGTCTGG AGTTGAGATC ACAGATGTAG CTCAAGGCTC TCCAGCTGCA GCAAGTGGCT TAATAAAAGG TGATATTATC GTTGGTGTTA ACCGCACTAA GGTGAAAAAC CTCAAAGCCC TTAAATCTGC GCTTGAAGAT CAAAAAGGCT CAGTGGCGCT AAAAATTAAG CGAGATAACA CTAGCTTGTA TTTGATCCTG AGGTAA
|
Protein sequence | MKTKLSLLSA AILSATLTLA PVASQAAIPL AVDGQELPSL APMLQETTPA VVAVAVSGTH VSKQKVPDAF RYFFGPNAPR EQVQERPFKG LGSGVIIDAK EGYIVTNNHV IEGADEILIG LHDGREVEAK LIGTDAESDI ALLQIKAKNL TALKRADSDK LQVGDFAVAI GNPFGLGQTV TSGIVSAMGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL NGELIGINTA IVAPGGGNVG IGFAIPANMV NNLVKQIIEH GEVRRGVLGV MGQDLTSELA KGFGIETQHG GFINEVMPDS AAAKAGIKVG DIIVSVNGRS IKSFQELRAK VATMGAGTKV EFGLIRDGDE ETVTAVLGES TQAAEAAAGA VHPMLQGAKL ETASSSGVEI TDVAQGSPAA ASGLIKGDII VGVNRTKVKN LKALKSALED QKGSVALKIK RDNTSLYLIL R
|
| |