Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shew_3296 |
Symbol | |
ID | 4920797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella loihica PV-4 |
Kingdom | Bacteria |
Replicon accession | NC_009092 |
Strand | + |
Start bp | 3939261 |
End bp | 3940613 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640164908 |
Product | protease Do |
Protein accession | YP_001095421 |
Protein GI | 127514224 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0128319 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00139389 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACGA AATTATCACT ACTTTCCGCC GCACTGCTCG GCGCGAGCCT GATGGTTATG CCTGCCGTTT CTCAAGCCGC CATTCCACTG GCCGTCGAGG GGCAAGCGCT GCCCAGCTTG GCCCCCATGC TGGAGAAAAC CACCCCAGCC GTGGTTGCCG TCGCCGTCTC TGGCACCCAT GTCTCTAAGC AGCGTCTGCC CGATGCGTTC CGCTACTTCT TCGGCCCTAA TGCTCCCAGA GAACAGGTGC AAGAGCGTCC CTTCAAGGGT CTGGGTTCGG GCGTCATCAT AGATGCTAAG AAAGGCTATA TCGTCACCAA CAACCATGTG ATCGAAGGCG CCGACGAGAT CTTGATTGGC CTGCATGACG GCCGTGAGAT CGAGGCCAAG CTTATCGGTA CCGATGCCGA GTCAGACGTG GCCCTGCTAC AGATAGAAGC CAAGAACCTG GTGGCCCTCA AGCGCGCCGA CTCGGATGAA CTCAAGGTGG GCGACTTTGC CGTGGCTATC GGTAACCCCT TCGGTTTAGG CCAAACGGTC ACCTCGGGTA TCGTCAGCGC CATGGGCCGC AGCGGCCTAG GTATCGAGAT GCTGGAGAAC TTCATTCAGA CAGACGCCGC CATCAACAGC GGTAACTCGG GCGGCGCCCT GGTGAATCTC AACGGTGAGC TGATCGGCAT CAACACCGCC ATCGTCGCTC CGGGCGGCGG CAACGTGGGT ATCGGCTTCG CGATTCCCGC CAACATGGTC AACAACCTAG TGGATCAACT GATCGAACAC GGCGAGGTGC GCCGCGGCGT ACTAGGTGTC AGTGGTCGTG ACTTAGACAG CGAGCTGGCC CAGGGCTTCG GCTTGGATTC GCAGCACGGC GGCTTCGTCA ATGAAGTCAT GCCTGACAGC GCCGCCGACA AGGCGGGCAT CAAGGCTGGG GATATCATAG TCAGCGTCAA CGATAAGCCG ATCAAATCCT TCCAGGAGCT GAGAGCCAAG ATAGGCACCA TGGGCGCCGG CGCCAAGGTG AAACTCGGTC TTATCCGCGA CGGCGATGAG AAGACGGTCA CCGCAGTATT AGGCGAGGCG AGCCAGCAGA CAGAAACCGC GGCGGGCGCC GTACATCCAA TGCTGGCGGG TGCCACCCTG GAGAACAATA AGAAGGGGGT CGAGATCACC GAGATCGCCC AAGGTTCTCC GGCAGCGGCC AGTGGCCTGC TTAAGGGCGA TATCATAGTC GGGGTCAACC GTACCCGTAT CGAAGATCTT AAGGAGCTCA AGGCGGAGCT GAAGGAGCAA CACGGCGCCG TGGCCCTGAA ACTGCTGCGC GGTGACAACA GCCTCTATCT GGTTCTGAGA TAA
|
Protein sequence | MKTKLSLLSA ALLGASLMVM PAVSQAAIPL AVEGQALPSL APMLEKTTPA VVAVAVSGTH VSKQRLPDAF RYFFGPNAPR EQVQERPFKG LGSGVIIDAK KGYIVTNNHV IEGADEILIG LHDGREIEAK LIGTDAESDV ALLQIEAKNL VALKRADSDE LKVGDFAVAI GNPFGLGQTV TSGIVSAMGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL NGELIGINTA IVAPGGGNVG IGFAIPANMV NNLVDQLIEH GEVRRGVLGV SGRDLDSELA QGFGLDSQHG GFVNEVMPDS AADKAGIKAG DIIVSVNDKP IKSFQELRAK IGTMGAGAKV KLGLIRDGDE KTVTAVLGEA SQQTETAAGA VHPMLAGATL ENNKKGVEIT EIAQGSPAAA SGLLKGDIIV GVNRTRIEDL KELKAELKEQ HGAVALKLLR GDNSLYLVLR
|
| |