Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal_3654 |
Symbol | |
ID | 4844060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS155 |
Kingdom | Bacteria |
Replicon accession | NC_009052 |
Strand | + |
Start bp | 4276474 |
End bp | 4277826 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640120922 |
Product | protease Do |
Protein accession | YP_001051998 |
Protein GI | 126175849 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000180809 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAA AATTATCTGT ACTTTCAGCC GCAATGTTAG CCGCGACCCT GACTATGATG CCTGTCGTCT CACAGGCCGC CATTCCACAG ACGGTTGAGG GACAATCCAT TCCAAGTCTT GCGCCCATGC TTGAGCGTAC GACACCTGCT GTTGTGTCCG TAGCCGTTAC GGGTACTCAT GTTTCAAAAC AAAGAGTACC TGATGTTTTC CGTTACTTCT TTGGCCCCAA TGCGCCACAG GAGCAAGTGC AGGAACGTCC TTTTAGAGGC TTAGGCTCAG GCGTTATTAT TGACGCCGAA AAAGGCTACA TAGTCACTAA CAACCACGTG ATTGACGGTG CTGATGATAT TCAAGTTGGT CTACATGACG GTCGTGAAGT AAAGGCCAAA CTCATTGGTA CTGACTCAGA ATCGGATATT GCATTGCTGC AAATTGAGGC AAAAAATCTT GTCGCAATCA AGTCATCAAA CTCTGACGAC TTGCGTGTTG GTGACTTTGC CGTTGCCATT GGTAACCCCT TCGGTTTAGG GCAAACCGTG ACTTCAGGTA TTGTCAGTGC TTTAGGCCGT AGCGGTTTAG GCATAGAAAT GCTGGAAAAC TTTATTCAAA CCGATGCCGC GATTAACAGT GGTAACTCGG GTGGTGCACT CGTTAACTTA AACGGGGAAT TGATTGGGAT TAACACCGCA ATCGTAGCGC CGGGGGGTGG TAACGTTGGT ATCGGCTTTG CGATCCCAGC CAATATGGTG AAAAACTTGG TCGCGCAAAT TGCCGAACAC GGTGAAGTAC GCCGCGGAGT ACTGGGGATT TCGGGACGCG ATCTTGATAG CCAACTCGCC CAAGGCTTTG GTTTAGACAC CCAGCACGGC GGCTTCGTGA ATGAAGTAGC CAAAGACAGT GCCGCCGAAA AAGCCGGTAT TAAAGCGGGT GATATCATTA TCAGCGTCGA TGGCCGTGGT ATTAAGTCCT TCCAAGAACT GCGAGCCAAA GTCGCCACTA TGGGTGCGGG TGCTAAGGTT GAGTTAGGAC TCATCCGTGA CGGAGATAAG AAAACCGTTA AGGTGACCTT AGGTGAAGCA AGCCAAACCA GTGAATCTGC GGCTGGCGCC GTGCATCCTA TGTTGCAAGG TGCATCCTTA GAAAACACAT CTAAGGGCAT TGAAATTACC GATGTCGCCC AAGGTTCACC AGCGGCAATG AGTGGCTTAC AAAAAGGAGA TGTGATTGTC GGTATTAACC GTACTGCAAT TAAAGATCTT AAGGCACTTA AAGCACAGCT AAAAGATCAA GAAGGTGCGG TGGCACTGAA GATCATGCGT GATAAGAGCA TGTTGTATTT AGTCCTACGT TAA
|
Protein sequence | MKTKLSVLSA AMLAATLTMM PVVSQAAIPQ TVEGQSIPSL APMLERTTPA VVSVAVTGTH VSKQRVPDVF RYFFGPNAPQ EQVQERPFRG LGSGVIIDAE KGYIVTNNHV IDGADDIQVG LHDGREVKAK LIGTDSESDI ALLQIEAKNL VAIKSSNSDD LRVGDFAVAI GNPFGLGQTV TSGIVSALGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL NGELIGINTA IVAPGGGNVG IGFAIPANMV KNLVAQIAEH GEVRRGVLGI SGRDLDSQLA QGFGLDTQHG GFVNEVAKDS AAEKAGIKAG DIIISVDGRG IKSFQELRAK VATMGAGAKV ELGLIRDGDK KTVKVTLGEA SQTSESAAGA VHPMLQGASL ENTSKGIEIT DVAQGSPAAM SGLQKGDVIV GINRTAIKDL KALKAQLKDQ EGAVALKIMR DKSMLYLVLR
|
| |