Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sbal223_0721 |
Symbol | |
ID | 7088113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella baltica OS223 |
Kingdom | Bacteria |
Replicon accession | NC_011663 |
Strand | - |
Start bp | 859527 |
End bp | 860879 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643459633 |
Product | protease Do |
Protein accession | YP_002356663 |
Protein GI | 217971912 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000974746 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.002505 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAACAA AATTATCTGT ACTTTCAGCC GCAATGTTAG CCGCGACCCT GACTATGATG CCTGTCGTCT CACAGGCCGC CATTCCACAG ACGGTTGAGG GACAATCCAT TCCAAGTCTT GCGCCCATGC TTGAGCGTAC GACACCTGCT GTTGTGTCCG TAGCCGTTAC GGGGACTCAT GTTTCAAAAC AAAGAGTACC TGATGTTTTC CGTTACTTCT TTGGCCCCAA TGCGCCACAG GAGCAAGTGC AGGAACGTCC TTTTAGAGGC TTAGGCTCAG GCGTTATTAT TGACGCCGAA AAAGGCTACA TAGTCACTAA CAACCACGTG ATTGACGGTG CTGATGATAT TCAAGTTGGT CTACATGACG GTCGTGAAGT AAAAGCCAAA CTCATTGGTA CTGACTCAGA ATCGGATATT GCATTGCTGC AAATTGAGGC GAAAAATCTT GTCGCAATCA AGTCATCAAA CTCTGACGAC TTACGTGTTG GTGACTTTGC CGTTGCCATT GGTAACCCCT TCGGTTTAGG GCAAACCGTG ACTTCAGGTA TCGTCAGTGC TTTAGGCCGT AGCGGTTTAG GCATAGAAAT GCTGGAAAAC TTTATTCAAA CCGATGCCGC GATTAACAGT GGTAACTCGG GTGGTGCACT CGTTAACTTA AACGGGGAAT TGATTGGGAT TAACACCGCA ATCGTAGCGC CGGGAGGTGG TAACGTTGGT ATCGGCTTTG CGATCCCAGC CAATATGGTG AAAAACTTGG TCGCGCAAAT TGCCGAACAC GGTGAAGTAC GCCGCGGAGT ACTGGGGATT TCGGGACGCG ATCTTGATAG CCAACTCGCC CAAGGCTTTG GTTTAGACAC CCAGCACGGC GGCTTCGTGA ATGAAGTCGC TAAAGACAGT GCCGCCGAGA AAGCCGGTAT TAAAGCGGGT GATATCATTA TCAGCGTCGA TGGCCGTGGC ATTAAGTCCT TCCAAGAACT GCGTGCCAAA GTCGCCACTA TGGGCGCGTG TGCTAAGGTT GAACTAGGAC TCATCCGCGA CGGTGATAAG AAAACCGTTA AGGTGACCTT AGGTGAAGCA AGCCAAACCA GTGAATCTGC GGCGGGCGCC GTGCATCCTA TGTTGCAAGG TGCATCCTTA GAAAACTCAT CTAAGGGCAT TGAAATAACC GATGTCGCCC AAGGTTCACC TGCGGCAATG AGTGGCTTGC AAAAAGGAGA TGTGATTGTC GGTATTAACC GTACTGCCAT TAAAGATCTT AAGGCACTTA AAGCACAGCT AAAAGATCAA GAAGGTGCGG TGGCATTGAA GATCATGCGT GATAAGAGCA TGTTGTATTT AGTCCTACGT TAA
|
Protein sequence | MKTKLSVLSA AMLAATLTMM PVVSQAAIPQ TVEGQSIPSL APMLERTTPA VVSVAVTGTH VSKQRVPDVF RYFFGPNAPQ EQVQERPFRG LGSGVIIDAE KGYIVTNNHV IDGADDIQVG LHDGREVKAK LIGTDSESDI ALLQIEAKNL VAIKSSNSDD LRVGDFAVAI GNPFGLGQTV TSGIVSALGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL NGELIGINTA IVAPGGGNVG IGFAIPANMV KNLVAQIAEH GEVRRGVLGI SGRDLDSQLA QGFGLDTQHG GFVNEVAKDS AAEKAGIKAG DIIISVDGRG IKSFQELRAK VATMGACAKV ELGLIRDGDK KTVKVTLGEA SQTSESAAGA VHPMLQGASL ENSSKGIEIT DVAQGSPAAM SGLQKGDVIV GINRTAIKDL KALKAQLKDQ EGAVALKIMR DKSMLYLVLR
|
| |