Gene Sbal223_0721 details

Gene Information

Locus tag: Sbal223_0721
Symbol:
ID: 7088113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella baltica OS223
Kingdom: Bacteria
Replicon accession: NC_011663
Strand:
Start bp: 859527
End bp: 860879
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 49%
IMG OID: 643459633
Product: protease Do
Protein accession: YP_002356663
Protein GI: 217971912
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
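
The length fields in this record follow from its coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions about this record's conventions, and the function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 859527
END_BP = 860879


def gene_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1353 450
```

Both values agree with the Gene Length and Protein Length fields listed above.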


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000974746
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.002505
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAACAA AATTATCTGT ACTTTCAGCC GCAATGTTAG CCGCGACCCT GACTATGATG 
CCTGTCGTCT CACAGGCCGC CATTCCACAG ACGGTTGAGG GACAATCCAT TCCAAGTCTT
GCGCCCATGC TTGAGCGTAC GACACCTGCT GTTGTGTCCG TAGCCGTTAC GGGGACTCAT
GTTTCAAAAC AAAGAGTACC TGATGTTTTC CGTTACTTCT TTGGCCCCAA TGCGCCACAG
GAGCAAGTGC AGGAACGTCC TTTTAGAGGC TTAGGCTCAG GCGTTATTAT TGACGCCGAA
AAAGGCTACA TAGTCACTAA CAACCACGTG ATTGACGGTG CTGATGATAT TCAAGTTGGT
CTACATGACG GTCGTGAAGT AAAAGCCAAA CTCATTGGTA CTGACTCAGA ATCGGATATT
GCATTGCTGC AAATTGAGGC GAAAAATCTT GTCGCAATCA AGTCATCAAA CTCTGACGAC
TTACGTGTTG GTGACTTTGC CGTTGCCATT GGTAACCCCT TCGGTTTAGG GCAAACCGTG
ACTTCAGGTA TCGTCAGTGC TTTAGGCCGT AGCGGTTTAG GCATAGAAAT GCTGGAAAAC
TTTATTCAAA CCGATGCCGC GATTAACAGT GGTAACTCGG GTGGTGCACT CGTTAACTTA
AACGGGGAAT TGATTGGGAT TAACACCGCA ATCGTAGCGC CGGGAGGTGG TAACGTTGGT
ATCGGCTTTG CGATCCCAGC CAATATGGTG AAAAACTTGG TCGCGCAAAT TGCCGAACAC
GGTGAAGTAC GCCGCGGAGT ACTGGGGATT TCGGGACGCG ATCTTGATAG CCAACTCGCC
CAAGGCTTTG GTTTAGACAC CCAGCACGGC GGCTTCGTGA ATGAAGTCGC TAAAGACAGT
GCCGCCGAGA AAGCCGGTAT TAAAGCGGGT GATATCATTA TCAGCGTCGA TGGCCGTGGC
ATTAAGTCCT TCCAAGAACT GCGTGCCAAA GTCGCCACTA TGGGCGCGTG TGCTAAGGTT
GAACTAGGAC TCATCCGCGA CGGTGATAAG AAAACCGTTA AGGTGACCTT AGGTGAAGCA
AGCCAAACCA GTGAATCTGC GGCGGGCGCC GTGCATCCTA TGTTGCAAGG TGCATCCTTA
GAAAACTCAT CTAAGGGCAT TGAAATAACC GATGTCGCCC AAGGTTCACC TGCGGCAATG
AGTGGCTTGC AAAAAGGAGA TGTGATTGTC GGTATTAACC GTACTGCCAT TAAAGATCTT
AAGGCACTTA AAGCACAGCT AAAAGATCAA GAAGGTGCGG TGGCATTGAA GATCATGCGT
GATAAGAGCA TGTTGTATTT AGTCCTACGT TAA
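
The gene sequence is displayed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. As a rough sketch of how a GC percentage like the one in the Gene Information section could be recomputed, the helper names below are hypothetical, and the rounding rule the database uses is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGAAAACAA AATTATCTGT ACTTTCAGCC GCAATGTTAG CCGCGACCCT GACTATGATG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1353 bp sequence through the same two functions gives the whole-gene value.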
 
Protein sequence
MKTKLSVLSA AMLAATLTMM PVVSQAAIPQ TVEGQSIPSL APMLERTTPA VVSVAVTGTH 
VSKQRVPDVF RYFFGPNAPQ EQVQERPFRG LGSGVIIDAE KGYIVTNNHV IDGADDIQVG
LHDGREVKAK LIGTDSESDI ALLQIEAKNL VAIKSSNSDD LRVGDFAVAI GNPFGLGQTV
TSGIVSALGR SGLGIEMLEN FIQTDAAINS GNSGGALVNL NGELIGINTA IVAPGGGNVG
IGFAIPANMV KNLVAQIAEH GEVRRGVLGI SGRDLDSQLA QGFGLDTQHG GFVNEVAKDS
AAEKAGIKAG DIIISVDGRG IKSFQELRAK VATMGACAKV ELGLIRDGDK KTVKVTLGEA
SQTSESAAGA VHPMLQGASL ENSSKGIEIT DVAQGSPAAM SGLQKGDVIV GINRTAIKDL
KALKAQLKDQ EGAVALKIMR DKSMLYLVLR
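
The protein sequence can be checked against the gene sequence by translating it. Translation table 11 assigns the same amino acid to each codon as the standard code and differs in which codons may act as starts, so the sketch below builds the standard codon table and stops at the first stop codon. It does not handle alternative start codons, which does not matter here because the gene begins with ATG. The function and constant names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence above.
print(translate("ATGAAAACAA AATTATCTGT ACTTTCAGCC"))      # MKTKLSVLSA
print(translate("GATAAGAGCA TGTTGTATTT AGTCCTACGT TAA"))  # DKSMLYLVLR
```

The two outputs match the first and last ten residues of the protein sequence listed above.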