Gene Sbal223_2749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbal223_2749 
SymbolclpX 
ID7087128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella baltica OS223 
KingdomBacteria 
Replicon accessionNC_011663 
Strand
Start bp3226447 
End bp3227727 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content47% 
IMG OID643461636 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_002358660 
Protein GI217973909 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000397165 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.558913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGACA ACAAAAATAA TGGTGACAGC GGTAAATTGC TGTACTGCTC TTTTTGCGGA 
AAGAGCCAGC ATGAAGTCAG AAAACTCATT GCTGGCCCAT CAGTGTATGT ATGCGACGAA
TGTGTTGAGT TATGTAACGA CATCATTCGC GAAGAGATTA AAGAAATCTC TCCAAAGCGC
GATAGCGATA AGTTACCGAC ACCACATGAG TTGCGTGCAC ATTTAGACGA TTATGTGATT
GGCCAAGACA GGGCTAAGAA AGTGCTGTCT GTGGCAGTGT ATAACCACTA TAAGCGTTTG
AAAAATGCGT CGCCTAAAGA TGGTATCGAG CTGGGTAAGA GTAACATTTT ACTTATCGGT
CCAACGGGTA GTGGTAAGAC GCTGTTGGCT GAAACCCTTG CGCGCTCACT CAATGTGCCT
TTCACTATGG CCGATGCCAC AACACTGACT GAAGCGGGTT ATGTGGGTGA AGACGTTGAA
AACATCATTC AAAAGTTGCT GCAAAAGTGC GACTACGATG TAGAGAAAGC GCAGCGTGGT
ATCGTTTATA TCGATGAAAT TGATAAAATT AGCCGCAAGT CAGACAATCC ATCGATCACC
CGTGACGTAT CGGGTGAGGG CGTGCAGCAA GCGCTGCTTA AGCTGATTGA AGGTACTGTT
GCCGCGGTTC CACCACAAGG TGGCCGTAAG CATCCACAGC AAGAATTCTT ACAAGTCGAT
ACTTCTAAGA TCCTGTTTAT CTGTGGCGGT GCGTTTGCAG GGCTTGAGAA AGTGATTGAG
CAACGTGCAC ACGTTGGTTC GGGTATCGGT TTCGGTGCTC AGGTAAAAGG CGAAAAAGAA
AAGGCGACGA TTTCTGAAAC CTTAACTCAA GTTGAACCTG GCGATTTGGT CAAATATGGC
TTAATTCCTG AGTTTATCGG CCGTCTGCCA GTGGTTGCGA CTCTGACTGA GCTGGATGAA
GAAGCCCTAG TGCAAATTTT ATCTCAACCT AAAAATGCCC TGACTAAGCA GTACAGTGCG
TTATTTGAGA TGGAAGGCGT CGAGCTTGAG TTCCGCGAAG ATGCGCTTAA AGCGATAGCA
CACAAGGCGA TGTCACGTAA GACAGGTGCC CGTGGTTTAC GCTCAATCGT TGAAAGTATT
CTACTGGATA CTATGTACGA TATTCCTTCT GTCGATGGTG TAGTGAAGGC CGTTGTCGAT
GAATCAGTCG TGAAAGGTGA ATCTGCACCT ATCCTGATTT ATGAGCACAA TGAAGCCCAA
GCAGCCTCTG GCGAGCAATA A
 
Protein sequence
MGDNKNNGDS GKLLYCSFCG KSQHEVRKLI AGPSVYVCDE CVELCNDIIR EEIKEISPKR 
DSDKLPTPHE LRAHLDDYVI GQDRAKKVLS VAVYNHYKRL KNASPKDGIE LGKSNILLIG
PTGSGKTLLA ETLARSLNVP FTMADATTLT EAGYVGEDVE NIIQKLLQKC DYDVEKAQRG
IVYIDEIDKI SRKSDNPSIT RDVSGEGVQQ ALLKLIEGTV AAVPPQGGRK HPQQEFLQVD
TSKILFICGG AFAGLEKVIE QRAHVGSGIG FGAQVKGEKE KATISETLTQ VEPGDLVKYG
LIPEFIGRLP VVATLTELDE EALVQILSQP KNALTKQYSA LFEMEGVELE FREDALKAIA
HKAMSRKTGA RGLRSIVESI LLDTMYDIPS VDGVVKAVVD ESVVKGESAP ILIYEHNEAQ
AASGEQ