Gene Dtox_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_3355 
SymbolclpX 
ID8430349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp3560107 
End bp3561357 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content44% 
IMG OID645035589 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_003192708 
Protein GI258516486 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0291024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAACG AGAAAGGTCA GCTCAAGTGT TCCTTTTGCG GTAAACTGCA GGATCAGGTA 
AAAAAATTGG TTGCCGGGCC AGGTGTTTAT ATCTGCGATG AGTGCATTGA ACTTTGTAAC
GAGATCATTG AAGAGGAATT AAGTGATGAC TTAAATTTGG ATATGAGTGA TGTTCCAAAA
CCAAAAGAAA TCAAAGAAAT ACTGGACCAG TACGTTATAG GACAGGAAAG TGCCAAAAAA
GCGCTTGCCG TAGCCGTTTA CAACCACTAT AAGCGGATAA ATCTGGGCGG TAAAATTGAT
GATGTTGAAT TGCAGAAAAG TAATATTGTT ATGCTTGGTC CTACCGGCAG CGGTAAGACG
CTGTTGGCTC AGACCTTGGC CCGCCTGTTG AATGTTCCTT TTGCTATTGC CGATGCCACC
TCTTTGACAG AGGCAGGTTA TGTAGGTGAG GACGTTGAGA ACATTCTGTT GAAGCTTATC
CAGGCTGCTG ACTATGATGT GGAAAAAGCA GAAAAAGGTA TAGTTTATAT TGATGAGATT
GATAAAATTG CCCGCAAGTC GGAAAATCCT TCTATTACCA GGGATGTATC CGGTGAAGGG
GTACAGCAGG CTTTATTAAA GATACTGGAA GGGACTGTGG CCAGTGTACC TCCCCAGGGA
GGCCGCAAGC ACCCGCATCA GGAATTTATC CAGTTGGATA CCACCAATGT ACTGTTTATT
TGCGGTGGGG CCTTTGACGG TATAGATAAA ATCATTCAAA ACCGTACCGG CAAAAAGTCT
ATGGGCTTTG GTGCCGAAAT TAAAGCAATG CGCGAACAGC GTATCGGTGA GATTTTAAGC
AACATTTTAC CGGAGGATTT GCTCAAATAC GGCTTGATTC CTGAGTTTGT AGGCCGTCTG
CCGGTTATTG TTACACTGGA TATGCTGGAT GAAGATGCTC TGGTACGCAT TTTGACCGAG
CCTCGCAATG CTTTGATTAA ACAGTATGAA AAGCTGTTTG AGCTGGACGG AGTAGCTATT
GAATTTCAAG CTGACGCTCT TAAGTGCATA GCTAAGGAAG CCCTGCGCCG TAATACCGGA
GCCAGGGGCC TGAGGGCTAT ACTGGAGGAT GTCATGCTGA ATATTATGTA TGAGATTCCG
ACCAGAGATG ATATAGCCAA GTGCATTATC AATAAGGACA CTATTGAGAA GAAAGAAGAT
CCGGTAATCA TCTCGGTTGA CAGGAAGAAG AAAAAAGAAG AATCAGCTTA A
 
Protein sequence
MFNEKGQLKC SFCGKLQDQV KKLVAGPGVY ICDECIELCN EIIEEELSDD LNLDMSDVPK 
PKEIKEILDQ YVIGQESAKK ALAVAVYNHY KRINLGGKID DVELQKSNIV MLGPTGSGKT
LLAQTLARLL NVPFAIADAT SLTEAGYVGE DVENILLKLI QAADYDVEKA EKGIVYIDEI
DKIARKSENP SITRDVSGEG VQQALLKILE GTVASVPPQG GRKHPHQEFI QLDTTNVLFI
CGGAFDGIDK IIQNRTGKKS MGFGAEIKAM REQRIGEILS NILPEDLLKY GLIPEFVGRL
PVIVTLDMLD EDALVRILTE PRNALIKQYE KLFELDGVAI EFQADALKCI AKEALRRNTG
ARGLRAILED VMLNIMYEIP TRDDIAKCII NKDTIEKKED PVIISVDRKK KKEESA