Gene Dtox_4350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4350 
Symbol 
ID8431369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4527864 
End bp4529816 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content40% 
IMG OID645036543 
ProductSigma 54 interacting domain protein 
Protein accessionYP_003193636 
Protein GI258517414 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1067] Predicted ATP-dependent protease 
TIGRFAM ID[TIGR00764] lon-related putative ATP-dependent protease
[TIGR02903] ATP-dependent protease, Lon family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCTT TTCTGGAAAA ATTTATGGGT GCCGCAAAGG CTGACAAATT TGAAGACAAG 
CCTGGCATTA ATGAACAATT AAAATACAGG GTTAACGCTT TATATGATTT GATAGCTAAT
ATTTACGGTT CTGATAAACT GGTATTGAGA GCCGGAAAAC TGGACGCTCT TAAACTGATG
CGTTCTGAGA TACTTTCAAA AAGAGTTTTA GCTTTGCAAA AAATTATATT TGAGGATGCT
GTGACAGATA TTGAACCAGA TCATGAAGAA ATACCAATAA TTTTAGATAA AATAGAAGAT
AAGATATCCG ATTATTTAGC CAGAAGATCA TTGGAAGATG AACTGGAAAG AAAAATCAAG
GAAAAGATGA ATGAAAGACA GGAGGATTAT TTGCAAGAAA TTAAAATGCA GGTAATTAAG
GAAAATACGG GTCCGGAAAA CGCTCAGACC CTTAAGAAGC TTGCTGTTTT GGAAAAGCTG
GAGCATAAAA AATTATCAAG CACGGCCATG GAGTTTTTGC GGCCAACCTG CTTTGGCGAA
ATTATCGGGC AGGAAAGGGC TGTCAAAGCC CTGCTGTCTA AACTTGCTTC ACCCTTTCCC
CAGCATATAC TATTATACGG GCCTCCCGGT GTAGGTAAAA CAACTGCTGC CCGCTTGGCC
TTGGAGGCAG CTAAAAATAT CAGTGGCACT CCTTTTGTTA AAGATGCTCC TTTTATTGAG
GTTAACGGTA CAACGCTGCG CTGGGATTCC AGGGAAATTA CCAACCCTCT GCTGGGTTCT
GTTCATGACC CGATTTATCA GGGAGCACGC CGTGATTTGG CTGATTCAGG AATACCTGAG
CCGAAATTGG GTTTGGTCAC GGACGCTCAC GGGGGCATAC TTTTTATTGA TGAGATAGGT
GAGATGGACC CGATATTGCA GAATAAATTA TTAAAAGTGC TTGAGGATAA GAGAGTATTT
TTTGAATCCA CTTATTATGA TCCGGGTGAC CAGAGTATAC CCCGGTATAT TAAAAAATTA
TTTGATCAAG GAGCACCGGC AGATTTTTTA CTGATCGGAG CTACAACAAG AAGTCCGGAG
GATATTAACC CGGCTATCAG ATCGCGCTGT GCAGAGGTTT ATTTTGATCC TTTAACTCCA
GGAGCTATTA AACAGATAGT AAATCAGGCG GCAGATAAAC TTAATATTGG ACTGGAAGAT
AAGGTTGCGG AAATAATTAG CGAATATACT ATTGAAGGGC GTAAAGCAGT TAATATTCTG
GCTGATGCCT ATGGGATGTC ATCTTATAGA AATGAAGAGA CTGCTTATGA GCAAAAAAAA
TTATATATTT GTGAGCTTGA TATTTACGAT GTTGTGCAGA CCAGCCGTCT TACTCCCTAT
GTGTTGTATA AAGCTTCTTC ACGAAAAGAA ACAGGTAAAA TCTTTGGCCT GGGTGTTACA
GGTTTTATTG GGTCCGTGTT GGAAATTGAG GCCATTACGT TTTCTGCCAG ACAACCGCAG
AGAGGTAATA TACGCTTTAA TGATACTGCC GGCAGTATGG CCAAAGATTC GGTATTTAAT
GCTGCTTCAG TAATTCGCAA GTTAACCGGG GAAGATTTGA ACAATTACGA TGTGCATATT
AATATTATTG GCGGAGGGCG TATTGATGGT CCTTCTGCCG GAGTAGCCAT GCTGCTGGCA
GTCTTAAGCA CTATAAAGGA TTATCCTATT CCTCAGGATA TTGCTGTTAC CGGTGAGGTG
TCCATTCAAG GAAAGATAAG AGCTGTGGGG GGTATAGCGG AAAAAATATA TGGTGCCAGG
CAAGCCGGTA TAAAAACAGT TTTTATCCCG GCTGAGAATT TTTCTGATAT ACCGAGAGAT
ATAAAAAATA TTAAAATAAT TCCTGTAAGC TCTGTAGAAG AAATAATAGA TTATATTTTC
CCTAATACAA AATTTAATCA ATCAGTAAGT TAA
 
Protein sequence
MKAFLEKFMG AAKADKFEDK PGINEQLKYR VNALYDLIAN IYGSDKLVLR AGKLDALKLM 
RSEILSKRVL ALQKIIFEDA VTDIEPDHEE IPIILDKIED KISDYLARRS LEDELERKIK
EKMNERQEDY LQEIKMQVIK ENTGPENAQT LKKLAVLEKL EHKKLSSTAM EFLRPTCFGE
IIGQERAVKA LLSKLASPFP QHILLYGPPG VGKTTAARLA LEAAKNISGT PFVKDAPFIE
VNGTTLRWDS REITNPLLGS VHDPIYQGAR RDLADSGIPE PKLGLVTDAH GGILFIDEIG
EMDPILQNKL LKVLEDKRVF FESTYYDPGD QSIPRYIKKL FDQGAPADFL LIGATTRSPE
DINPAIRSRC AEVYFDPLTP GAIKQIVNQA ADKLNIGLED KVAEIISEYT IEGRKAVNIL
ADAYGMSSYR NEETAYEQKK LYICELDIYD VVQTSRLTPY VLYKASSRKE TGKIFGLGVT
GFIGSVLEIE AITFSARQPQ RGNIRFNDTA GSMAKDSVFN AASVIRKLTG EDLNNYDVHI
NIIGGGRIDG PSAGVAMLLA VLSTIKDYPI PQDIAVTGEV SIQGKIRAVG GIAEKIYGAR
QAGIKTVFIP AENFSDIPRD IKNIKIIPVS SVEEIIDYIF PNTKFNQSVS