Gene Dtox_1969 details


Gene Information

Locus tag: Dtox_1969
Symbol:
ID: 8428951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2102322
End bp: 2105753
Gene Length: 3432 bp
Protein Length: 1143 aa
Translation table: 11
GC content: 52%
IMG OID: 645034297
Product: GLUG domain protein
Protein accession: YP_003191428
Protein GI: 258515206
COG category:
COG ID:
TIGRFAM ID:
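
The coordinates and lengths listed above agree with one another. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2102322
end_bp = 2105753

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the length should divide evenly for a CDS.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```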


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000014348
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000694878
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACATTTT GTGACTTCAG CGAAGCGAAA GGAATGGGTG TAAAATTGAG AAAAACGATG 
CTTAGAAAAC CTCTTACCAT ATTTATGGTG CTTATTATGG CGCTCGCCCT GTTGCCTGTA
ATGGCCCCGG CGGAGGACGG CGCGGGTACC CCGGCGATGT CATCCCTGGC GCTGTTCACG
GATAAATCCT GCGTGACGCC TATTCAGTTT GACGAGCCTT TTTCAAGCGG CCATTTTGAT
TACAATGCGA ATATTCCCGA CTATTTGAGC AGCGTCTACG CCATGGGCGT TGCAAGCAAC
AGTTCCGATT TCCTGAAGTT TGAAATGCAA AGGTCCGGCT GGGGGGGAAC GGGTATCGCC
ATGGCGAATC TCCCTGTATT GATCAGCAAT TTAAACTTAT ATCAAACCAA TGCAAACAGT
ACATTCACCT TCAGGGCCGG CCCCCAGAGC AACATTAGCG CGGATAACGG AGACGTCTTT
TATCATGTGC ATTTCAAGAG GATGCCGACG TTAAAAAGCC TTAGTGTAAA CGATACGTTA
ACACCGACGT TTAACAGAGA TGTTTTCTCC TATACCGCCG GTGTTGACGC CGAGGCGGCT
TCAGTGGATA TCGCGGTCAC CGGATATAAT AAAGCCTACG TGATCACTGT AAACGGGCAG
GCTCCTGTGG ACGGCACGGC AAGTGTTACG TTGAATTGGA GTGCCGACGG AACCATGCAG
GTCCCGATCA CTGTCAGCGG TGGCACCTCC GAAACAATCT ACAACCTTGG CCTGATCCGC
GAAACCAAGT CCGATACCCC CGTGATATAT ATAAACCCCC TGCCCGCGGT TTATGTGTTC
ACCGAGACAC CAAGCCCTCT TACGGTAAGG GCGGCAGCCT ACGCGGATGT TTCCTACCAG
TGGTACGTGA ACACGGAGGA CAGCACCGAG GGCGGGCAGA AGATTGACGG CGCGGAGAGC
GCGTCCTATA TACCGGAATT GGAGACTGTC TCATCAACTA AGATTTACTA TTACTACTGT
GTGGCAACGA ATAACACAAC AGAGGATTTT GCCGTCAGCA AAACGGCCGC CGTAACCGTT
ATGCCTGACC CGACGCCGGT GATAACACGG GAAAATGCCG ACGGCAGCCC GTTGCCGGCG
CAGTATTATG AATATAATGT AGGCGACACG CCTGTGGGAA TGAAGGTCGA GACCACTTCC
GTTGCGGAGG GCGGCTACTT TTTCTACACC TGGAATTGTC GCTTGATCAG CGGCTCGGTC
ACTACCAAGT CCGGACCGGA TGAAAAAAAT GTGTGCTTGC CGTACATGAA TTCGGATGGA
GAAACTACTT ACTTCTGCAC GGTAACCCAC GTCATAAACG GGAAAGCATA CACGGCGGAC
AGCGAGGACT TCATTGTAAG GGTTTATGAA ACATCCGCAA AGATGCCCTA TATAAGCTCG
CAGCCCTCCG GCACGTCTTA TCAGCTGGGA GAGACGCAAA TTACGGACCT TGAGATCGGG
GCAATGGTTA TAGCCGGGAC TATGAGCAAT GAAATTTCCT ATCAATGGTA TTCCAGCACT
GATGGGGAAA CTTATGCGCC GATAGAGGGT GCAACCGGTT ATACCCTAAC CCCGAGCATA
TTGGACACGG CGGGTGTCAT CTATTATAAG TGCACGGTTA CTGATAATTT CACCAGTTTA
AGCGGCAAAA CATATACCAG CTCAATAGAT TCCGCAGTCG CAGTCATTCG ATTCATAGGC
GGATTCGGCC AATGGAACGG CAGCGGTAAG CAAGACGATC CTTTCCTGCT GGAAGATGTG
CAGGATATTG AAGAGCTTCG CGAATTGGTC AATACCGGCA CAAGCTTCGC GGACTACTAT
TTTAAAATGA ACGCGGATAT TACCCTGCCG GACGGCTGGG TGCCCGTAGG TTCTGTTAAG
GAAGGTTCCG CCAACGAAGG CGCCGGAGCC AATATTAATC CGTTTTCCGC GAACTTTGAC
GGCGGCGGAC ACACGATCAC CATACCGGCC GGCGGCAAGC CGCTGTTTGG GTATGTCAGG
CTGGCGACGA TAAGCAACCT GCAGATTTAC GGGACCGAAA TTGCCGGAGC GGCATTGATT
GATAATTATA CAATTGACTA CGGCCCCGGC GGTCAGTACT CTGATGTGCC GGCGTATACG
GCGGATATTA AAAACGTTAC GCTTAAATCA GGTTCCAAAA CGCTCGGCTC CGGCTTGGTC
AACGGAAACG GGTCCGGGGA GAACAATATA TATTTCACCG GATGTGTTGT CGAATCCGGT
GTTACCGTCG GTTATGACAA GAGCAGGTCC AATGTCGGCT CGCTAATCTC CAGCCTGAAC
GGCAATATTC TCAACTGCGT CAGCTATGCG GATGTATACG GGACTAACCA TGTGGGCGGA
CTCGTGGGAG AGAAGGGGCA ATCGATGGGC TACTGCACTG TTTGGGACAG CGCCTTCCAT
GGAACCGTTA CGGCAAGCGG CCAATTCGCA GGCGGGATCA TGGGCAGCGG ATATTTGGCC
GGTTCGGCGC CCAACAGCCC CTGTGTATCC ATTCAGAATT GCTCCGTTAC CGGAACCGTT
ACGGCTTCGG ACTATATCGG CGGGGTCTTC GGTGGTGAAG GCGGCGTTTC TGAGTGCTGG
GAAAACGGTA TTGGCTATAT TCAGAACAAT TATTTCAGCG GAACGCTTAC GGCAACTTCC
GGCAATGCTG CTCATGTAGG CGGAGTAATC GGCTACATGC ACTCGCTGGA CCGCTACAAT
ATTATTTCCA ATAACTACTA TATGGAAGGT TCCGGGGTGG ACAGTGGCAT AGGAGCTGTG
GCTGCCGTTG ACAAATCCAC CGGGCTTTAC AGCCGCGGTG ATGACCCCAC AGGCGCGGAC
GCGGACAAGC TTACTAAATC ATTCACGTCC GGGGAACTGA CCGACGGGAC TCTGCTCAGC
GTGCTGAACG CGGGAATAAA CAGCAGCGGG AACTGGACGG CCGGTTTCGA CGGGAAGCCT
GTAGTTGGAA GCACACGTCA TATTCTGATG ATTACGGTCG ACGGCATCCA AAGCAACCGT
TTTTCGTCCA ACGATATAAA CGACATTTAT ACCAAGAACA TTACGGTACT TTACAGCGAC
AGGTCCACAC AGTCGGTCAG CGCCAATGGT GCGACGGTTG AGGGCTTTAA TACAACGTCC
ACCGGTTATA AAACCGTTAC GGTAACGCTG CAAAACCACA CCTATATTTT CCGGCTTCAA
GTTACTACTG CCACAGGCGC AACCGGCAGT GATCTGAACA GCGACGGAAG TGTTAATGTG
CAGGATCTGA TCCTGGTGGG CCAGCATATT GGTGAGAGCG GTACCCCTGG CTGGATTGAT
TTTGATCTTA ACAGTGACGG CACTGTGGAT GTATTGGACA TGATCCTGGT GGGCCAGCAA
TTTACAGCTT GA
 
Protein sequence
MTFCDFSEAK GMGVKLRKTM LRKPLTIFMV LIMALALLPV MAPAEDGAGT PAMSSLALFT 
DKSCVTPIQF DEPFSSGHFD YNANIPDYLS SVYAMGVASN SSDFLKFEMQ RSGWGGTGIA
MANLPVLISN LNLYQTNANS TFTFRAGPQS NISADNGDVF YHVHFKRMPT LKSLSVNDTL
TPTFNRDVFS YTAGVDAEAA SVDIAVTGYN KAYVITVNGQ APVDGTASVT LNWSADGTMQ
VPITVSGGTS ETIYNLGLIR ETKSDTPVIY INPLPAVYVF TETPSPLTVR AAAYADVSYQ
WYVNTEDSTE GGQKIDGAES ASYIPELETV SSTKIYYYYC VATNNTTEDF AVSKTAAVTV
MPDPTPVITR ENADGSPLPA QYYEYNVGDT PVGMKVETTS VAEGGYFFYT WNCRLISGSV
TTKSGPDEKN VCLPYMNSDG ETTYFCTVTH VINGKAYTAD SEDFIVRVYE TSAKMPYISS
QPSGTSYQLG ETQITDLEIG AMVIAGTMSN EISYQWYSST DGETYAPIEG ATGYTLTPSI
LDTAGVIYYK CTVTDNFTSL SGKTYTSSID SAVAVIRFIG GFGQWNGSGK QDDPFLLEDV
QDIEELRELV NTGTSFADYY FKMNADITLP DGWVPVGSVK EGSANEGAGA NINPFSANFD
GGGHTITIPA GGKPLFGYVR LATISNLQIY GTEIAGAALI DNYTIDYGPG GQYSDVPAYT
ADIKNVTLKS GSKTLGSGLV NGNGSGENNI YFTGCVVESG VTVGYDKSRS NVGSLISSLN
GNILNCVSYA DVYGTNHVGG LVGEKGQSMG YCTVWDSAFH GTVTASGQFA GGIMGSGYLA
GSAPNSPCVS IQNCSVTGTV TASDYIGGVF GGEGGVSECW ENGIGYIQNN YFSGTLTATS
GNAAHVGGVI GYMHSLDRYN IISNNYYMEG SGVDSGIGAV AAVDKSTGLY SRGDDPTGAD
ADKLTKSFTS GELTDGTLLS VLNAGINSSG NWTAGFDGKP VVGSTRHILM ITVDGIQSNR
FSSNDINDIY TKNITVLYSD RSTQSVSANG ATVEGFNTTS TGYKTVTVTL QNHTYIFRLQ
VTTATGATGS DLNSDGSVNV QDLILVGQHI GESGTPGWID FDLNSDGTVD VLDMILVGQQ
FTA
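
The sequences in this record are printed in space-separated blocks of ten characters, so they need cleaning before any length or composition check. The following is a minimal sketch, assuming the sequence text has been copied into a Python string. The helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, used only as a small example.
first_line = "ATGACATTTT GTGACTTCAG CGAAGCGAAA GGAATGGGTG TAAAATTGAG AAAAACGATG"
cleaned = clean_sequence(first_line)

print(len(cleaned))                   # number of bases on that line
print(round(gc_percent(cleaned), 1))  # GC percentage of that line only
```

Running the same two functions on the full gene sequence would give values to compare against the Gene Length and GC content fields of the record.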