Gene Dtox_0834 details

Gene Information

Locus tag: Dtox_0834
Symbol:
ID: 8427773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 853007
End bp: 854329
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 47%
IMG OID: 645033189
Product: transposase, IS605 OrfB family
Protein accession: YP_003190363
Protein GI: 258514141
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
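
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the database record.

```python
# Values copied from the record above.
start_bp = 853007
end_bp = 854329

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1323
print(gene_length % 3) # 0
print(protein_length)  # 440
```

Under these assumptions the result matches the listed Gene Length (1323 bp) and Protein Length (440 aa).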


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00917857
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTGCAA CCCAAAAGAA CCGTATTAAA CATCTAACTA AAGAGCAATA TGCTCTACTG 
CAAAACCTCT GCCGGTATGC CAAAAACCTG TACAATGTGG CATTGTACAA CATACGGCAG
CACTACTTTG TCACCGGCAA GCTTCTAAGC TACGCCAAAA ACTGTGCCCT GTGCAAAACC
AACGAAAACT TTAAGATGCT GCAGGCCGGT GTTTCCCAGC AGATTATCCG GGTAGCTACC
CAAAGTTTCA AAAGCTTTCT CGGATTGAAA AGGTTAGCAG CTAAAGGCCA GTACCCGGCA
GAGAAAGTCC GTATCCCACG CTACCTGAAA AAAGACGGCT ACTTTCAACT GGTACTGTCA
ACCAACGCAA TAACCATAAA TGCCGGGTAC CTGCAGCTGC CGTTATCAAA CGTATTCAAA
AAAGACCACC CGGAGGCCAG GGACATCCGG TTTCCGTTTC CCGAGCGATT AGATAAAACC
AGTATTCGGG AGGTACGCAT CAATCCGGCT CATAAGGCCC ACTTCTTTGA AGTGGAGTAT
ATCTACCGTG ACAAGCCAGT AGTGCTGCCT TCCCTGGATA GCAATCGCAT CCTGGGCATA
GATTTGGGTG TAGATAACCT GGCTGCCTGT GCATCCACCA CCGGGCATGT CTTATTAATT
GACGGCAAGC AACTCAAGGC CGCCAACCAG TGGTATAACA AAGAAAGAGC CAGACTGCAG
TCTATTAAAG ATCTGCACAA CATTAAGAGT GAAACCCACA AGCTGGCTGC CCTTGCCGTA
TCCCGGGAAA ATTTTATCAC CGACTACTTG CGGAAAGCTG CCAAACATAT TGTAGAATTC
AGTATCTCCC TGGAGATTGG CACCGTGGTA GTTGGTGTAA ATAAGGAACA AAAGCAGGGA
GTCAACATTG GCCACGTTAA CAACCAGAAC TTTGTACAAA TCCCCCTCTG GAAGTTCCGG
CGTGTTCTGA AAAACATCTG CGATAAGTAC GGTATCACCT ATATTGAGGT AGAGGAAAGC
TACACCAGTA AAGCCAGCTT CCTTGATAAG GATTTTTTGC CGGAGTACGA TCCCGCGAAC
AAAAATGAAT ACACCTTCAG CGGTAGACGG GTTAAGCGGG GGTTGTATCG AACAAAAAAC
GGTTGTACTA TTCATGCCGA CATCAATGGT GCAGCGAATA TCATCCGCAA ATACCGGTTG
GATGGGGATT TTTCTGTTCT GGATAAGGGT ATATTCTTAA ATCCCTACCG GGTACAGGTT
CTAAATACGC CCCGTAAGAA ACCACCGGTA GTCCAGAAAA AGAAAAGTAA GGCAGCGGCG
TAA
 
Protein sequence
MFATQKNRIK HLTKEQYALL QNLCRYAKNL YNVALYNIRQ HYFVTGKLLS YAKNCALCKT 
NENFKMLQAG VSQQIIRVAT QSFKSFLGLK RLAAKGQYPA EKVRIPRYLK KDGYFQLVLS
TNAITINAGY LQLPLSNVFK KDHPEARDIR FPFPERLDKT SIREVRINPA HKAHFFEVEY
IYRDKPVVLP SLDSNRILGI DLGVDNLAAC ASTTGHVLLI DGKQLKAANQ WYNKERARLQ
SIKDLHNIKS ETHKLAALAV SRENFITDYL RKAAKHIVEF SISLEIGTVV VGVNKEQKQG
VNIGHVNNQN FVQIPLWKFR RVLKNICDKY GITYIEVEES YTSKASFLDK DFLPEYDPAN
KNEYTFSGRR VKRGLYRTKN GCTIHADING AANIIRKYRL DGDFSVLDKG IFLNPYRVQV
LNTPRKKPPV VQKKKSKAAA
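
The gene and protein sequences are printed in space-separated blocks of ten characters, so the blocks have to be joined before the two can be compared. The following is a minimal sketch, not the database's own tooling, that translates the first line of the gene sequence and compares it with the first line of the protein sequence. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables are understood to differ only in their permitted start codons); the helper names `strip_blocks` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 assigns amino acids as the standard code does.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = strip_blocks(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of each sequence, copied from the record above.
first_dna_line = "ATGTTTGCAA CCCAAAAGAA CCGTATTAAA CATCTAACTA AAGAGCAATA TGCTCTACTG"
first_protein_line = "MFATQKNRIK HLTKEQYALL"

translated = translate(first_dna_line)
print(translated)                                      # MFATQKNRIKHLTKEQYALL
print(translated == strip_blocks(first_protein_line))  # True
```

The same `translate` helper can be applied to the full gene sequence once all of its lines are concatenated; the loop stops at the terminal TAA codon.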