Gene Dtox_2083 details


Gene Information

Locus tag: Dtox_2083
Symbol:
ID: 8429065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 2260057
End bp: 2261607
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 42%
IMG OID: 645034404
Product: DSH domain protein
Protein accession: YP_003191535
Protein GI: 258515313
COG category: [L] Replication, recombination and repair
COG ID: [COG4581] Superfamily II RNA helicase
TIGRFAM ID:
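
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2260057
END_BP = 2261607
GENE_LENGTH_BP = 1551
PROTEIN_LENGTH_AA = 516


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under those assumptions, the span 2260057 to 2261607 covers 1551 bp, which is 517 codons, or 516 residues once the stop codon is excluded.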


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00750043
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTATG AAGATAATTC TGTTTATAAC AGCATTACCG AAACAGTCAA GGACCGTTTG 
CCGGCACTTT TTTTCGTTTT TAGCCGGGGA AAAACAGAAC TTATTGCCGA AGAGTTAAGC
CATGATTGGG ATTTTTTAAA ACCCGCTGAA AAGAAAACAG TACAGCAAAT AGTTGCTGAT
TTCGAAAAAC AAAACCCAAC TGCTTTTGAA CGTAAAAACA GAAGACTTTT AAAAAGACTG
CTTAGCCGTG GTATCGGCTA TCATCATGCC GGGCTGTCGC CGGTCTTGAA AAATCTAGTA
GAGACGCTTT ATGAGCGAAG GCTTATTTAT GCCTTGTGCT GTACTGAGAC CTTTGCTGCC
GGGGTTAACT TTCCGGCTTG CAGTACCATC TTTGATTCCT GCCGGAAGTG GGACGGCAAA
ACCTTCCGGG GGCTTTTAAA CCGTGAATTT TTTCAAATGG CAGGCAGGGC GGGCCGCCGT
GGATTTGATG AAAAGGGTTA TGTTTTTGTT CGTATTGATG AGCAGTACCC GGAGCAAACA
ACTTTCTTTG ATGAAGATGA AGTGGAATCA GTTAACAGTC ATTTAACCAT ATCGCCAAAT
ACTGCGTTAA ACCTTTTGCA GTGGAAAACA GATGCGGAAA TTGAGCATTT TTTAACCAAT
AATTTTGCGG TTTACCAGGG TAAAAAGGAA GAAAGAGCTG TTAATATTGA TATTGAAGCC
ATTACTGCTG AGATAGAAAA TTTGGAACAG CATTTTTGCG AAGAAAGAGA TACTCGTACC
TGTGCGCTTT ATCGCAAAAA ACTAAAGAAG GAACTGTATA AGCATTACCG GAGACGCAAA
AATAACCCGG ATTACCAGTC AAAAATAGAT GAGATCAAAG AAATCCTGGA CTTACCGGCC
AGAGATTGCG CCCATTCATT ATGTTTTTCA GCAAAGAGAA ACCTGGGTAA ACTTGTTTCA
GAAAGAAAGC GCTTAAACCG GCAGCGTGAA AAGCTGGCCG GGCAGCATGA AAATTACTTT
GATAAGTTCT CCAGTGTTTG CAGTTTGCTG GAGCAGTTGG GCTATATTGA AGGACGCATT
CTGCTGCCAA GGGGGATTTT TGCTTCAAAG ATACATATCC AAGAGATACT GGTTACTGAA
CTGATCTTCT CAGGAATTAT GTCGGATGCC ACACCGGCAG AGATTGCTGC TATTATAGTT
GGTATAGATT ATGAGGCCAA CAGAAGAGAT AAGATGATTC CCAATGTGGT AGACCTGTCG
AAAGTTGAAG AATTGCATAG AGAACTGCAA AAAAGCAACG TACCGTTGCA CTTTTGCAGT
TGGTCGCCTA TACCTGGTCC TCTCGCGTAT TTATGGCATG AAGGTAAAAG CTTTGGAAGC
TTGCTGGAAA TGACGGAAAT GCAGGAAGGA GATATCTTTT CCATGCTGAG AAGAGAGATT
GATTTATTAA GACAGATAGA ATCAGCATTA AAGGATGACC CGGCCTTGCA AGCCAAAATA
CGCGGAGTCA GATTAAGCTT GGACAGGGAT GAAGTTTCTG TTTCGTTTTA G
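
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one possible way to do that and to compute a GC fraction. It is not part of the source record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is included as sample input, so the printed figure describes that row and not the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence only; paste all rows to analyse the full gene.
first_row = "ATGAGGTATG AAGATAATTC TGTTTATAAC AGCATTACCG AAACAGTCAA GGACCGTTTG"

seq = clean_sequence(first_row)
print(len(seq), "bases in sample")
print(f"GC fraction of sample: {gc_fraction(seq):.2f}")
```

Running the same functions on all rows of the sequence gives a value that can be compared with the GC content field of the record.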
 
Protein sequence
MRYEDNSVYN SITETVKDRL PALFFVFSRG KTELIAEELS HDWDFLKPAE KKTVQQIVAD 
FEKQNPTAFE RKNRRLLKRL LSRGIGYHHA GLSPVLKNLV ETLYERRLIY ALCCTETFAA
GVNFPACSTI FDSCRKWDGK TFRGLLNREF FQMAGRAGRR GFDEKGYVFV RIDEQYPEQT
TFFDEDEVES VNSHLTISPN TALNLLQWKT DAEIEHFLTN NFAVYQGKKE ERAVNIDIEA
ITAEIENLEQ HFCEERDTRT CALYRKKLKK ELYKHYRRRK NNPDYQSKID EIKEILDLPA
RDCAHSLCFS AKRNLGKLVS ERKRLNRQRE KLAGQHENYF DKFSSVCSLL EQLGYIEGRI
LLPRGIFASK IHIQEILVTE LIFSGIMSDA TPAEIAAIIV GIDYEANRRD KMIPNVVDLS
KVEELHRELQ KSNVPLHFCS WSPIPGPLAY LWHEGKSFGS LLEMTEMQEG DIFSMLRREI
DLLRQIESAL KDDPALQAKI RGVRLSLDRD EVSVSF