Gene Dtox_3937 details


Gene Information

Locus tag: Dtox_3937
Symbol:
ID: 8430952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4111801
End bp: 4112808
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 46%
IMG OID: 645036155
Product: glyceraldehyde-3-phosphate dehydrogenase, type I
Protein accession: YP_003193253
Protein GI: 258517031
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
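
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4111801, 4112808)
print(length, protein_length_aa(length))  # prints: 1008 335
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.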


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000958634
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.595989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTAA GATTAGGCAT TAACGGTTTT GGCAGAATTG GGAGATGTGT CTTCCGGGCG 
GCAATGAATA ATCCCGAGGT GGAAATTGTT GCAGTGAATG ATTTAACCGA TGCCGCAACT
CTGGCCCACC TGTTGAAGTA CGATTCTGTG CATGGTACTT TCGATGCGCA AATCAGTGCT
GCAGAAGATG CAGTTATTGT TAACGGCAAG ACATTTAAGG TATTGGCCGA AACTAAGCCG
GAAGCTTTAC CCTGGGGAGA CTTAGGGGTA GATATTGTTG TGGAATCTAC CGGAAGGTTT
GTCAAGCGTG CAGACGCGGC CAGGCATTTA GCAGGAGGAG CTAAGAAGGT AATTATCTCA
GCGCCGGCCA AGGAAGAAGA TATTACGGTG GTTATGGGTG TGAACGAGGA TAAATATGAC
CCAGCCAAAC ATCACGTGCT TTCTAATGCT TCTTGCACTA CTAACTGTTT AGCGCCTTTG
GCCAAGGTTT TAAATGATAA ATTCGGAATT GTCAAAGGGT TAATGACGAC AGTACATTCT
TATACCAATG ATCAAAAGAT TTTGGATGCT CCGCATAAGG ATTTAAGGCG GGCCAGAGCC
GGGGGCATGT CCATAATTCC CACTACTACC GGAGCGGCTA AGGCTGTTTC TCTGGTGCTG
CCGGAACTGC AGGGAAAATT AAACGGTTTT TCCATGCGTG TACCTACGCC TAATGTGTCT
GTGGTGGATT TAGTGGTGGA AACTGTCAAG CCTACCTCTG TGGAAGAGGT CAACGCTGTT
TTAAAAGCTG CGTCTGAGGC CGAATTGAAG GGTATTTTAG AGTATTGTGA CCTGCCTCTT
GTTTCTAAGG ATTTTAACGG TAACCCTCGA TCTTCGATTT TGGACGCTCT GTCTACCATA
GTAATCGGCG GCAATATGGT GAAGGTTATC TCCTGGTATG ATAATGAATG GGGTTATTCC
AACCGGGTTG TTGACTTAGT TTTTTACATG GCCGGCAAGG GATTGTAA
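
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that step. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration, so the printed GC value is not the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGGCGGTAA GATTAGGCAT TAACGGTTTT GGCAGAATTG GGAGATGTGT CTTCCGGGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence to the same two functions should give a length and GC percentage comparable to the Gene Length and GC content fields.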
 
Protein sequence
MAVRLGINGF GRIGRCVFRA AMNNPEVEIV AVNDLTDAAT LAHLLKYDSV HGTFDAQISA 
AEDAVIVNGK TFKVLAETKP EALPWGDLGV DIVVESTGRF VKRADAARHL AGGAKKVIIS
APAKEEDITV VMGVNEDKYD PAKHHVLSNA SCTTNCLAPL AKVLNDKFGI VKGLMTTVHS
YTNDQKILDA PHKDLRRARA GGMSIIPTTT GAAKAVSLVL PELQGKLNGF SMRVPTPNVS
VVDLVVETVK PTSVEEVNAV LKAASEAELK GILEYCDLPL VSKDFNGNPR SSILDALSTI
VIGGNMVKVI SWYDNEWGYS NRVVDLVFYM AGKGL
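
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, assuming the sequence is given in the coding orientation and starts with ATG. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which this sketch does not handle. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("ATGGCGGTAA GATTAGGCAT TAACGGTTTT"))  # prints: MAVRLGINGF
```

The output matches the first 10 residues of the protein sequence listed above.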