Gene Dtox_4079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4079 
Symbol 
ID8431093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4247241 
End bp4248617 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content42% 
IMG OID645036278 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_003193376 
Protein GI258517154 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAAAA TCTCAGTAGC CGGTACAGGC TACGTAGGCC TGGTAGCCGG AGTATGTTTT 
GCCGAAGTTG GTCACCATGT CACTTGTGTT GATATCGATG AACAAAAAAT CAATATGCTT
AAGCAAGGCA TTTCTCCTAT CTATGAAACA GGATTAGAGG AACTAATGCA GAAAAATTAT
GTTGCTGGTA GGCTTGATTT TACCACAGAC TATATCCATG CATACAAAGA TGCAGATGCC
GTTTTTATAG GTGTAGGTAC TCCCGAGCAG CCTGATGGCT CTGCCAACCT TTCCTACATA
GCTACCGCCG CCAGGCAAAT TGCAGAAACA ATAGAGAAGG ACTGCCTTGT GGTGGTAAAA
TCAACAGTTC CTGTTGGTAC AAATGATAAA GTGGAGCAAT TTATTAAGGA TTTTTTAGTT
AATAATGTTA AAGTAGAGGT GGCTAGCAAT CCTGAATTTC TGGCCCAGGG TACAGCGGTA
CGTGATACTC TTGAGGCAGC GAGAATCATC ATCGGTACCG ATAGTAAATG GGCTGAGGAA
ATGCTAATGA AAATATATGA ACCATTTAAT CTACCAATTG TATCTGTAAG TAGGCGGTCG
GCGGAGATGA TCAAATATGC TGCCAATGAC TTTTTGGCCT TAAAAATATC CTATATGAAT
GACATTGCTA ACCTCTGCGA GCTTGTAGGT GCAGATATCC AGGACGTAGC ACGCGGAATG
TCATTTGATG AACGTATTGG TAGTAAATTT TTAAATGCCG GAATCGGTTA CGGTGGTAGC
TGTTTCCCCA AGGATACCAA GGCACTGGAT TATCTGGCCA GGCAAAATGG CTATGAGATA
AGGACGGTTA AGGCTGCCAT TACGGTAAAT AATGATCAGA AAACAGTATT ATATAGAAAG
GCTTCGAACA GGTTAATTAC TTTTGATGGC CTGAAGGTGG CAGTGTTGGG ACTAACCTTT
AAGCCAGGAA CAGATGATTT GCGGGAAGCG CCGTCACTTG AAAATGTTCC GCTGCTGTTG
GATCGAGGGG CAATGATATA TGCTTATGAC CCTGTGGGTA GTGATAATTT TAGGCAGAAG
TATCCGGAGA GTGAGTATAA GATTACTTAC GTTAAGAGCC CAGAGGAGGC ACTTTCCGGT
GCTAATGTTT GCTTTATCTT CACCGAATGG GATGAAATTA AAGCAGTTAA GCCGGAAAAC
TATAAAAATC AAATGCGGAC ACCGCTGGTA TATGACGGCA GAAATATATA TGGCATTGAA
GATATGAAAG ATGCCGGAGT GGAGTATTAT TCTATTGGTA GACAAGCTGT TATAGAGAAT
AAAGCTGTTA GGTTAAGCAA TACACACAAC ACGAACCACC GCCTATATAG TGCTTAA
 
Protein sequence
MHKISVAGTG YVGLVAGVCF AEVGHHVTCV DIDEQKINML KQGISPIYET GLEELMQKNY 
VAGRLDFTTD YIHAYKDADA VFIGVGTPEQ PDGSANLSYI ATAARQIAET IEKDCLVVVK
STVPVGTNDK VEQFIKDFLV NNVKVEVASN PEFLAQGTAV RDTLEAARII IGTDSKWAEE
MLMKIYEPFN LPIVSVSRRS AEMIKYAAND FLALKISYMN DIANLCELVG ADIQDVARGM
SFDERIGSKF LNAGIGYGGS CFPKDTKALD YLARQNGYEI RTVKAAITVN NDQKTVLYRK
ASNRLITFDG LKVAVLGLTF KPGTDDLREA PSLENVPLLL DRGAMIYAYD PVGSDNFRQK
YPESEYKITY VKSPEEALSG ANVCFIFTEW DEIKAVKPEN YKNQMRTPLV YDGRNIYGIE
DMKDAGVEYY SIGRQAVIEN KAVRLSNTHN TNHRLYSA