Gene Dtox_3994 details


Gene Information

Locus tag: Dtox_3994
Symbol:
ID: 8431009
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 4179783
End bp: 4181621
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 44%
IMG OID: 645036211
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_003193309
Protein GI: 258517087
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
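
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4179783, 4181621)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1839 612"
```

Under these assumptions the results agree with the Gene Length (1839 bp) and Protein Length (612 aa) fields.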


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.460388
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGGTA AGGCAAGATT ACTGGTTCTG CTGGGCTGTG ATTTTCTGTT GGTGATTATG 
TCTTTGCTCG TTTCACTGCT TATCCGTTTC CCCAGTTGGC CTGAATTAAG CAATGCTCTA
GTTAACTACA TTAATTTTGC CCCGGTTTGT GCGCTTGTTA TGCTTGTCTT TTTTTATTTT
TTTGGCCTTT ATCAAAGAGT ATGGGCCTAT GCCAGCATAG GTGAGTTAGT GACCATAGTA
AAAGCGGTTA CAACAGGAAA ACTTGTTGTC ATTGCTTTGA CCTATTTTAT TTTTACCCCG
TTACCCAGAA GTGTTGTATT AATGTCCTGG GCTTTTAGCA TTATTCTTAT TGGCGGCTCC
AGGTTGTACT GGAGAATATA TATGCAGAAG AAGAAATTTA TCGTGGCCGG CTGCCCGCTG
GATAAAAGGA AAACGCTTAT AGTTGGTGCC GGTGATGCCG GGGTGCTGGT GGTTCGCGAA
CTAATGAACC ATAACAGTGA GTATTTGCCG GTAGGATTTA TTGATGATGA TGCAAGTAAG
CAGGGTATGG TTATCCTGGG CATACCGGTA TTAGGGAAAC GTGAAGAATT GCCGTCGATT
ATCGAAAAAT GCAGAATAAA AGAGGTAATT ATTGCTATGC CGTCAGTGTC TGGACAAGTG
ATAAAGGAAA CTGTCGAGAA GTGTCACAAT TCACGGGTAA AGTTAAAGAT ATTGCCGGGT
GTTTACCAGT TAATCAGCGG GCAAGTGACC GTAAATCATA TTCGCGATAT TCAGGTGGAA
GATTTATTGG GCAGAGAGCC TGTTGAGGTA GATTTAAGCG AGATTGCCGG TTATTTAACG
GACAGGGTGG TTTTGGTTTC CGGTGCCGGC GGTTCGATTG GTTCAGAGCT ATGCAGGCAG
GTGGTCAGGT TGAAGCCTAA GCTTCTGGTT GTTTTAGGGC ATGGTGAGAA CAGTATACAT
AATATAGTTT TTGAACTTCG TGAAATGCAT GGCAGCGATC TGCCTATTGA GATAGTGATT
GCTGATATAA GGGACAGGCA GAAGATTAAT TTGATTTTTA AGAAATATAG GCCGTCAGTG
GTTTTTCATG CTGCTGCGCA TAAGCATGTA CCTCTGATGG AACTGCACCC TGATGAGTCA
GTGAAGACGA ATGTTTTGGG TACAAAGAAT TTAGCAGAAG CAGCCGACAG GGTTGGAACA
GATGTTTTTA TTATGCTTTC AACAGATAAA GCAGTTAATC CTTCCAGTGT TATGGGGGCT
ACCAAGCGTT TGGCTGAGCT GATATTGCAG CAGATGAACA GTATAAGTGA TACTGTTTAT
GCGGCTGTTC GGTTTGGCAA TGTCTTGGGG AGCAATGGCA GTGTAGTGCC TATCTTTAAG
CGGCAGATTG CTCAGGGAGG GCCGGTTACT GTTACTCACC CGGAAATGAA GAGGTACTTT
ATGACTATAC CCGAGGCTGT GCAGTTGGTG ATTCAGGCAG GGGCTATGGC TCAGGGTGGG
GAGATATTTG TGCTGGACAT GGGTGAGCCG GTGAAGATTG TGGATTTAGC TAAATGTATT
ATTGATTTGT CGGGCGTGGA TTGTGAAATT AAATTTACCG GGATTAGGCC GGGGGAGAAG
CTGTTTGAGG AATTGCTGAC GGCAGAAGAG GGTTCTTCTG CTACCCGGCA TAGGAGGATA
TTTGTGGCTA ATGCGGGGAG TGTGGATTTG GAGACATTGG AGTTGGAAGT TTTTCGTTTG
AGAGAGTTGG GAGAGGATGT TGTGACCGGG GATGTTTTTA AAGCATTGAC GGTTCTCCTG
CCAAATATAA AGATATATCG AAAAGATATG GTTGGCTAG
 
Protein sequence
MRGKARLLVL LGCDFLLVIM SLLVSLLIRF PSWPELSNAL VNYINFAPVC ALVMLVFFYF 
FGLYQRVWAY ASIGELVTIV KAVTTGKLVV IALTYFIFTP LPRSVVLMSW AFSIILIGGS
RLYWRIYMQK KKFIVAGCPL DKRKTLIVGA GDAGVLVVRE LMNHNSEYLP VGFIDDDASK
QGMVILGIPV LGKREELPSI IEKCRIKEVI IAMPSVSGQV IKETVEKCHN SRVKLKILPG
VYQLISGQVT VNHIRDIQVE DLLGREPVEV DLSEIAGYLT DRVVLVSGAG GSIGSELCRQ
VVRLKPKLLV VLGHGENSIH NIVFELREMH GSDLPIEIVI ADIRDRQKIN LIFKKYRPSV
VFHAAAHKHV PLMELHPDES VKTNVLGTKN LAEAADRVGT DVFIMLSTDK AVNPSSVMGA
TKRLAELILQ QMNSISDTVY AAVRFGNVLG SNGSVVPIFK RQIAQGGPVT VTHPEMKRYF
MTIPEAVQLV IQAGAMAQGG EIFVLDMGEP VKIVDLAKCI IDLSGVDCEI KFTGIRPGEK
LFEELLTAEE GSSATRHRRI FVANAGSVDL ETLELEVFRL RELGEDVVTG DVFKALTVLL
PNIKIYRKDM VG