Gene Dtox_4081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_4081 
Symbol 
ID8431095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp4250609 
End bp4252447 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content44% 
IMG OID645036280 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_003193378 
Protein GI258517156 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGGTA AGGCAAGATT ACTGGTTTTA CTGTGTTGTG ATTTTCTGTT GGTGATTATG 
TCTTTGCTCG TTTCACTGTT TATCCGTTTC CCCAGTTGGC CTGAATTAAG CAAGGCTTTA
GTTAACTATA TTAATTTTGC CCCGGTTTGT GCGCTTGTTA TGCTTGTCTT TTTTTATTTT
TTTGGCCTTT ATCAAAGAGT ATGGGCCTAT GCCAGCATAG GTGAGTTAGT GACTATAGTA
AAAGCGGTTA CAACAGGAAA ACTTGTTGTC ATTGCTTTGA CATATTTTAT TTTTACCCCG
TTACCCAGAA GTGTAGTATT AATGTCCTGG GCTTTTAGTA TTATTCTGAT TGGCGGTTCC
AGGTTGTGCT GGAGAATATA TGTACAGAAG AAGAAATTTG CCGTGGCCGG CTGCCCACTG
GATAAAAGGA AAACGCTTAT AGTTGGTGCC GGTGATGCCG GGGTGCTGGT GGTTCGCGAA
TTAATGAACC ATAACAGTGA GTATTTGCCG GTAGGATTTA TTGATGATGA TGCAAGTAAG
CAGGGTATGG TTATCCTGGG CATACCGGTG TTAGGGAAAC GTGAAGAATT GCCGTCGATT
ATCGAAAAAT ACAGAATTAA AGAAGTAATT ATCGCTATGC CGTCAGTGTC CGGACAAGTT
ATAAAGGAAA CTGTCGAGAA GTGTCACAAT TCACAGGTAA AATTAAAGAT ATTGCCGGGT
GTTTACCAGT TAATCAGCGG GCAAGTGACC GTAAATCATA TTCGCGATAT TCAGGTGGAA
GATTTATTGG GCAGAGAGCC TGTTGAGGTA GATTTAAGCG AGATTGCCGG TTATTTAACA
GATAGGGTGG TTTTGGTTTC CGGTGCCGGC GGTTCGATTG GTTCAGAGCT ATGCAGGCAG
GTGGTCAGGT TTAAGCCCAA GCTTCTGGTT GTTTTGGGGC ATGGTGAGAA CAGCATACAT
AATATAGTCT TTGAGCTTCG TGAAATGCAT GGCAGCGATC TGCCTATTGA GATAGTGATT
GCTGATATAA GGGACAGGCA AAAGATTAAT TTGATTTTTA AGAAATATAG ACCGTCAGTG
GTTTTTCATG CTGCTGCGCA TAAGCATGTA CCTCTGATGG AGCTGCATCC TGATGAGTCA
GTGAAGACGA ATGTTTTGGG TACAAAGAAT TTAGCAGAAG CAGCCGACAG GGTTGGAACA
GATGTTTTTA TTATGCTTTC AACAGATAAA GCAGTTAATC CTTCCAGTGT TATGGGGGCT
ACCAAGCGTT TGGCTGAGCT GATATTGCAG CAGATGAACA GTATAAGTGA TACTGTTTAT
GCGGCTGTTC GGTTTGGCAA TGTCTTGGGG AGCAATGGCA GTGTAGTGCC TATCTTTAAG
CGGCAGATTG CTCAGGGAGG GCCGGTTACT GTTACTCACC CGGAAATGAA GAGGTACTTT
ATGACTATAC CCGAGGCTGT GCAGTTAGTG ATTCAGGCTG GGGCTATGGC TCAGGGCGGG
GAGATATTTG TGCTGGACAT GGGTGAGCCG GTGAAGATTG TGGATTTAGC TAAATGTATT
ATTGATTTGT CGGGAGTGGA TTGTGAAATT AAATTTACCG GGATTAGGCC GGGGGAGAAG
CTGTTTGAGG AATTGCTGAC GGCAGAAGAG GGTTCTTCTG CTACCCGGCA TAGGAGGATA
TTTGTGGCTA ATGCGGGGAG TGTGAATTTG GAGACATTGG AGTTGGAAAT TCTTCGTTTG
AGGGAGTTGG GATGGGATGT TGTGACTGGG GACGTTTTTA AAGCATTGAC GGTTCTCCTT
CCAAATATAA AGATATATCG AAAAGATATG GTTGGTTAG
 
Protein sequence
MSGKARLLVL LCCDFLLVIM SLLVSLFIRF PSWPELSKAL VNYINFAPVC ALVMLVFFYF 
FGLYQRVWAY ASIGELVTIV KAVTTGKLVV IALTYFIFTP LPRSVVLMSW AFSIILIGGS
RLCWRIYVQK KKFAVAGCPL DKRKTLIVGA GDAGVLVVRE LMNHNSEYLP VGFIDDDASK
QGMVILGIPV LGKREELPSI IEKYRIKEVI IAMPSVSGQV IKETVEKCHN SQVKLKILPG
VYQLISGQVT VNHIRDIQVE DLLGREPVEV DLSEIAGYLT DRVVLVSGAG GSIGSELCRQ
VVRFKPKLLV VLGHGENSIH NIVFELREMH GSDLPIEIVI ADIRDRQKIN LIFKKYRPSV
VFHAAAHKHV PLMELHPDES VKTNVLGTKN LAEAADRVGT DVFIMLSTDK AVNPSSVMGA
TKRLAELILQ QMNSISDTVY AAVRFGNVLG SNGSVVPIFK RQIAQGGPVT VTHPEMKRYF
MTIPEAVQLV IQAGAMAQGG EIFVLDMGEP VKIVDLAKCI IDLSGVDCEI KFTGIRPGEK
LFEELLTAEE GSSATRHRRI FVANAGSVNL ETLELEILRL RELGWDVVTG DVFKALTVLL
PNIKIYRKDM VG