Gene Dtox_3771 details

Gene Information

Locus tag: Dtox_3771
Symbol:
ID: 8430781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 3943903
End bp: 3945462
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 49%
IMG OID: 645035998
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_003193101
Protein GI: 258516879
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
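
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3943903
end_bp = 3945462
gene_length_bp = 1560
protein_length_aa = 519


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1560
print(expected_protein_length(gene_length_bp))  # 519
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.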


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0302209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGTTG TTACGGCAGG GGAAATGGCT GCTCTTGACA GGAAGGCTAT TGAGCAGTAT 
TGTATACCGG GCATTGTTCT TATGGAGAAT GCAGGTCTTC AAGTGGTTAA GGTTATAGTT
GATCTTTTGG ATAAACAAGT AAATGGCAAG AGAGTGGTTA TTTTTGCCGG CAAAGGCAAT
AACGGCGGGG ATGGGTTTGT TATTGCCCGC CACTTGCATA ATCTGGGTGC CGCTGTAGAT
GTCTTGCTTT TTGCTGATCC TGCGCAAATT TCCGGCGATG CCGCCATCAA TTTAAATATA
TGGCTTAAAA TGGGCCAGAC TATTGTTCGG ATAAATGATG GAAACGGTAA GTCTCCGGCT
GAACTTGCGG TTGCTTCCGC TGATATTATG GTAGATGCTC TTTATGGTAC CGGCTTTAAA
GGAGTTGTGC CTGAACCAGC GGCTTCTTTT ATCTTGCTGG CCAATAACAG CGGCAAACCC
ATTGTATGTG TGGATATTCC TTCCGGTGTA GAAGCTGATA CAGGTTGTGT CCGGGGACCC
TGTTTTAAGG CTGATCATAC AGTTACATTT GCGCTGCCCA AGCTGGGTCT TTTGCTGGAA
CCGGGTTCGT ACTATTCCGG CAAGCTGCAC ATAGTGGATA TTTCTTTGCC GGCCGTTTTA
TTAAAGTTCC GGAATTACGG GCGTTATTTG CTGCGGGATG AGCTTGTCAG TGAGTGGCTG
CCTCGCCGTT TTACCGGTGC TCATAAAGGA GATTGTGGAA GAGTATTAAT TCTAGCCGGA
TCCCGGGGAA TGACAGGAGC TGCCTGCCTG ACAGCTCAGG CTGCGATCAG ATCCGGTGCG
GGCCTGGTTA CGCTTGGTGT ACCTGAAGGT TTGCACGATA TTATGGAGAT TAAGTTAACC
GAGGTTATGA CGGTGCCGTT GCCGGAAACA GACAGGAAAA CTCTGTCTCT GGAGGCCCTG
GATCAGATAA AAGCGCTGTT AGACAGGTCT GATGTGCTTG CTTTGGGGCC GGGCCTGACG
GTGCATCCTG AGACCGTGGC TTTAGTGCAA AAGGTACTGG AAGATCTAAA AATACCTGCT
GTAATAGACG CGGACGGGTT AAATGCTCTG GCCGGGCAGA CCGGTATTTT GAGCAGAATT
AAAGCACCTG TTGTGCTAAC ACCCCATCCG GTAGAAATGG CTCGTTTGTT AAGCATAACT
GCCGGGGAAG TGTTGTCTGA TCGCTTAGGC TCAGTTCAAA AATTAGTGGA GGCCGGCGGG
TGCATTGTGC TTTTGAAGGG CTCCCGCACT CTGATAACGG ATGGGGAGGA AATCTATATT
AATGCCACAG GTAATCCGGG AATGGCTACC GGCGGAAGCG GTGATGTTTT GACCGGAGTT
ATTGCTGCGC TTTTAGCGCA AGGCTTGAGT CCTCTCAGAG CAGCTGCTGC CGGTGCGTTT
GTGCATGGCA GAGCCGGAGA TTTGGCCCTG GCGGATAAAG GTGTTATGGG GTTGATTGCC
GGTGACTTTC TGGATTGCCT GCCCGGAGCT TTAAATATGA TATTAGAAGG TGAAATATAA
 
Protein sequence
MQVVTAGEMA ALDRKAIEQY CIPGIVLMEN AGLQVVKVIV DLLDKQVNGK RVVIFAGKGN 
NGGDGFVIAR HLHNLGAAVD VLLFADPAQI SGDAAINLNI WLKMGQTIVR INDGNGKSPA
ELAVASADIM VDALYGTGFK GVVPEPAASF ILLANNSGKP IVCVDIPSGV EADTGCVRGP
CFKADHTVTF ALPKLGLLLE PGSYYSGKLH IVDISLPAVL LKFRNYGRYL LRDELVSEWL
PRRFTGAHKG DCGRVLILAG SRGMTGAACL TAQAAIRSGA GLVTLGVPEG LHDIMEIKLT
EVMTVPLPET DRKTLSLEAL DQIKALLDRS DVLALGPGLT VHPETVALVQ KVLEDLKIPA
VIDADGLNAL AGQTGILSRI KAPVVLTPHP VEMARLLSIT AGEVLSDRLG SVQKLVEAGG
CIVLLKGSRT LITDGEEIYI NATGNPGMAT GGSGDVLTGV IAALLAQGLS PLRAAAAGAF
VHGRAGDLAL ADKGVMGLIA GDFLDCLPGA LNMILEGEI
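
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such blocks could be joined, translated, and summarized is given below. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons, which this sketch ignores); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; assumed to
# match the elongation codon assignments of translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocks):
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def translate(dna):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCAGGTTG TTACGGCAGG GGAAATGGCT GCTCTTGACA GGAAGGCTAT TGAGCAGTAT"
print(translate(clean(first_line)))  # MQVVTAGEMAALDRKAIEQY
```

The translated fragment matches the first twenty residues of the protein sequence listed in the record. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.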