Gene Dtox_0201 details


Gene Information

Locus tag: Dtox_0201
Symbol:
ID: 8427125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 219699
End bp: 220661
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 43%
IMG OID: 645032588
Product: 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
Protein accession: YP_003189777
Protein GI: 258513555
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
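
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(219699, 220661)
print(length)                     # 963
print(protein_length_aa(length))  # 320
```

Both values agree with the Gene Length and Protein Length fields of this record.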


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.114439
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTTGG TTATACCCGC TTATGCTAAA ATCAATCTTT GTTTGGATGT GCTGGGAAGA 
AGGGATGACG GTTATCATGA GGTAGAGATG GTTATGCAAT CTATTTCTCT GCATGATTTG
CTGGAGCTGT CCCTCTCGGA AGAGCAGGAA AATAATAATA TGGGCAAGAT TATTTTGACT
GTTGCAGGTG CTGATTTGCC TGTTGGTGAG GAGAATCTGG TATTCAGGAC GGCCCGCATA
TTGCAGGAGT ATACGGGATG CCGGTTGGGC TGCTCAATAC TTCTGCATAA AAAGATACCG
GTTGCTGCCG GTCTTGCCGG TGGGTCCGCT GATGCTGCTG CGGCACTGCT GGGTCTTAAT
AAGTTATGGA ATTTGGATTT AACTGTTGCA GAACTGTATG CTTTAGCAGC TAAAATTGGT
TCTGATGTAC CTTTTTGTAT CAAAGGCGGT ACAGTGCTGG CAAAAGGAAG AGGCGAGCAG
TTGGCTTTTC TGGAAGCCGC ACCCGATATG GGAATTATTT TAGTTAAACC TGCTTATGGA
ATATCTACCG GGGAGGTTTA TAGCAAGCTG AATAGCGCCG TTTATCCTCA AGTTATTAAT
ACGATGCAAA AAAAAGATAT TACTAATGAT ACTAATGATA TCCATAACAT GTTATGCCTT
TCGGATTTGG GACCGCCGGT ACTAAGAATG ATTAAAGCCA TAAAAAGCAG GCAATTGCCT
GCTGTATGTA AGGCTTTATA TAATATTTTG GAGGAACCGG CAATGAAAAT GCACCCGAAC
CTTTTAGATA TAAAAAACAT ACTATTTGAA CAAGGAGCGA TGGGTGTTTT AATGTCCGGC
AGCGGATCGA CAATTTTTGG CATCACTCCT GATTTAGAGG CCGCACATCT GCTGTCTAAG
GGCCTGAGTC CTTCGCTTGG ATCTATTTAT GCGGTGAAAT TGCAGGGAGC GAGAGAAGTA
TGA
 
Protein sequence
MPLVIPAYAK INLCLDVLGR RDDGYHEVEM VMQSISLHDL LELSLSEEQE NNNMGKIILT 
VAGADLPVGE ENLVFRTARI LQEYTGCRLG CSILLHKKIP VAAGLAGGSA DAAAALLGLN
KLWNLDLTVA ELYALAAKIG SDVPFCIKGG TVLAKGRGEQ LAFLEAAPDM GIILVKPAYG
ISTGEVYSKL NSAVYPQVIN TMQKKDITND TNDIHNMLCL SDLGPPVLRM IKAIKSRQLP
AVCKALYNIL EEPAMKMHPN LLDIKNILFE QGAMGVLMSG SGSTIFGITP DLEAAHLLSK
GLSPSLGSIY AVKLQGAREV
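
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method the database used: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it does not handle alternative start codons or ambiguous bases. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 uses
# these same codon-to-amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 up to the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Assumes the sequence starts with ATG and contains only A, C, G, T.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCTTTGG TTATACCCGC TTATGCTAAA"))  # MPLVIPAYAK
```

The ten residues produced match the first block of the protein sequence shown above.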