Gene Dtox_0072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtox_0072 
Symbol 
ID8426994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfotomaculum acetoxidans DSM 771 
KingdomBacteria 
Replicon accessionNC_013216 
Strand
Start bp76973 
End bp78229 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content44% 
IMG OID645032467 
ProductUDP-N-acetylglucosamine1- carboxyvinyltransferase 
Protein accessionYP_003189658 
Protein GI258513436 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000298929 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.037429 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAATA TTGCTATCGT AGGGGGGCAA AGGCTTCAGG GAAAAGTAAA AGTCAGCGGA 
GCAAAAAATG CAACTCTTGC GATACTTGGA GCAGCTTTGT TGGCTAATGA AAGTATTATC
CTGGAGAATG TGCCCGACAT AAGTGATGTT AGGATAATGG TAAATATCAT TCGTGATTTG
GGCGGGGAGA TTGATTGGTT GGACAAGGAA GTAATATCTT TTGTTCCGCC TAAAGAAATT
AAAAAATCTC CTATATATAA TAATGTAAAG AAATTGCGCG CCTCCAATTT ATTGCTTGGG
CCTCTATTGG CCAAGTTTGG TTATGCTGAA GTGGCTCTTC CCGGAGGGTG CAATATCGGA
GTGCGGCCTA TGGACTTGCA TTTTAAAGGG TTGGCCGGTC TAGGTGCGGA TTTATATATA
GAGAGAGGTT GTGTCAAAGG ATCTGCTAAG AAACTTGCAG GTGCCAGAAT ATATCTTGAT
TTTCCCAGTG TTGGAGCTAC TGAGAATATA ATGATGGCTG CCTGTCTGGC TGAGGGACAG
ACTATTATTG AAAACGTTGC CAAGGAGCCG GAAATAGTTG ATTTGGCAAA TTTTTTAAAC
AGTCTGGGCG GCAAAGTGCG AGGGGCAGGC ACAGATGTTA TAAAAATAGA AGGAGTAAAA
TCTTTAGATC GCGGTGGTCG CTATGCTGTA ATTCCTGATC GCATTGAGGC CGGAACTTTT
ATGGTAGCTA TTGCGGCGAC AAGGGGTGAT GCGATCCTTG AGGGGGTTAT TCCCAGGCAT
ATTGAGCCTC TTATAGCCAA GTTGCGGGAG GCTAATGTTG AGATAACTGA GGAAGGGGAT
AATCTAAGAG TCAGAGCGGT TAGCCAACTA AATCCCATAG ATATCAAGAC GCTGCCTTAC
CCTGGTTTTC CAACTGATAT GCAGTCGCAG GTAATGACGC TGCTGACAAA TGTGCCGGGA
ACCAGTATAA TTATAGAGAA TATTTTCGAA AATAGATTTC AGATTTCAGA TGAATTAAAG
AGAATGGGAG CTCAAATCAA GGTTGAAGGG CGTATGGCTG TAATTGAGGG TGTTGCATCT
TTACAGGGGA CTGTTGTTAA GGCCTCTGAT TTGCGGGCCG GTGCTGCTTT GGTAATTGCC
GGTTTAATGG CAGAAGGAGT TACCGAAATC ATCAATTCTT TTTACATTGA CAGGGGATAC
CAGGACTTGG AGGATAAATT ATCTTCGCTG GGTGCTAAGA TCTGGAGAAA CGATTGA
 
Protein sequence
MSNIAIVGGQ RLQGKVKVSG AKNATLAILG AALLANESII LENVPDISDV RIMVNIIRDL 
GGEIDWLDKE VISFVPPKEI KKSPIYNNVK KLRASNLLLG PLLAKFGYAE VALPGGCNIG
VRPMDLHFKG LAGLGADLYI ERGCVKGSAK KLAGARIYLD FPSVGATENI MMAACLAEGQ
TIIENVAKEP EIVDLANFLN SLGGKVRGAG TDVIKIEGVK SLDRGGRYAV IPDRIEAGTF
MVAIAATRGD AILEGVIPRH IEPLIAKLRE ANVEITEEGD NLRVRAVSQL NPIDIKTLPY
PGFPTDMQSQ VMTLLTNVPG TSIIIENIFE NRFQISDELK RMGAQIKVEG RMAVIEGVAS
LQGTVVKASD LRAGAALVIA GLMAEGVTEI INSFYIDRGY QDLEDKLSSL GAKIWRND