Gene Moth_0843 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0843 
SymbolmurG 
ID3831540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp875996 
End bp877111 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content61% 
IMG OID637828773 
Productundecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase 
Protein accessionYP_429703 
Protein GI83589694 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase 
TIGRFAM ID[TIGR01133] undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGGGTGA TAATTACCGG CGGCGGTACC GGGGGCCATG TTTACCCGGC CCTGGCCATT 
GCTCGCGGCC TTAAAGAGGC CAGGCCGGGG GTAGAGTTAC TGTATATCGG GACGGCCAGG
GGTCTGGAAG CTGACGTGGT ACCCCGGGCT GGCCTGACCC TGGCCACCAT TACCGTCCAG
GGGCTGGTGC GACGGCAAGT ATGGAAGAAC ATTCCCGCCC TGGTGAAGAC CGGCCGGGGG
CTTGGCGAGG CCTGGCAGCA GGTGCGCCGT TTTCGACCAG ACGTAGTAGT CGGCACCGGT
GGCTATGTCA GCGGCCCGGT GTGCCTGGCT GCCGCCCTCC AGGGCGTACC GGTAATCCTC
CATGAACAGA ATGCCTTTCC GGGTGTTACC AATCGGCTGC TGGCGATCCT GGCTCGCTGC
GTCTGCCTGA CCTTTCCCGA GGCAGCCTCC CGTTTCCCTC GCCGGGCAAA ACTGGTTACC
ACCGGGCTAC CGGTACGGCC GGAGATAATC CAGGCGGACC GGGATTCATG CCGGCAGCAT
TTCGGCCTGC GGCCGGAGCA ACTCTTCCTG GTAACTGTTG GTGGCAGCCA GGGGGCCAGG
AGTATTAACG GGGCCATGTT ACCTATTTTG AAGGAACTGG CCGGGTGCCA GGATGTCAGC
CTTCTCCAGG TAACAGGACG CCGGGACTAT GAGGCTTATT TACAGCAGGT GCGCACCCAG
GGAATAGATC TGGCTAAATA TGGCAACATT ACCATTGAAC CCTATGTCTA TAACCTGGAG
CAGGCCCTGG CTGCAGCCGA CCTGGTCATC GGCCGGGCCG GGGCCTCCTT TTTAGCCGAA
GTACTGGCCC GGGGTCTGCC GTCCGTCCTG GTTCCCTATC CCCATGCGGC AGCCAATCAT
CAGGAGTATA ATGCCCGGGC CGTGGCCCGG CAGGGGGCGG CCGTGGTGGT CCTGGACCGG
GAACTAAAAG GAGGGCGGCT TTACCAGGTT GTATTCGAAC TCCTGAGATC AAGGGAAAAG
CTAAAGGCCA TGGCGGCTGC CGCCGCTTCA TTAGGTCGTC CCGGAGCCCT GGAGGCTATT
ATCCAGGTTA TCTTGAAAAC GGTCGAATCA GGTTAG
 
Protein sequence
MRVIITGGGT GGHVYPALAI ARGLKEARPG VELLYIGTAR GLEADVVPRA GLTLATITVQ 
GLVRRQVWKN IPALVKTGRG LGEAWQQVRR FRPDVVVGTG GYVSGPVCLA AALQGVPVIL
HEQNAFPGVT NRLLAILARC VCLTFPEAAS RFPRRAKLVT TGLPVRPEII QADRDSCRQH
FGLRPEQLFL VTVGGSQGAR SINGAMLPIL KELAGCQDVS LLQVTGRRDY EAYLQQVRTQ
GIDLAKYGNI TIEPYVYNLE QALAAADLVI GRAGASFLAE VLARGLPSVL VPYPHAAANH
QEYNARAVAR QGAAVVVLDR ELKGGRLYQV VFELLRSREK LKAMAAAAAS LGRPGALEAI
IQVILKTVES G