Gene Moth_0846 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0846 
Symbol 
ID3831543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp879550 
End bp880806 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content60% 
IMG OID637828776 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_429706 
Protein GI83589697 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0834214 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGAGGTTC TAGTCATCAA TGGGGGCCAG CGCCTGGAGG GGGTAGTGGT TGTCCAGGGG 
GCAAAGAATG CCGCATTGCC CATCATGGCA GCCACCCTCC TGGCCACCGG GGAATGCCGC
CTCCAGCGGG TACCCCGCCT GCAGGATGTC AGTGTCATGG CAGCTGTTAT TCGTTCCCTG
GGGATGAAGG TGGAGCACCG GGGCGAGGAA CTGCTGGTGG CTCCGGAGTC CACGGTAACG
CCGGAAGTAC CGGCAGAATT GATGCGGCAA CTGCGGGCTT CCAATCTGGT TATGGGCCCC
CTGCTGGGGC GGTGGCACTA CTTCCGGGTA CCCTATCCAG GGGGCTGCGC TATTGGCTCC
CGGCCCATGG ACCTGCACAT CAAGGGTTTG ATGGCCATGG GGGCTGAAGT GACGGAAAAG
CTGGGCTATA TCGAGGCCCG GACCACCGGG CTCCGGGGAA CCAGCTTCTA CCTGGATTTT
CCCAGCGTCG GGGCGACGGA AAACCTGATG ATGGCCGCCG CCCTGGCGGA AGGGGTGACA
ACCCTGTATA ATGCGGCCCG GGAACCGGAG ATTGTCGACC TGCAAAATTT CCTGAACGCT
ATGGGAGCCA GGATTCGCGG CGCCGGCCGG GATACCATCC GCATCGAAGG GGTGAGGGAA
CTCAAGGGAT GCGATTATAA GATAATTCCC GACCGGATCG AAGCCGGTAC CTTCCTGGCC
GCAGCAGCGG CCACCGGAGG CGATGTCCTG GTCCAGGATT GCCAGCCCGA GCATCTCATG
GCCGTCCTGG CAAAACTGCG GGAAATGGGG GCCAGGATAA TTATAAAAAA AGAGGCCATC
CGTATCCAGG GCCCGGGGAG GCCAAGGGCC GTCGATTGTA AGACCCTCCC TTACCCCGGT
TTTCCCACCG ATATGCAACC CCAGTTTATG GCCCTTATGA GTGTAGCCGA CGGTACGAGT
ATTATGGTTG AAAGCATTTT TGAAAACAGA TTTAAACATG CGGCTGAATT AAGGCGCCTG
GGAGCCGATA TAAAAATAGA AGGACGGGTA GCAGTGATCA ACGGCGTTCC AGGCTTAAGC
GGGAGTATGG TGGAAGCCTC CGACCTGAGG GCCGGGGCGG CCCTGGTGAT CGCCGGCCTC
CTGGCCGAAG GGCAGACCCT GGTGGAGGGC GTCCACCACC TGGATCGCGG TTACGAACAA
TTGGAGAAAC GCCTGGGCGA CCTGGGAGCC GATATTCAGC GCCTGAATAA GAAGTGA
 
Protein sequence
MEVLVINGGQ RLEGVVVVQG AKNAALPIMA ATLLATGECR LQRVPRLQDV SVMAAVIRSL 
GMKVEHRGEE LLVAPESTVT PEVPAELMRQ LRASNLVMGP LLGRWHYFRV PYPGGCAIGS
RPMDLHIKGL MAMGAEVTEK LGYIEARTTG LRGTSFYLDF PSVGATENLM MAAALAEGVT
TLYNAAREPE IVDLQNFLNA MGARIRGAGR DTIRIEGVRE LKGCDYKIIP DRIEAGTFLA
AAAATGGDVL VQDCQPEHLM AVLAKLREMG ARIIIKKEAI RIQGPGRPRA VDCKTLPYPG
FPTDMQPQFM ALMSVADGTS IMVESIFENR FKHAAELRRL GADIKIEGRV AVINGVPGLS
GSMVEASDLR AGAALVIAGL LAEGQTLVEG VHHLDRGYEQ LEKRLGDLGA DIQRLNKK