Gene Moth_2375 details


Gene Information

Locus tag: Moth_2375
Symbol:
ID: 3832014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2500395
End bp: 2501654
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 61%
IMG OID: 637830294
Product: UDP-N-acetylglucosamine 1-carboxyvinyltransferase
Protein accession: YP_431200
Protein GI: 83591191
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase
TIGRFAM ID: [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase
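
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the variable names are only illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2500395
end_bp = 2501654
gene_length_bp = 1260
protein_length_aa = 419

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the protein has (number of codons - 1) residues.
codons, remainder = divmod(gene_length_bp, 3)
expected_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a whole number of codons:", remainder == 0)
print("codon count matches protein length:", expected_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the values listed in the record.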


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000845863
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGGAGGCTA TAGCTATTCA GGGCCGGGCG CGGTTGCGTG GCCGAGTGGC CATCAGCGGC 
TCCAAGAATG CCGTCCTGCC GATTATTGCG GCCTGCCTGC TGACTGGTGA CGAGTGCTAT
CTGGAAGATA TCCCCCGGCT GGCTGATGTT GATACCATGT GCGGAGTCAT CGGTGAACTG
GGCGCCAGGG TTTATCCTGA GGGGATAAAT GGCCTGCGCA TCAGTTCCGG CTTTTTAGAA
AAGGTGGAAC CCCCCTACGA ATACGTCCGC CGCATGCGGG CCTCCTTTCT GGTCATGGGC
CCCCTCTTGG CCCGTTTTGG GCGGGTAAAG GTTTCTCTGC CGGGAGGGTG CGCCATCGGT
GCCCGACCCA TTGACCTGCA CCTGAAGGGT ATGGCCGCCC TGGGGGCCAA GATTACTGTT
AATAAAGGTA ACGTTGAGGC CGAGGCTGGC AGCCGCCTGA AGGGAGCCCA GGTATACCTG
GATTTCCCCA GTGTCGGAGC GACGGAGAAT ATCATGATGG CCGCCGCCCT GGCCGAGGGG
ACTACCACCA TCGAAAACGC CGCCGGCGAA CCGGAGATCG TTGACCTGGC CAACTTCATC
AACGCCATGG GCGGCCGGGT CACAGGCGCC GGGACCAGGG TTATCAAAAT CGAGGGGGTA
AAGGAACTCC ACGGCAGCCG TCATGCCGTT ATCCCCGACC GGGTTGAAGC CGGCACCTTT
ATGATTGCCG CCGCCGCCAC CGGCGGGGAT GTTTTGGTGG AAAATGTAAT CCCCACCCAC
CTGAAGGCGG TCATGGCCAA ACTGGCCGAA ACCGGCGCCC GGCTGGAAGA GGAAGAGGGC
GGCATTCGGG TGCGGGCCGA TCTACCCTTA AAGGCAGTTG ACATCAAGAC CATGCCCTAT
CCCGGCTTCC CGACGGACAT GCAGGCCCAG TTCATGGCCC TCCTGACCAC CGCCCGGGGG
AGCAGCATGG TCACAGAGAC CGTCTTTGAA AACCGCTTTA TGCACGTCAA CGAGCTGAAA
CGCATGGGGG CCGATATAGT CATAGAAGGC CATTGCGCCG TGATTAAAGG CAAGAGCAAG
CTCATCGGGG CGCCCGTCAA GGCTACCGAC CTGCGGGCCG GGGCGGCCCT GGTCATTGCC
GGCCTCATGG CTGAGGGAGA AACAACCATC TCCTGCGTCC ACCATATTGA CCGCGGCTAC
GAAAACCTGG TCGGCAAGCT CCAGGCCCTG GGCGCTGAAG TTATCAGAAC AGAAATTTAG
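
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that step; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first three blocks of the sequence are used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Sample input: the first three blocks of the gene sequence in this record.
sample = clean_sequence("TTGGAGGCTA TAGCTATTCA GGGCCGGGCG")
print(len(sample), "bases in sample")
print(f"GC content of sample: {gc_percent(sample):.1f}%")
```

Running the same two helpers on the full pasted sequence gives values that can be compared with the reported gene length (1260 bp) and GC content (61%).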
 
Protein sequence
MEAIAIQGRA RLRGRVAISG SKNAVLPIIA ACLLTGDECY LEDIPRLADV DTMCGVIGEL 
GARVYPEGIN GLRISSGFLE KVEPPYEYVR RMRASFLVMG PLLARFGRVK VSLPGGCAIG
ARPIDLHLKG MAALGAKITV NKGNVEAEAG SRLKGAQVYL DFPSVGATEN IMMAAALAEG
TTTIENAAGE PEIVDLANFI NAMGGRVTGA GTRVIKIEGV KELHGSRHAV IPDRVEAGTF
MIAAAATGGD VLVENVIPTH LKAVMAKLAE TGARLEEEEG GIRVRADLPL KAVDIKTMPY
PGFPTDMQAQ FMALLTTARG SSMVTETVFE NRFMHVNELK RMGADIVIEG HCAVIKGKSK
LIGAPVKATD LRAGAALVIA GLMAEGETTI SCVHHIDRGY ENLVGKLQAL GAEVIRTEI
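
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where several alternative start codons are read as methionine at the initiation position. Below is a minimal sketch of such a translation. The codon table is the standard one in TCAG order and the start codon set is the one listed for NCBI table 11; the function name `translate_cds` is hypothetical, and the code is an illustration rather than the database's own method.

```python
BASES = "TCAG"
# Standard amino acid assignments, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Table 11 uses the same assignments as table 1.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for table 11; at the first position they encode Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First eight codons of the gene sequence in this record.
print(translate_cds("TTGGAGGCTA TAGCTATTCA GGGC"))  # MEAIAIQG
```

The printed residues match the first eight residues of the protein sequence above. Passing the full gene sequence to the same function allows a comparison against the complete 419 aa protein.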