Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2375 |
Symbol | |
ID | 3832014 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2500395 |
End bp | 2501654 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637830294 |
Product | UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
Protein accession | YP_431200 |
Protein GI | 83591191 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase |
TIGRFAM ID | [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 50 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000845863 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGGAGGCTA TAGCTATTCA GGGCCGGGCG CGGTTGCGTG GCCGAGTGGC CATCAGCGGC TCCAAGAATG CCGTCCTGCC GATTATTGCG GCCTGCCTGC TGACTGGTGA CGAGTGCTAT CTGGAAGATA TCCCCCGGCT GGCTGATGTT GATACCATGT GCGGAGTCAT CGGTGAACTG GGCGCCAGGG TTTATCCTGA GGGGATAAAT GGCCTGCGCA TCAGTTCCGG CTTTTTAGAA AAGGTGGAAC CCCCCTACGA ATACGTCCGC CGCATGCGGG CCTCCTTTCT GGTCATGGGC CCCCTCTTGG CCCGTTTTGG GCGGGTAAAG GTTTCTCTGC CGGGAGGGTG CGCCATCGGT GCCCGACCCA TTGACCTGCA CCTGAAGGGT ATGGCCGCCC TGGGGGCCAA GATTACTGTT AATAAAGGTA ACGTTGAGGC CGAGGCTGGC AGCCGCCTGA AGGGAGCCCA GGTATACCTG GATTTCCCCA GTGTCGGAGC GACGGAGAAT ATCATGATGG CCGCCGCCCT GGCCGAGGGG ACTACCACCA TCGAAAACGC CGCCGGCGAA CCGGAGATCG TTGACCTGGC CAACTTCATC AACGCCATGG GCGGCCGGGT CACAGGCGCC GGGACCAGGG TTATCAAAAT CGAGGGGGTA AAGGAACTCC ACGGCAGCCG TCATGCCGTT ATCCCCGACC GGGTTGAAGC CGGCACCTTT ATGATTGCCG CCGCCGCCAC CGGCGGGGAT GTTTTGGTGG AAAATGTAAT CCCCACCCAC CTGAAGGCGG TCATGGCCAA ACTGGCCGAA ACCGGCGCCC GGCTGGAAGA GGAAGAGGGC GGCATTCGGG TGCGGGCCGA TCTACCCTTA AAGGCAGTTG ACATCAAGAC CATGCCCTAT CCCGGCTTCC CGACGGACAT GCAGGCCCAG TTCATGGCCC TCCTGACCAC CGCCCGGGGG AGCAGCATGG TCACAGAGAC CGTCTTTGAA AACCGCTTTA TGCACGTCAA CGAGCTGAAA CGCATGGGGG CCGATATAGT CATAGAAGGC CATTGCGCCG TGATTAAAGG CAAGAGCAAG CTCATCGGGG CGCCCGTCAA GGCTACCGAC CTGCGGGCCG GGGCGGCCCT GGTCATTGCC GGCCTCATGG CTGAGGGAGA AACAACCATC TCCTGCGTCC ACCATATTGA CCGCGGCTAC GAAAACCTGG TCGGCAAGCT CCAGGCCCTG GGCGCTGAAG TTATCAGAAC AGAAATTTAG
|
Protein sequence | MEAIAIQGRA RLRGRVAISG SKNAVLPIIA ACLLTGDECY LEDIPRLADV DTMCGVIGEL GARVYPEGIN GLRISSGFLE KVEPPYEYVR RMRASFLVMG PLLARFGRVK VSLPGGCAIG ARPIDLHLKG MAALGAKITV NKGNVEAEAG SRLKGAQVYL DFPSVGATEN IMMAAALAEG TTTIENAAGE PEIVDLANFI NAMGGRVTGA GTRVIKIEGV KELHGSRHAV IPDRVEAGTF MIAAAATGGD VLVENVIPTH LKAVMAKLAE TGARLEEEEG GIRVRADLPL KAVDIKTMPY PGFPTDMQAQ FMALLTTARG SSMVTETVFE NRFMHVNELK RMGADIVIEG HCAVIKGKSK LIGAPVKATD LRAGAALVIA GLMAEGETTI SCVHHIDRGY ENLVGKLQAL GAEVIRTEI
|
| |