Gene Information |
Locus tag | Moth_0846 |
Symbol | |
ID | 3831543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 879550 |
End bp | 880806 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828776 |
Product | UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
Protein accession | YP_429706 |
Protein GI | 83589697 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase |
TIGRFAM ID | [TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase |
|
|
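The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the 1257 bp stated in the record) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Moth_0846.
length_bp = gene_length(879550, 880806)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the sketch reproduces the listed Gene Length (1257 bp) and Protein Length (418 aa).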
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0834214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGAGGTTC TAGTCATCAA TGGGGGCCAG CGCCTGGAGG GGGTAGTGGT TGTCCAGGGG GCAAAGAATG CCGCATTGCC CATCATGGCA GCCACCCTCC TGGCCACCGG GGAATGCCGC CTCCAGCGGG TACCCCGCCT GCAGGATGTC AGTGTCATGG CAGCTGTTAT TCGTTCCCTG GGGATGAAGG TGGAGCACCG GGGCGAGGAA CTGCTGGTGG CTCCGGAGTC CACGGTAACG CCGGAAGTAC CGGCAGAATT GATGCGGCAA CTGCGGGCTT CCAATCTGGT TATGGGCCCC CTGCTGGGGC GGTGGCACTA CTTCCGGGTA CCCTATCCAG GGGGCTGCGC TATTGGCTCC CGGCCCATGG ACCTGCACAT CAAGGGTTTG ATGGCCATGG GGGCTGAAGT GACGGAAAAG CTGGGCTATA TCGAGGCCCG GACCACCGGG CTCCGGGGAA CCAGCTTCTA CCTGGATTTT CCCAGCGTCG GGGCGACGGA AAACCTGATG ATGGCCGCCG CCCTGGCGGA AGGGGTGACA ACCCTGTATA ATGCGGCCCG GGAACCGGAG ATTGTCGACC TGCAAAATTT CCTGAACGCT ATGGGAGCCA GGATTCGCGG CGCCGGCCGG GATACCATCC GCATCGAAGG GGTGAGGGAA CTCAAGGGAT GCGATTATAA GATAATTCCC GACCGGATCG AAGCCGGTAC CTTCCTGGCC GCAGCAGCGG CCACCGGAGG CGATGTCCTG GTCCAGGATT GCCAGCCCGA GCATCTCATG GCCGTCCTGG CAAAACTGCG GGAAATGGGG GCCAGGATAA TTATAAAAAA AGAGGCCATC CGTATCCAGG GCCCGGGGAG GCCAAGGGCC GTCGATTGTA AGACCCTCCC TTACCCCGGT TTTCCCACCG ATATGCAACC CCAGTTTATG GCCCTTATGA GTGTAGCCGA CGGTACGAGT ATTATGGTTG AAAGCATTTT TGAAAACAGA TTTAAACATG CGGCTGAATT AAGGCGCCTG GGAGCCGATA TAAAAATAGA AGGACGGGTA GCAGTGATCA ACGGCGTTCC AGGCTTAAGC GGGAGTATGG TGGAAGCCTC CGACCTGAGG GCCGGGGCGG CCCTGGTGAT CGCCGGCCTC CTGGCCGAAG GGCAGACCCT GGTGGAGGGC GTCCACCACC TGGATCGCGG TTACGAACAA TTGGAGAAAC GCCTGGGCGA CCTGGGAGCC GATATTCAGC GCCTGAATAA GAAGTGA
|
Protein sequence | MEVLVINGGQ RLEGVVVVQG AKNAALPIMA ATLLATGECR LQRVPRLQDV SVMAAVIRSL GMKVEHRGEE LLVAPESTVT PEVPAELMRQ LRASNLVMGP LLGRWHYFRV PYPGGCAIGS RPMDLHIKGL MAMGAEVTEK LGYIEARTTG LRGTSFYLDF PSVGATENLM MAAALAEGVT TLYNAAREPE IVDLQNFLNA MGARIRGAGR DTIRIEGVRE LKGCDYKIIP DRIEAGTFLA AAAATGGDVL VQDCQPEHLM AVLAKLREMG ARIIIKKEAI RIQGPGRPRA VDCKTLPYPG FPTDMQPQFM ALMSVADGTS IMVESIFENR FKHAAELRRL GADIKIEGRV AVINGVPGLS GSMVEASDLR AGAALVIAGL LAEGQTLVEG VHHLDRGYEQ LEKRLGDLGA DIQRLNKK
|
| |
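The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this with the first and last codons copied from the gene sequence above. It is a hedged example, not the pipeline used to produce this record: the helper names are hypothetical, the codon table is the standard one that table 11 shares for elongation, and the start codon set is the one listed for table 11 by NCBI.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Initiation codons for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate a DNA fragment codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS_TABLE_11:
            protein.append("M")  # alternative start codons initiate with Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks and last 15 bases of the gene sequence above.
print(translate("TTGGAGGTTC TAGTCATCAA TGGGGGCCAG"))
print(translate("CTGAATAAGAAGTGA", is_cds_start=False))
```

The first call yields the leading residues of the protein sequence, and the second yields its final residues, ending at the TGA stop codon. The same gc_content helper can be applied to the full gene sequence to compare against the listed GC content.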