Gene Information |
Locus tag | Moth_0756 |
Symbol | |
ID | 3831469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 792458 |
End bp | 793471 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828687 |
Product | N-acetylneuraminate synthase |
Protein accession | YP_429617 |
Protein GI | 83589608 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2089] Sialic acid synthase |
TIGRFAM ID | [TIGR03569] N-acetylneuraminate synthase |
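
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a protein length that excludes the terminal stop codon. Both assumptions agree with the numbers listed in this record, but the record does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming the CDS ends with one uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Moth_0756.
length_bp = gene_length(792458, 793471)
print(length_bp)                  # 1014, matches "Gene Length"
print(protein_length(length_bp))  # 337, matches "Protein Length"
```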
|
|
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.556038 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCAATA GGGTTTTCAT CATTGCCGAA GCGGGCGTCA ACCATAATGG CGATTTGCAG CTGGCCAGGA AACTGGTAGA CGCGGCGGTA GAAGCGGGGG CAGACGCTGT AAAGTTTCAG ACTTTCAAGG CCGAAGAAGT GGCCACCCCC GGCGCCGAGC GGGCCCAATA TCAAAAAGAT AATATGCCCG GAAAAGACGA AAGCCAGCTG GAGATGATTA AACGGCTGGA ATTGAGCTAC GCCCAATTTC GGGAACTTTA TGCTTATTGC CGGCAGAAGG GGATTATATT TCTTTCCTCT CCCTTCGATC AGGAAAGCAT TGATTTTCTG GCCGAACTGG GAGTGCCTTA TTTTAAAATC CCTTCTGGAG AAATAACCAA CTATCCCTTC TTGCGTCGGA TCGGCGGGAA AAAGCGGCCG GTTATCCTTT CTACCGGCAT GGCGACCCTG GGTGAAGTGG AAGGTGCGTT GCGGGTTTTG CGGGAAGCCG GGGCGAGCGA CATAACCCTG CTGCACTGCA CCACCAGCTA CCCGGCTCCG CCGGAAGAGG TGAATTTAAG GGCCATGCTT ACCATGAAGC ATGCCTTTGC CCTACCGGTG GGCTATTCCG ATCACACCGA GGGCATCGCC GTACCCATTG CGGCAGCGGC CCTGGGGGCA GAAGTGATCG AGAAACACTT AACCGTGGAC CGCAACCTTC CCGGCCCTGA CCACCGCGCC TCCCTGGAAC CGGGAGAATT TAAAGAAATG GTCGTGGCCA TCCGCCAGGT AGAAAAAAGC CTGGGGGACG GCATCAAACG GCCCGCGCCG GGCGAGCTGG CCGTCATGCC GGCGGCCAGG CGCAGCCTGG TGGCAGCCAG GGACATAGCC GCCGGGGAAA TAATCACGGA CTCCTGCCTG ACCGCTAAAA GGCCGGGGAC GGGCATCCCG CCGAATTTGT GGGATGTGGT GGTGGGCCGG CAGGCCCGCC GGGATATTGC CGCAGGTAGT ATTTTAAGCT GGGATATGAT TTGA
|
Protein sequence | MFNRVFIIAE AGVNHNGDLQ LARKLVDAAV EAGADAVKFQ TFKAEEVATP GAERAQYQKD NMPGKDESQL EMIKRLELSY AQFRELYAYC RQKGIIFLSS PFDQESIDFL AELGVPYFKI PSGEITNYPF LRRIGGKKRP VILSTGMATL GEVEGALRVL REAGASDITL LHCTTSYPAP PEEVNLRAML TMKHAFALPV GYSDHTEGIA VPIAAAALGA EVIEKHLTVD RNLPGPDHRA SLEPGEFKEM VVAIRQVEKS LGDGIKRPAP GELAVMPAAR RSLVAARDIA AGEIITDSCL TAKRPGTGIP PNLWDVVVGR QARRDIAAGS ILSWDMI
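
The sequences above are printed in space-separated blocks of ten. A minimal sketch of how they might be cleaned and checked against the record follows. It assumes that the codon assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, not in codon meanings). The helper names are hypothetical, and only the first 30 bases of the gene are embedded as an example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
prefix = clean_sequence("ATGTTCAATA GGGTTTTCAT CATTGCCGAA")
print(translate(prefix))  # MFNRVFIIAE, the first block of the protein sequence
```

Pasting the full "Gene sequence" string into `clean_sequence` would allow the same comparison for the whole protein, and `gc_content` could then be compared with the "GC content" field.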