Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2229 |
Symbol | |
ID | 4811094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2656764 |
End bp | 2657816 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107635 |
Product | N-acetylneuraminate synthase |
Protein accession | YP_001038624 |
Protein GI | 125974714 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2089] Sialic acid synthase |
TIGRFAM ID | [TIGR03586] pseudaminic acid synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAGCA GAAGAATTAA AATAGGGAAT CGGGAAATTG GTGAAGGATG TCCCTGTTAT ATTATTGCGG AAATGTCTGC GAACCATGCA GGAGATTTAG GCAAAGCAAT CGAAATAATA CATGCGGCGA AAGAAGCCGG GGCGGATTGC ATAAAAATAC AGACTTATAC ACCGGATACC ATGACCATAA ACTGTGATAA AAAATATTTT CATATAAATG ACGGAACCTG GAAAGGTGAA AATTTATACG GTTTGTACCA GAAGGCAAAT ACTCCGTGGG AATGGCACGC ACGGCTCAAA GAAGAAGCCC AAAAAGCAGG AATTGATTTT TTTTCAACTC CGTTTGACAA ATCGGCTGTG GATTTTTTGG AGGATTTGGG AGTTGAATTT TATAAAATAG CTTCTTTTGA GGTTGTAGAT ATTCCCTTGA TAAAATATAT TGCATCAAAG AAAAAACCTA TTATTATGTC AACAGGTATG GCAACTTTGG GTGAAATTGA AGAAGCGGTG GAAACCATTA GGTCACAGGG TAATGACAAC TTCTGCCTTT TAAAATGTTC CAGTGCTTAT CCGGCCGTTC CGGAACAAAT GAATTTGAAA ACCATAGCTC ATTTGAAAGA GACATTTAAT GTTCCCGTGG GATTGTCGGA TCATTCTTTG GGTTCGGTTT CTGCAGTTGT GGCGGTTGCC ATGGGGGCAA GCATTATTGA AAAGCATTTT TGTCTGAGCA GGGAAATTAA AAGTCCCGAT TCGTCCTTTT CAATGGAACC GGATGAATTC AAAAAGATGG TCGAAGATAT AAGAGCTGCG GAAAAGTCAA TAGGGAAGGT AAGCTACAGT ATTTCAGAAA ATGAAGCCGT AAGCCGCAGT CACAGAAGAT CGATTTTTGT TGTGAAAGAT ATAAAAAAGG GTGAAGCTTT TACAGAAGAG AACATAAGGA TAATAAGACC GGCAGACGGT TTGGAGCCAA AATACTTTGA GCAGGTTTTA AACCGGAGGG CGTCGCAGGA TATTGAGAGG GGGACACCGC TGAAGTGGAC AATGATAAGC TGA
|
Protein sequence | MQSRRIKIGN REIGEGCPCY IIAEMSANHA GDLGKAIEII HAAKEAGADC IKIQTYTPDT MTINCDKKYF HINDGTWKGE NLYGLYQKAN TPWEWHARLK EEAQKAGIDF FSTPFDKSAV DFLEDLGVEF YKIASFEVVD IPLIKYIASK KKPIIMSTGM ATLGEIEEAV ETIRSQGNDN FCLLKCSSAY PAVPEQMNLK TIAHLKETFN VPVGLSDHSL GSVSAVVAVA MGASIIEKHF CLSREIKSPD SSFSMEPDEF KKMVEDIRAA EKSIGKVSYS ISENEAVSRS HRRSIFVVKD IKKGEAFTEE NIRIIRPADG LEPKYFEQVL NRRASQDIER GTPLKWTMIS
|
| |