Gene Information |
Locus tag | Cthe_2629 |
Symbol | glmU |
ID | 4808940 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3107356 |
End bp | 3108759 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640108042 |
Product | bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase |
Protein accession | YP_001039021 |
Protein GI | 125975111 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) |
TIGRFAM ID | [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase |
|
|
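The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length(3107356, 3108759)
print(length)                           # 1404
print(expected_protein_length(length))  # 467
```

Both values agree with the Gene Length (1404 bp) and Protein Length (467 aa) fields in the record.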
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAGGA AACTTTTAAT GGAATGCTTG ATGGCAGTCA TTCTTGCCGC CGGTGAAGGA AAAAGAATGA AGTCCAAAAA AGCAAAGGTT GTGCATGAAA TTCAGGGTAT ACCTTTGGTT GAATGGGTTT ATAGATCGGT GAAAAACGCA GGAATAGACG AAGTTGTACT TGTTGTGGGG CATAAAGCGG AAGAAGTAAA AGAAAAAATG GGGGACAAAG TCCTTTACGC TTTTCAGGAA AAACAATTGG GAACGGGGCA TGCGCTTATG CAGGCCCAGG AGTACCTGAA GGATAAAGAC GGTTATGTTG TGGTACTCTA CGGAGATACG CCACTGATTA CTTCAAAAAC TATTTCCGAC ACAATTAATT ATCACAGGGA ACAGGCAAAC TCAGCCACAA TTATTACCGC AGTTCTTAAC AATCCGGACG GATATGGCAG AATAGTAAGA AGCGGCGACG GCAGTGTCAG AAAAATTGTT GAACACAAGG ATGCTTCTTT GGAGGAAAGG AATATAAAGG AAATCAATTC AGGGATATAC TGTTTTAATA TAAGAGATTT GACAGAAGCA TTAAAAGAGC TTGACAACAA CAACAGCCAG GGAGAGTATT ACCTTACGGA TACTATTGAG ATACTCATAA ACAAAGGAAA AAAAGTCGGC GCAATAAAAG TTGAGGACAG CAGTGAGATA TTGGGCATAA ATGACAGGGT GCAGCTTGCT GAGGCAGGCA GGATAATCAG AAGCCGGATT CTGAAGAGAC ATATGAAAAA CGGTGTGACC ATAATTGACC CTGATTCAAC GTATATTGAT GAGGACGTGG AAATAGGTAT TGACACGGTG GTTTACCCTT CAACAATTAT TGAAGGAAAG ACAAAAATAG GCGAGGATTG TATAATAGGT CCCGGAAGCA GGCTTGTAAA CGCCCAAATT TCGGACAGGG TGGAAGTAAA AAATTCCGTT GTATTGGAAA GCTCCATAGA CAATGATACG AAAGTCGGGC CTTTTGCATA TGTAAGACCG GGAAGTGTTA TAGGTAAAAA TGTTAAGATT GGTGATTTTG TTGAAATTAA AAAGTCTGTA ATAGGAGACA AGACAAAAAT ATCTCATCTT ACTTATGTGG GAGATGCCGA AGTCGGAAAA AATGTCAACC TTGGATGCGG AGTTGTAGTG GTAAACTATG ACGGAAAGAA AAAGAACAAG ACGATTATTG GAGATAATGC ATTTGTAGGC TGCAATGTAA ATCTGATTTC ACCGGTTGAA GTTAAGGACA ACGCGTATGT GGCTGCTGGT TCCACGATTA CGGAAGAAGT GCCGGAATAC TCTCTTGCCA TTGCCAGAAG CCGGCAGACA ATCAAGGAGG ACTGGGTTAT AAAAAAGGGA ATGTTAAGGC AGGAGAAAGA ATAG
|
Protein sequence | MRRKLLMECL MAVILAAGEG KRMKSKKAKV VHEIQGIPLV EWVYRSVKNA GIDEVVLVVG HKAEEVKEKM GDKVLYAFQE KQLGTGHALM QAQEYLKDKD GYVVVLYGDT PLITSKTISD TINYHREQAN SATIITAVLN NPDGYGRIVR SGDGSVRKIV EHKDASLEER NIKEINSGIY CFNIRDLTEA LKELDNNNSQ GEYYLTDTIE ILINKGKKVG AIKVEDSSEI LGINDRVQLA EAGRIIRSRI LKRHMKNGVT IIDPDSTYID EDVEIGIDTV VYPSTIIEGK TKIGEDCIIG PGSRLVNAQI SDRVEVKNSV VLESSIDNDT KVGPFAYVRP GSVIGKNVKI GDFVEIKKSV IGDKTKISHL TYVGDAEVGK NVNLGCGVVV VNYDGKKKNK TIIGDNAFVG CNVNLISPVE VKDNAYVAAG STITEEVPEY SLAIARSRQT IKEDWVIKKG MLRQEKE
|
| |
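The gene and protein sequences above can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch that assumes the amino-acid assignments of translation table 11 (identical to the standard code for internal codons; alternative start codons are not handled) and that the sequence contains only A, C, G and T. The helper names `clean`, `gc_content` and `translate` are hypothetical.

```python
# Codon table in NCBI order: first, second and third base each cycle T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises ValueError on any base other than A, C, G or T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        residue = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 24 bases of the gene sequence in this record.
print(translate("ATGAGGAGGA AACTTTTAAT GGAATGCTTG"))  # MRRKLLMECL
print(translate("ATGTTAAGGC AGGAGAAAGA ATAG"))        # MLRQEKE
```

The two fragments reproduce the first ten and last seven residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the reported 41%.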