Gene Cthe_2629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2629 
SymbolglmU 
ID4808940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3107356 
End bp3108759 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content41% 
IMG OID640108042 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_001039021 
Protein GI125975111 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAGGA AACTTTTAAT GGAATGCTTG ATGGCAGTCA TTCTTGCCGC CGGTGAAGGA 
AAAAGAATGA AGTCCAAAAA AGCAAAGGTT GTGCATGAAA TTCAGGGTAT ACCTTTGGTT
GAATGGGTTT ATAGATCGGT GAAAAACGCA GGAATAGACG AAGTTGTACT TGTTGTGGGG
CATAAAGCGG AAGAAGTAAA AGAAAAAATG GGGGACAAAG TCCTTTACGC TTTTCAGGAA
AAACAATTGG GAACGGGGCA TGCGCTTATG CAGGCCCAGG AGTACCTGAA GGATAAAGAC
GGTTATGTTG TGGTACTCTA CGGAGATACG CCACTGATTA CTTCAAAAAC TATTTCCGAC
ACAATTAATT ATCACAGGGA ACAGGCAAAC TCAGCCACAA TTATTACCGC AGTTCTTAAC
AATCCGGACG GATATGGCAG AATAGTAAGA AGCGGCGACG GCAGTGTCAG AAAAATTGTT
GAACACAAGG ATGCTTCTTT GGAGGAAAGG AATATAAAGG AAATCAATTC AGGGATATAC
TGTTTTAATA TAAGAGATTT GACAGAAGCA TTAAAAGAGC TTGACAACAA CAACAGCCAG
GGAGAGTATT ACCTTACGGA TACTATTGAG ATACTCATAA ACAAAGGAAA AAAAGTCGGC
GCAATAAAAG TTGAGGACAG CAGTGAGATA TTGGGCATAA ATGACAGGGT GCAGCTTGCT
GAGGCAGGCA GGATAATCAG AAGCCGGATT CTGAAGAGAC ATATGAAAAA CGGTGTGACC
ATAATTGACC CTGATTCAAC GTATATTGAT GAGGACGTGG AAATAGGTAT TGACACGGTG
GTTTACCCTT CAACAATTAT TGAAGGAAAG ACAAAAATAG GCGAGGATTG TATAATAGGT
CCCGGAAGCA GGCTTGTAAA CGCCCAAATT TCGGACAGGG TGGAAGTAAA AAATTCCGTT
GTATTGGAAA GCTCCATAGA CAATGATACG AAAGTCGGGC CTTTTGCATA TGTAAGACCG
GGAAGTGTTA TAGGTAAAAA TGTTAAGATT GGTGATTTTG TTGAAATTAA AAAGTCTGTA
ATAGGAGACA AGACAAAAAT ATCTCATCTT ACTTATGTGG GAGATGCCGA AGTCGGAAAA
AATGTCAACC TTGGATGCGG AGTTGTAGTG GTAAACTATG ACGGAAAGAA AAAGAACAAG
ACGATTATTG GAGATAATGC ATTTGTAGGC TGCAATGTAA ATCTGATTTC ACCGGTTGAA
GTTAAGGACA ACGCGTATGT GGCTGCTGGT TCCACGATTA CGGAAGAAGT GCCGGAATAC
TCTCTTGCCA TTGCCAGAAG CCGGCAGACA ATCAAGGAGG ACTGGGTTAT AAAAAAGGGA
ATGTTAAGGC AGGAGAAAGA ATAG
 
Protein sequence
MRRKLLMECL MAVILAAGEG KRMKSKKAKV VHEIQGIPLV EWVYRSVKNA GIDEVVLVVG 
HKAEEVKEKM GDKVLYAFQE KQLGTGHALM QAQEYLKDKD GYVVVLYGDT PLITSKTISD
TINYHREQAN SATIITAVLN NPDGYGRIVR SGDGSVRKIV EHKDASLEER NIKEINSGIY
CFNIRDLTEA LKELDNNNSQ GEYYLTDTIE ILINKGKKVG AIKVEDSSEI LGINDRVQLA
EAGRIIRSRI LKRHMKNGVT IIDPDSTYID EDVEIGIDTV VYPSTIIEGK TKIGEDCIIG
PGSRLVNAQI SDRVEVKNSV VLESSIDNDT KVGPFAYVRP GSVIGKNVKI GDFVEIKKSV
IGDKTKISHL TYVGDAEVGK NVNLGCGVVV VNYDGKKKNK TIIGDNAFVG CNVNLISPVE
VKDNAYVAAG STITEEVPEY SLAIARSRQT IKEDWVIKKG MLRQEKE