Gene Cthe_2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2601 
Symbol 
ID4809023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3071699 
End bp3072853 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content44% 
IMG OID640108015 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_001038994 
Protein GI125975084 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAGAC TTAAGGTAAT GACTGTTTTT GGCACCAGGC CTGAAGCCAT AAAAATGGCG 
CCTTTGGTTA CTGAACTGAA AAAGTGCGAT CAAATTGAGA CCGTTGTTTG TGTTACGGCG
CAGCACAGGC AAATGCTTGA TCAGGTACTG GAAATATTTA ATATAAATGC GGATTATGAC
CTGGATATAA TGAAAGACAA GCAAACATTA ATAGATATAA CCACACGGGC TTTGGAACGT
TTAAGTGTCA TACTTGACAA AACAAAGCCC GATATTGTGT TGGTGCACGG GGATACCACC
ACCACTTTTG TGGGAAGTCT GGCCGCGTTT TATAAAAAAA TCAGCGTGGG ACATGTGGAG
GCAGGGCTTC GCACCTATGA CAAATACTTT CCCTACCCCG AGGAAATAAA CAGGCGTCTT
ACCGGCGTAA TAGCCGACCT TCATTTTGCG CCGACCAGGA CCAACAGGGA TAATCTTGTG
CGGGAAGGCG TGGATGAAAG CAAAATATAT ATAACCGGCA ACACTGTAAT TGACGCACTG
AAAACTACGG TGGTGGAAAA TTACGATTTT GCAAATGAGG GCCTTAAAAA ACTGGACTTT
AAAAAGAGAA TTATAACCGT CACGGCGCAT AGAAGGGAAA ATCTCGGCGA GCCTCTGCAC
AATATTTGCG AGGCTCTAAA GCATATAGCG GATCGGTATG ATGACATAGA GATAGTGTAC
CCCGTTCATC TGAATCCCGC GGTGCAGGAA GTGGCAAAAA AGATTCTCGG AAGCCATGAG
AGGGTGCATT TGATTGATCC TCTGGATGTG CAGGATATGC ACAATCTGAT GGCAAGGTCA
TATCTTATAA TGACGGATTC CGGAGGGCTT CAGGAAGAAG CACCGTCGCT GGGCAAGCCT
GTACTGGTAT TGAGAAATGA GACGGAAAGA CCGGAGGCGG TAAAAGCCGG TACGGTCAAG
CTTGCGGGAA CTGAAAAAGA GAACATTATA CGTTTGACTG AGGAACTTTT GGACAACAAA
ACGGAATATG ACAAGATGGC AAAAGCGGTT AACCCTTACG GAGACGGTTT TGCTTCCGAA
AGAATTGTTA AGGCGCTTCT ATTTGAGTTT GGATTGTCAA AAACAAAGCC TGAGGGATTT
GATGTAAAAA TATAA
 
Protein sequence
MKRLKVMTVF GTRPEAIKMA PLVTELKKCD QIETVVCVTA QHRQMLDQVL EIFNINADYD 
LDIMKDKQTL IDITTRALER LSVILDKTKP DIVLVHGDTT TTFVGSLAAF YKKISVGHVE
AGLRTYDKYF PYPEEINRRL TGVIADLHFA PTRTNRDNLV REGVDESKIY ITGNTVIDAL
KTTVVENYDF ANEGLKKLDF KKRIITVTAH RRENLGEPLH NICEALKHIA DRYDDIEIVY
PVHLNPAVQE VAKKILGSHE RVHLIDPLDV QDMHNLMARS YLIMTDSGGL QEEAPSLGKP
VLVLRNETER PEAVKAGTVK LAGTEKENII RLTEELLDNK TEYDKMAKAV NPYGDGFASE
RIVKALLFEF GLSKTKPEGF DVKI