Gene Cthe_2640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2640 
Symbol 
ID4808951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3121893 
End bp3123068 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content40% 
IMG OID640108053 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_001039032 
Protein GI125975122 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase
[TIGR03568] UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG TAATAAGTGT TTTTACAGCC ACAAGAGCCG AGTATGGTTT GCTAAAGCCC 
ATAATAAATA AGTTGAATAA AATAAAGGAA TTTGACGTAA GGATTGTAGC AACCGGTGCG
CATCTTTCGC CGGAGTTTGG GCTTACCTAC AAAGAAATTG AAAAAGACGG ATTTCATATA
GATGAAAAAA TAGAAATTTT GCTAAGTGCG GATACACCGT CTGCAATATC CAAATCAATG
GGCCTTGTTT TGATTGGATT TGCAGATTAT TTTAAAAGGA TTAATCCCGA TTTGTTGATT
GTCCTTGGAG ACAGATATGA AACCCTTGCA GTTTCCATGG CGGCGGTAAA TCAAAGAATT
CCCATTGCCC ATCTTTACGG CGGCGAATCG ACGGAAGGGG CTGTTGACGA GTCAATCCGC
CATGCCATAA CCAAACTGAG CTATCTTCAT TTTACAAGTA CGGAAACTTA CCGGAAAAGA
GTCATACAAC TGGGGGAACA TCCCGACCGG GTGTTCAATG TGGGGGCCAT TGGCATAGAA
AATATATTAA ATGAAAAACT CCTGTCAAAA GATGAATTGG AAAAAGAATT AAAGATAGAT
TTAAGTAAGC CTTATGCAAT GGCATGTTTT CATCCGGTAA CCCTGGAAGA AAACACTTCC
GAAAAGCACA TTACTGCTTT GCTTGAAGCA TGCAAGGCAT ATAAGAATAT GAATTTCATA
TTTACCAAAA CCAATGCCGA CACCGACGGG CGCATTATAA ACCGGCTTAT TGACAAATAT
GCAGAGGAAA ATGACAACAT TACTGCTTTT ACCTCACTGG GCACGGTTAA TTACTTAAGT
GTCCTGAAAC ACAGTGCCAT GATAATAGGC AATTCCTCAA GCGGGCTGCT GGAAGCGCCC
AGTTTTGGCA TTCCGACAAT AAATATCGGC GAACGCCAGA AAGGAAGAAT ACAAGCCACC
AGTGTCATAA ATTGCAACCC AAACGAGGAA GAAATAAAAC AGGCAATTAA AAAGGCTTTG
TCGGATTCAT TCATCAAACA GGCAAAAGAA ACAGTAAATC CTTACGGAGA CGGAAACACT
TCCGAGAGAA TTATTGAAGT AATTAAAGAA TATATGCTGG GCGAAAAAAT CAATCTTAAA
AAAGAATTTT ACGACGTTGA GGTTGTCGGA ATATGA
 
Protein sequence
MKKVISVFTA TRAEYGLLKP IINKLNKIKE FDVRIVATGA HLSPEFGLTY KEIEKDGFHI 
DEKIEILLSA DTPSAISKSM GLVLIGFADY FKRINPDLLI VLGDRYETLA VSMAAVNQRI
PIAHLYGGES TEGAVDESIR HAITKLSYLH FTSTETYRKR VIQLGEHPDR VFNVGAIGIE
NILNEKLLSK DELEKELKID LSKPYAMACF HPVTLEENTS EKHITALLEA CKAYKNMNFI
FTKTNADTDG RIINRLIDKY AEENDNITAF TSLGTVNYLS VLKHSAMIIG NSSSGLLEAP
SFGIPTINIG ERQKGRIQAT SVINCNPNEE EIKQAIKKAL SDSFIKQAKE TVNPYGDGNT
SERIIEVIKE YMLGEKINLK KEFYDVEVVG I