Gene Cthe_0186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0186 
Symbol 
ID4808674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp224385 
End bp225449 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content44% 
IMG OID640105597 
ProductUDP-galactose 4-epimerase 
Protein accessionYP_001036620 
Protein GI125972710 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.347724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAC AAGGTGCATA TGGCTTTGTC ATCTTGGGTG GCATAATAAA CAATAGAGGA 
AAGGTTGTGT TTGAAGTGGC AGTATTGGTT ACAGGAGGAG CAGGTTATAT TGGAAGCCAC
ACAGTGGCGG AACTTGTAGA AAAGAAGGAA GAGGTAATAG TTGTCGATAA CCTTGAAAAA
GGCCACAGGG ATGCTGTGGC AGGAGCGAAA CTTATTGTAG GTGATTTAAG GGATAAAGAA
TTTGTGAAAA AAGTATTTTT GGAAAACGAT ATTGAAGCGG TTATCCATTT TGCGGCTTAT
ATTGAAGTAG GCGAAAGCGT CCAAAATCCC TTAAAATATT ATAACAACAA TGTTATCGCA
ACTTTAAACC TTCTTACGGC AATGGAAGAG GCAAAAGTCG ACAAAATTGT ATTTTCTTCC
ACGGCGGCAA CCTATGGTGA GCCGGAAAAC ATACCGATCT TGGAGACTGA CAGAACCCTT
CCCACCAATC CGTACGGTGA AACCAAGCTG GCTGTTGAAA AGGCTCTTAA GTGGTGTGAC
AGAGCTTATG GTATTAAATA CATTGCCTTG AGGTATTTTA ATGCCAGCGG TGCCCATGAA
AGCGGAGAAA TAGGCGAGGA CCATTCTCCC GAAAGCCATT TGATTCCCCT TGTTATCCAG
GCAGCCCTGG GTAAAAGGGA ATCCATAAAG ATATTCGGAA ATGATTATAA TACTCCGGAC
GGAACATGTA TAAGGGATTA CATACACGTT TCCGACCTTG CAAACGCCCA CTATCTTGCG
TTGCAAAGGC TCAGAGAAGG CAAGGAAAGC GCGGTTTACA ATCTTGGAAA CGGAAAAGGT
TTTTCCGTAA AAGAGGTTAT TGATGTGGTA CGAAAAGTCA CGGGAAGACC GATAAAAGTT
GAAGACGCTC CGAGAAGACC CGGAGACCCG GCAGTACTGG TTGCTTCATC GGAAAAAATC
AAAAAGGAGC TGAACTGGCA GCCACGCATG GCTGATCTCG AGACAATTGT AAGCACTGCG
TGGAAATGGC ACTTATCCCA TCCGAACGGC TATAATGACA AATAA
 
Protein sequence
MQKQGAYGFV ILGGIINNRG KVVFEVAVLV TGGAGYIGSH TVAELVEKKE EVIVVDNLEK 
GHRDAVAGAK LIVGDLRDKE FVKKVFLEND IEAVIHFAAY IEVGESVQNP LKYYNNNVIA
TLNLLTAMEE AKVDKIVFSS TAATYGEPEN IPILETDRTL PTNPYGETKL AVEKALKWCD
RAYGIKYIAL RYFNASGAHE SGEIGEDHSP ESHLIPLVIQ AALGKRESIK IFGNDYNTPD
GTCIRDYIHV SDLANAHYLA LQRLREGKES AVYNLGNGKG FSVKEVIDVV RKVTGRPIKV
EDAPRRPGDP AVLVASSEKI KKELNWQPRM ADLETIVSTA WKWHLSHPNG YNDK