Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0186 |
Symbol | |
ID | 4808674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 224385 |
End bp | 225449 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105597 |
Product | UDP-galactose 4-epimerase |
Protein accession | YP_001036620 |
Protein GI | 125972710 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1087] UDP-glucose 4-epimerase |
TIGRFAM ID | [TIGR01179] UDP-glucose-4-epimerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.347724 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAC AAGGTGCATA TGGCTTTGTC ATCTTGGGTG GCATAATAAA CAATAGAGGA AAGGTTGTGT TTGAAGTGGC AGTATTGGTT ACAGGAGGAG CAGGTTATAT TGGAAGCCAC ACAGTGGCGG AACTTGTAGA AAAGAAGGAA GAGGTAATAG TTGTCGATAA CCTTGAAAAA GGCCACAGGG ATGCTGTGGC AGGAGCGAAA CTTATTGTAG GTGATTTAAG GGATAAAGAA TTTGTGAAAA AAGTATTTTT GGAAAACGAT ATTGAAGCGG TTATCCATTT TGCGGCTTAT ATTGAAGTAG GCGAAAGCGT CCAAAATCCC TTAAAATATT ATAACAACAA TGTTATCGCA ACTTTAAACC TTCTTACGGC AATGGAAGAG GCAAAAGTCG ACAAAATTGT ATTTTCTTCC ACGGCGGCAA CCTATGGTGA GCCGGAAAAC ATACCGATCT TGGAGACTGA CAGAACCCTT CCCACCAATC CGTACGGTGA AACCAAGCTG GCTGTTGAAA AGGCTCTTAA GTGGTGTGAC AGAGCTTATG GTATTAAATA CATTGCCTTG AGGTATTTTA ATGCCAGCGG TGCCCATGAA AGCGGAGAAA TAGGCGAGGA CCATTCTCCC GAAAGCCATT TGATTCCCCT TGTTATCCAG GCAGCCCTGG GTAAAAGGGA ATCCATAAAG ATATTCGGAA ATGATTATAA TACTCCGGAC GGAACATGTA TAAGGGATTA CATACACGTT TCCGACCTTG CAAACGCCCA CTATCTTGCG TTGCAAAGGC TCAGAGAAGG CAAGGAAAGC GCGGTTTACA ATCTTGGAAA CGGAAAAGGT TTTTCCGTAA AAGAGGTTAT TGATGTGGTA CGAAAAGTCA CGGGAAGACC GATAAAAGTT GAAGACGCTC CGAGAAGACC CGGAGACCCG GCAGTACTGG TTGCTTCATC GGAAAAAATC AAAAAGGAGC TGAACTGGCA GCCACGCATG GCTGATCTCG AGACAATTGT AAGCACTGCG TGGAAATGGC ACTTATCCCA TCCGAACGGC TATAATGACA AATAA
|
Protein sequence | MQKQGAYGFV ILGGIINNRG KVVFEVAVLV TGGAGYIGSH TVAELVEKKE EVIVVDNLEK GHRDAVAGAK LIVGDLRDKE FVKKVFLEND IEAVIHFAAY IEVGESVQNP LKYYNNNVIA TLNLLTAMEE AKVDKIVFSS TAATYGEPEN IPILETDRTL PTNPYGETKL AVEKALKWCD RAYGIKYIAL RYFNASGAHE SGEIGEDHSP ESHLIPLVIQ AALGKRESIK IFGNDYNTPD GTCIRDYIHV SDLANAHYLA LQRLREGKES AVYNLGNGKG FSVKEVIDVV RKVTGRPIKV EDAPRRPGDP AVLVASSEKI KKELNWQPRM ADLETIVSTA WKWHLSHPNG YNDK
|
| |