Gene Information |
Locus tag | Cthe_1905 |
Symbol | |
ID | 4810763 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2262463 |
End bp | 2263695 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640107322 |
Product | glycosyl transferase family protein |
Protein accession | YP_001038317 |
Protein GI | 125974407 |
COG category | [C] Energy production and conversion; [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
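The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2262463
END_BP = 2263695
GENE_LENGTH_BP = 1233
PROTEIN_LENGTH_AA = 410


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span is 1233 bp, which is 411 codons, or 410 residues once the stop codon is excluded.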
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAATA TAGTAATTAC AACTCATTGG ACCGATGGAG ATGTACTTCC GTTTATAAAA ATCGGAAGTG AACTTAGAAA AAGGGGACAC AGGGTAACGC TTATCACACA TTGTTATTAT AAAAACATGG CAAAAAGTCA AGGGCTGGAT TTTGAGGCAT GGGATTCTCC GGAACAACAT AGGCAGATGT TGGAAGATAT GAAAGAAGAA TTGGACCCGT TAAGAAATCC ATGTGCACTT GAAAGATATA AAAAAAAGTA TGAAAGTATT GAGGTACGGC TTCGTGAATT TAAAAAAGTG ATTGCACATT GCAATACAAC GGATACTGTC TTGGTAGCAA AAAACCGTTC CAGTGTTGCA GCATATCTTG CAGCGGAAAA ACTCAATATT CCTTTGGTAT GTGTATTTAT GGCTCCAAGC GAAATGTTAA GCATGGTTAG TTATGAAATG ATGCTTGGCA AATTATTGGC AAATGAGTTG AATTTGTTGC GAAAAGAGCT AAACCTCGCA CCGGTAAAGA GCTGGCTGGC GTGGCAAAGC AGTCCCAAAC GGCAAATTGC TCTGTGGCCT GATTGGTTTG CCGAGCCAAT TGAAGAATGG CCCGCAGAAG TGATAAATGT AGGTTTTCCA CTGTCATATA ATGAGAGATT TGATAATTTA CCTCCAGATT TAATGGAAGA TTTGCTTGGA GACGAGCCAC CTGTTGTTAT AACCGGAGGC ACCAGTAAAA CGATTCGTCC GGAGTTTTAT CCCTTATGTG TTGAGACTTG CAGACTTTCC GGACGTAAGG GTATTTTGGT AACACGGTAT GAGGAATTAT TACCCAAAGA GCTTCCCGAT AAAGTAAAGT GGTTTAGAGA ACTTCCTTTA AATAAAATAT TTCCATATAC ATCAGCAGTA ATTCATCATG GAGGCATGGG AACATTGAGT GGAGCAATTG CGGCAGGAGT GCCACAATTG GTGCTTCCGT ATTATCTTGA CAGGCCTTAT AATGCTTTAT GCTTAAAAAA ACTTGGTATT GCCGAATATC TACCCCCTAT AAAATGGAAA CCTGAAATTA TGGTTGATGC ACTTCAAAAA ATTACGGCTT CTTCCTTTAG AGAACGCTGT AAATTATTCT CAAAAAAAGT ATCTCTTCAA AACACTATGA ATGAAATCTG CTGTTTAATT GAAGAGACTG TTAATAATGA GGAATTTTTG CTTAAGGACA TTAGTTTATT ACAGACAACG TGA
|
Protein sequence | MANIVITTHW TDGDVLPFIK IGSELRKRGH RVTLITHCYY KNMAKSQGLD FEAWDSPEQH RQMLEDMKEE LDPLRNPCAL ERYKKKYESI EVRLREFKKV IAHCNTTDTV LVAKNRSSVA AYLAAEKLNI PLVCVFMAPS EMLSMVSYEM MLGKLLANEL NLLRKELNLA PVKSWLAWQS SPKRQIALWP DWFAEPIEEW PAEVINVGFP LSYNERFDNL PPDLMEDLLG DEPPVVITGG TSKTIRPEFY PLCVETCRLS GRKGILVTRY EELLPKELPD KVKWFRELPL NKIFPYTSAV IHHGGMGTLS GAIAAGVPQL VLPYYLDRPY NALCLKKLGI AEYLPPIKWK PEIMVDALQK ITASSFRERC KLFSKKVSLQ NTMNEICCLI EETVNNEEFL LKDISLLQTT
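The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a field might be joined into one string and its GC fraction computed; the helper names are hypothetical, and the example runs only on the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(spaced: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(spaced.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two blocks of the gene sequence, copied from the Sequence section.
first_blocks = "ATGGCTAATA TAGTAATTAC"
dna = join_blocks(first_blocks)
print(len(dna), gc_fraction(dna))
```

Applied to the whole Gene sequence field, the same two functions should give values comparable to the Gene Length and GC content rows of the record.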