Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1906 |
Symbol | |
ID | 4810764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2263726 |
End bp | 2264937 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640107323 |
Product | glycosyl transferase family protein |
Protein accession | YP_001038318 |
Protein GI | 125974408 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAATA TATTAATTGC CACGCATTGG ACGGGCGGCG ATGTATATCC GTTTATAAGA ATAGGAAAAG CTTTAAAGAG ACGAGGACAT GATGTAACTA TATTTACTCA TTGTATATAT AAAAATATTG TAGAACAGGA CGGAATGAAG TTTGTACCAT GGGACAGCCC GGATGAATAT GAACAGTTGA TGAATGACCT GCCGCTTTTA GTGGATCCTC TGCGCAATTT AGATGGAATG CTTACATTTT ACGGCAGGCA TCATAACAAT GAAAAAACTC TTATGGAATA TAGAAAGATA TCAGAGTATT GTTCAAAAAA AGATACTGTA ATATTGGCAA GGCATCGTTC AGCGATTTCC GCCCTCCTTG CAGCAGAAAA ATTTAATATA CCAGTTGTAA GTGTGTTTTT AGCTCCAAAT TACATTTCCC ATTTGCAAAT ACATGAAGAA ATATTTGGAG ATATTATGAA GAAAACTGTC AATGAAATTC GCAAAGCATT AAATCTAAAG CCGATAGAAT GCTGGACATC TTGGATATGT TCTCCAAAAC GAAAACTGGG ACTGTGGCCT GAATGGTTCG CACATCCTGA TGAAACATGG CCTTCTGGAT TGATTTGTGT AGGTTTTTAT GTGGAAGAAG CTGGAGATAA AGAAGAATTA CCCCCTGAAA TTGTGGAAAT GCTGAATGGA GATTCAAAAC CCATTTTAAT TACAGCCGGC ACAAGTAAAA TGATTAGACC GGAGTTTTAT GAGGTTGCTT CTGAAGCATG CAGAATACTT GGCAAAACCG GAATTCTGGT GACACTTTAT GATGAATTGG TTCCCAAACC GTTACCTGAT AATGTAAAAC GATTTCAAAA GTTATCAATT AGAAGCTTGT TGCCGCATGT GGATGCGGTT ATTCACCATG GAGGCATTGG AACGACGAGT GAAGCAACAG CGGCAGGCAT TCCTCAATTG ATATTACCTC ATTTGACTGA CGGACCTGAT AATGCACATC GGTTAAGGGG ATTGGGAATT GCGGAATTGT TGCCTCCATT AAGGTGGAAA CCGCATTTAT TGGCGGCAAA ATTAACAACA TTAATGAGTC AGGATTATAG AAGTCGTTGC TTAAAATTTT CCCAATATAT CAGGCAGGAA GATTCAGAAA GCAACATATG CAGAGCAATT GAACAAGTAA TCGGGAATAA TGATTTTTTA ATATCGAATT AG
|
Protein sequence | MANILIATHW TGGDVYPFIR IGKALKRRGH DVTIFTHCIY KNIVEQDGMK FVPWDSPDEY EQLMNDLPLL VDPLRNLDGM LTFYGRHHNN EKTLMEYRKI SEYCSKKDTV ILARHRSAIS ALLAAEKFNI PVVSVFLAPN YISHLQIHEE IFGDIMKKTV NEIRKALNLK PIECWTSWIC SPKRKLGLWP EWFAHPDETW PSGLICVGFY VEEAGDKEEL PPEIVEMLNG DSKPILITAG TSKMIRPEFY EVASEACRIL GKTGILVTLY DELVPKPLPD NVKRFQKLSI RSLLPHVDAV IHHGGIGTTS EATAAGIPQL ILPHLTDGPD NAHRLRGLGI AELLPPLRWK PHLLAAKLTT LMSQDYRSRC LKFSQYIRQE DSESNICRAI EQVIGNNDFL ISN
|
| |