Gene Cthe_1906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1906 
Symbol 
ID4810764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2263726 
End bp2264937 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content38% 
IMG OID640107323 
Productglycosyl transferase family protein 
Protein accessionYP_001038318 
Protein GI125974408 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCAATA TATTAATTGC CACGCATTGG ACGGGCGGCG ATGTATATCC GTTTATAAGA 
ATAGGAAAAG CTTTAAAGAG ACGAGGACAT GATGTAACTA TATTTACTCA TTGTATATAT
AAAAATATTG TAGAACAGGA CGGAATGAAG TTTGTACCAT GGGACAGCCC GGATGAATAT
GAACAGTTGA TGAATGACCT GCCGCTTTTA GTGGATCCTC TGCGCAATTT AGATGGAATG
CTTACATTTT ACGGCAGGCA TCATAACAAT GAAAAAACTC TTATGGAATA TAGAAAGATA
TCAGAGTATT GTTCAAAAAA AGATACTGTA ATATTGGCAA GGCATCGTTC AGCGATTTCC
GCCCTCCTTG CAGCAGAAAA ATTTAATATA CCAGTTGTAA GTGTGTTTTT AGCTCCAAAT
TACATTTCCC ATTTGCAAAT ACATGAAGAA ATATTTGGAG ATATTATGAA GAAAACTGTC
AATGAAATTC GCAAAGCATT AAATCTAAAG CCGATAGAAT GCTGGACATC TTGGATATGT
TCTCCAAAAC GAAAACTGGG ACTGTGGCCT GAATGGTTCG CACATCCTGA TGAAACATGG
CCTTCTGGAT TGATTTGTGT AGGTTTTTAT GTGGAAGAAG CTGGAGATAA AGAAGAATTA
CCCCCTGAAA TTGTGGAAAT GCTGAATGGA GATTCAAAAC CCATTTTAAT TACAGCCGGC
ACAAGTAAAA TGATTAGACC GGAGTTTTAT GAGGTTGCTT CTGAAGCATG CAGAATACTT
GGCAAAACCG GAATTCTGGT GACACTTTAT GATGAATTGG TTCCCAAACC GTTACCTGAT
AATGTAAAAC GATTTCAAAA GTTATCAATT AGAAGCTTGT TGCCGCATGT GGATGCGGTT
ATTCACCATG GAGGCATTGG AACGACGAGT GAAGCAACAG CGGCAGGCAT TCCTCAATTG
ATATTACCTC ATTTGACTGA CGGACCTGAT AATGCACATC GGTTAAGGGG ATTGGGAATT
GCGGAATTGT TGCCTCCATT AAGGTGGAAA CCGCATTTAT TGGCGGCAAA ATTAACAACA
TTAATGAGTC AGGATTATAG AAGTCGTTGC TTAAAATTTT CCCAATATAT CAGGCAGGAA
GATTCAGAAA GCAACATATG CAGAGCAATT GAACAAGTAA TCGGGAATAA TGATTTTTTA
ATATCGAATT AG
 
Protein sequence
MANILIATHW TGGDVYPFIR IGKALKRRGH DVTIFTHCIY KNIVEQDGMK FVPWDSPDEY 
EQLMNDLPLL VDPLRNLDGM LTFYGRHHNN EKTLMEYRKI SEYCSKKDTV ILARHRSAIS
ALLAAEKFNI PVVSVFLAPN YISHLQIHEE IFGDIMKKTV NEIRKALNLK PIECWTSWIC
SPKRKLGLWP EWFAHPDETW PSGLICVGFY VEEAGDKEEL PPEIVEMLNG DSKPILITAG
TSKMIRPEFY EVASEACRIL GKTGILVTLY DELVPKPLPD NVKRFQKLSI RSLLPHVDAV
IHHGGIGTTS EATAAGIPQL ILPHLTDGPD NAHRLRGLGI AELLPPLRWK PHLLAAKLTT
LMSQDYRSRC LKFSQYIRQE DSESNICRAI EQVIGNNDFL ISN