Gene Cthe_1905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1905 
Symbol 
ID4810763 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2262463 
End bp2263695 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content39% 
IMG OID640107322 
Productglycosyl transferase family protein 
Protein accessionYP_001038317 
Protein GI125974407 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTAATA TAGTAATTAC AACTCATTGG ACCGATGGAG ATGTACTTCC GTTTATAAAA 
ATCGGAAGTG AACTTAGAAA AAGGGGACAC AGGGTAACGC TTATCACACA TTGTTATTAT
AAAAACATGG CAAAAAGTCA AGGGCTGGAT TTTGAGGCAT GGGATTCTCC GGAACAACAT
AGGCAGATGT TGGAAGATAT GAAAGAAGAA TTGGACCCGT TAAGAAATCC ATGTGCACTT
GAAAGATATA AAAAAAAGTA TGAAAGTATT GAGGTACGGC TTCGTGAATT TAAAAAAGTG
ATTGCACATT GCAATACAAC GGATACTGTC TTGGTAGCAA AAAACCGTTC CAGTGTTGCA
GCATATCTTG CAGCGGAAAA ACTCAATATT CCTTTGGTAT GTGTATTTAT GGCTCCAAGC
GAAATGTTAA GCATGGTTAG TTATGAAATG ATGCTTGGCA AATTATTGGC AAATGAGTTG
AATTTGTTGC GAAAAGAGCT AAACCTCGCA CCGGTAAAGA GCTGGCTGGC GTGGCAAAGC
AGTCCCAAAC GGCAAATTGC TCTGTGGCCT GATTGGTTTG CCGAGCCAAT TGAAGAATGG
CCCGCAGAAG TGATAAATGT AGGTTTTCCA CTGTCATATA ATGAGAGATT TGATAATTTA
CCTCCAGATT TAATGGAAGA TTTGCTTGGA GACGAGCCAC CTGTTGTTAT AACCGGAGGC
ACCAGTAAAA CGATTCGTCC GGAGTTTTAT CCCTTATGTG TTGAGACTTG CAGACTTTCC
GGACGTAAGG GTATTTTGGT AACACGGTAT GAGGAATTAT TACCCAAAGA GCTTCCCGAT
AAAGTAAAGT GGTTTAGAGA ACTTCCTTTA AATAAAATAT TTCCATATAC ATCAGCAGTA
ATTCATCATG GAGGCATGGG AACATTGAGT GGAGCAATTG CGGCAGGAGT GCCACAATTG
GTGCTTCCGT ATTATCTTGA CAGGCCTTAT AATGCTTTAT GCTTAAAAAA ACTTGGTATT
GCCGAATATC TACCCCCTAT AAAATGGAAA CCTGAAATTA TGGTTGATGC ACTTCAAAAA
ATTACGGCTT CTTCCTTTAG AGAACGCTGT AAATTATTCT CAAAAAAAGT ATCTCTTCAA
AACACTATGA ATGAAATCTG CTGTTTAATT GAAGAGACTG TTAATAATGA GGAATTTTTG
CTTAAGGACA TTAGTTTATT ACAGACAACG TGA
 
Protein sequence
MANIVITTHW TDGDVLPFIK IGSELRKRGH RVTLITHCYY KNMAKSQGLD FEAWDSPEQH 
RQMLEDMKEE LDPLRNPCAL ERYKKKYESI EVRLREFKKV IAHCNTTDTV LVAKNRSSVA
AYLAAEKLNI PLVCVFMAPS EMLSMVSYEM MLGKLLANEL NLLRKELNLA PVKSWLAWQS
SPKRQIALWP DWFAEPIEEW PAEVINVGFP LSYNERFDNL PPDLMEDLLG DEPPVVITGG
TSKTIRPEFY PLCVETCRLS GRKGILVTRY EELLPKELPD KVKWFRELPL NKIFPYTSAV
IHHGGMGTLS GAIAAGVPQL VLPYYLDRPY NALCLKKLGI AEYLPPIKWK PEIMVDALQK
ITASSFRERC KLFSKKVSLQ NTMNEICCLI EETVNNEEFL LKDISLLQTT