Gene Cthe_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3094 
Symbol 
ID4809720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3649007 
End bp3649981 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content40% 
IMG OID640108522 
Productglycosyl transferase family protein 
Protein accessionYP_001039482 
Protein GI125975572 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000830791 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGAGA AAATTGTATG TTCAGTTGTT GTACCTCTTT ACAACGAAGA AGAGGTAATT 
CTGGAAACCT ACAAAAGGCT TAAAAATGTA ATGGAATCAC TGAATGAGCC TTATGAAATA
ATATTTGTCA ACGATGGAAG CAAAGACAGA ACAGGAATTA TCGCCAATGA AATATGTAAC
AAGGATAAAA CCGTAAAGCT GGTGGACTTT GCAAGAAATT TCGGCCATCA AACCGCAATA
ACCGCCGGAA TGGATTATTC TGAAGGTGAA GCCATAGTTG TCATTGATGC AGACCTTCAG
GACCCTCCGG AGCTTATTCC CAAAATGATT GAAAAATGGC GCGAAGGCTA TGATGTTGTA
TACGGAAAAA GAAAAGAAAG AAAAGGCGAA ACTTTCTTTA AAAAGTTTAC GGCAAAGGTA
TTCTACCGCT TCTTAAGAAG AATGACCGAT GTAGATATTC CTGTTGACAC AGGCGACTTC
AGACTCATAG ACCGGAAGGT CTGTGAAGCT CTCAAGCTGG TTAATGAGCG CAACAGATAT
ATACGCGGCA TAATAAGCTG GCTTGGCTTT AAACAAACAG GAATTGAGTT TGTAAGGGAA
AAACGCTTTG CCGGCGAAAC AAAGTATCCC TTAAAGAAAA TGCTGAAGTT TGCCGCCGAT
GCCATTACAT CATTCTCCTA TAAACCTTTG AAGCTGGCGT CATACTTTGG TATGCTGCTC
TCATTTTGCA GTTTCGTATA TCTGCTTGTG GTTATCTGGA TGAAGCTTTT TACGGACCAT
GTACAACAAG GTTGGGCGTC AACCGTCGCA ATCAACCTCT TTTTCCACGG CATTACTCTT
ATCATTTTAG GTATCATGGG AGAGTATATA GGAAGAATTT ATGACGAAGC CAAAGGAAGA
CCTTTATATA TCGTAAAACA GACCAGAAAC TTCTCTGAAG ACAAAACCGA CAAGATAACC
ATAAGAAAAA AATAA
 
Protein sequence
MSEKIVCSVV VPLYNEEEVI LETYKRLKNV MESLNEPYEI IFVNDGSKDR TGIIANEICN 
KDKTVKLVDF ARNFGHQTAI TAGMDYSEGE AIVVIDADLQ DPPELIPKMI EKWREGYDVV
YGKRKERKGE TFFKKFTAKV FYRFLRRMTD VDIPVDTGDF RLIDRKVCEA LKLVNERNRY
IRGIISWLGF KQTGIEFVRE KRFAGETKYP LKKMLKFAAD AITSFSYKPL KLASYFGMLL
SFCSFVYLLV VIWMKLFTDH VQQGWASTVA INLFFHGITL IILGIMGEYI GRIYDEAKGR
PLYIVKQTRN FSEDKTDKIT IRKK