Gene Cthe_2554 details


Gene Information

Locus tag: Cthe_2554
Symbol:
ID: 4809161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3023268
End bp: 3024521
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 37%
IMG OID: 640107969
Product: glycosyl transferase, group 1
Protein accession: YP_001038948
Protein GI: 125975038
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
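
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3023268
end_bp = 3024521

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not encode a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1254 417
```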


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAGG TTGTGCACAT AACTGCACAT TTAGGCGGAG GAATTGGAAG AATACTTTCG 
AGTATAGCTA TATATTCGCA AATTGAAAAA GAAGTTGAAC ATGTTATTGT TACGCTGGAA
AAAACCGAGA ATTCTCATTT TGAACAGTTA CTGAAAGAAC ACAGCATAAA AGTTTTCCTG
CAAAACCAAT GCTGTCTAAA ACAAATATTA CAAGAGGCTG ACATTGTCGA AGTGGATTGG
TGGCATCATC CTCTAACCTC TGCGTTTATG CATAATTATT TTAATGATAT AGAATGCAGA
TTGCTGATTT GGAGTCATGT TTCAGGCTGT ACCTATCCGT ATATAAAATA TGAACTTATT
AAATGTGCAG ATAAATTTGT GTTTTCCACT CCCTTTTCAT TTGAAAATGA GTATTGGTCA
AATGAAGAAA AAGAAGAAGT CATGAAAAGA GTAGAAATAA TAGTAAGCTC TGGAATAGAT
TTTGATGCTC CTGTAAAGAA AAAGCCACAT CACGGTTATA ATGTCGGCTA TATAGGTTTT
TTAAGCTATT CCAAAACACA CCCTGATTTT GTAAGGTTTC TGGAAGCTGC TGCCGACATC
CCTGACATAT GTTTTAAAGT AGTAGGTGAT ACAGCATACG GGAAAGAATT GATTAAAGAT
GTTCAAAATT CCAAGCTTGT ACGCAACAAG GTTATATTTG AAGGTTACGC CCTGGATGTG
AAAGAAAAGT TTGCTGAATT TGATGTATTT GGGTATCCCC TAAATCCAAT GCACTACGGA
ACTGCTGAAA ATGCGTTGCT TGAAGCCATG GCGGCAGGAG TTGTTCCGGT TGTTCTGAAT
CAATGTACCG AAAAATACAT GGTAAGGCAT ATGGAAACAG GAATAATAGT AAACAGTATC
GAAGAATACG GAACTGCATT AAGGTGGCTA AAAGATAATG CGGACAAGAG AATCCACATG
GGCAATAACG CTTCTGAATT TGTCATTAAA AAGCTCCATA TTCGCGAAAC TGTCAACAGA
TTAAATGCCT GTTATTCTGA TATGATGAGC CAGAATAAAA GGCTGCATGA TATATACTCT
GCAATAGGAA CTAATCCATA TGAATGGTTT GTTAGTGCTT ATTGGGGTGA TGTCAATTGT
TTGGAAGGCA ATTCGTTTGC CGAAACAAAG GGCTCTGCCA AGCACTATTT AAGGTACTTT
CCGGAAGACA AAATATTAAG AAAGGTTGTG GAAACTAATG AAAGCCGAAT TTAA
 
Protein sequence
MIKVVHITAH LGGGIGRILS SIAIYSQIEK EVEHVIVTLE KTENSHFEQL LKEHSIKVFL 
QNQCCLKQIL QEADIVEVDW WHHPLTSAFM HNYFNDIECR LLIWSHVSGC TYPYIKYELI
KCADKFVFST PFSFENEYWS NEEKEEVMKR VEIIVSSGID FDAPVKKKPH HGYNVGYIGF
LSYSKTHPDF VRFLEAAADI PDICFKVVGD TAYGKELIKD VQNSKLVRNK VIFEGYALDV
KEKFAEFDVF GYPLNPMHYG TAENALLEAM AAGVVPVVLN QCTEKYMVRH METGIIVNSI
EEYGTALRWL KDNADKRIHM GNNASEFVIK KLHIRETVNR LNACYSDMMS QNKRLHDIYS
AIGTNPYEWF VSAYWGDVNC LEGNSFAETK GSAKHYLRYF PEDKILRKVV ETNESRI
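
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence and computing its GC content. The following is a minimal Python sketch, not the tool that produced this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two tables differ in which start codons they allow), and it assumes the sequence is in frame from its first base.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino-acid assignments are those of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGATAAAGG TTGTGCACAT AACTGCACAT TTAGGCGGAG GAATTGGAAG AATACTTTCG"
print(translate(first_line))  # MIKVVHITAHLGGGIGRILS
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the protein sequence and GC content reported in this record.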