Gene Cthe_2637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2637 
Symbol 
ID4808948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3118798 
End bp3119835 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content40% 
IMG OID640108050 
Productglycosyl transferase, group 1 
Protein accessionYP_001039029 
Protein GI125975119 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGCA GGGGCCATAA TACAATTTTT GCGTCCAGCG GAGGAGTTCT TGCAGAAGAA 
ATAAAAAAAG AATTCAGGCA CATTGACATA CCCACACTGG CAGTAAATAA AAGGGACCTT
ATCTCAACAA TAAAAAACAT CATAAAGATA AGAAAAATAC TCAGGGAAGA AAATATAGAC
ATTATACACG GCCACAATGC CGCAGCGGCA TTTACCGCAT ATCTGGCGTC AAAAACCATA
AACAGGAAAG TGGCAATCAC CCATAGTGTC AGAGGAATGG AAATCCGGAA AGGCTATCAG
TGGAGGAACT TTATATACAG ACTTTATCCC GCCACTTTTT TTGCCGTGTC CGATTTTACC
AGACAAATGC TGATTAAAGC GGGTGTAAAA GAGAACAGAA TTATAAATAC CTATAATGGA
GTGGATATTG GGAAATTTGA CGTGTCAAAA TGGAACAAAA ACGCTTTCAG AGACGAAATT
GGCGTTTCAA AAGACACTGT TCTTGTCGGT ACTGTGGGAA GAGTCAATTA CAACAAGGGG
CAGGAAGTTC TTATAAAAGC TATCCCACAT ATTCTTAAGA AAACATCAAA TTTCAAAGTC
GTAATAGTCG GAGACGGAGA GAAGCTGGAA GCTTGCAAAA CACTTGCAAA AGATTTGGGC
GTGGAGGAAT TTGTGCATTT TACCGGATTC AGAAGAGACA TACCCAATAT TCAGGCAGCC
CTGGACATAT ATACTCTTGC TTCGGTTAAA GGTGAAATGT TTCCAAATTC CATACTTGAA
GCAATGGCCA TGGGAAATCC CTGGGTTGCC AGCAACCTCA GCGGTATCCC GGAAATATCG
GAAAACGGCA GAAATGGATT TTTGTCAGAG CCGAACAACT GCGAAGATCT TGCGGACAAA
TTAAGTAAAT TGATTATGAA TGAAAGCTTA AGAAAAGAAA TGGGTGAAAA CTGCATTAAA
ACCATTTACG AAAAGTACAC CATAGAAAAA GTATGCGATG CGATAGAATA CGGATATCTG
AGTGCTTTAG AACAATAA
 
Protein sequence
MKRRGHNTIF ASSGGVLAEE IKKEFRHIDI PTLAVNKRDL ISTIKNIIKI RKILREENID 
IIHGHNAAAA FTAYLASKTI NRKVAITHSV RGMEIRKGYQ WRNFIYRLYP ATFFAVSDFT
RQMLIKAGVK ENRIINTYNG VDIGKFDVSK WNKNAFRDEI GVSKDTVLVG TVGRVNYNKG
QEVLIKAIPH ILKKTSNFKV VIVGDGEKLE ACKTLAKDLG VEEFVHFTGF RRDIPNIQAA
LDIYTLASVK GEMFPNSILE AMAMGNPWVA SNLSGIPEIS ENGRNGFLSE PNNCEDLADK
LSKLIMNESL RKEMGENCIK TIYEKYTIEK VCDAIEYGYL SALEQ