Gene Cthe_1788 details

Gene Information

Locus tag: Cthe_1788
Symbol:
ID: 4810033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2110040
End bp: 2111296
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 42%
IMG OID: 640107202
Product: glycosyl transferase family protein
Protein accession: YP_001038202
Protein GI: 125974292
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
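
The length fields in the record above follow from the coordinates. The short sketch below shows the arithmetic, assuming that Start bp and End bp are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names gene_length and protein_length are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2110040, 2111296)
print(length, protein_length(length))  # 1257 418
```

Both values agree with the Gene Length and Protein Length fields of the record.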


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTATCAC AATTAAACGG CATTATATAC GGCACAACCC AAATTATGCA GGTGGTAATA 
TTCATAGCGG GATGTTATTT TTTCGGAATA TCAATCTTTG GCTGGATAAA GCGCAGGGAA
ACCAGCCCTA AAGAATATGT TCCCCAAAAA AAGTTTGCTT TGATCGTTGC CGCCCATAAT
GAAGAAGCGG TAATCGGCCA TATAGTTGAC AGCCTCTTTA GGCAAAATTA CCCCCGCAAT
CTTTTTGACG TATATGTAGT TGCCGACAAC TGTACGGACA GAACCGCCGA AATTGCTGAA
GAACACGGCG CAATTGTATA CAAAAGGTAC AACAACTCCG CCCGGGGAAA GGGATATGCC
CTGGAGTGGA TGTTTGAAAA AATATATAAT ATGGAAGAAA AATATGATGC AATAAGTGTA
TTCGATGCCG ACAACCTCGT TTCAGCCAAT TACCTTTTGG AAATGAACAA GCAGCTTTGC
AAGGGCCACA AGGTTGTTCA GGGATATGTT GACAGCAAAA ATCCTTTTGA TTCATGGATT
ACATTGTCAT ACTCCATAGC TTTCTGGCTT TCAAACAGGA TATTCCAGCT TCCCAGGTAC
TATTTGGGCT TGAGCTGCGG TCTTTGCGGC ACGGGCTTTT GCATTTCCGT GGATGTTTTA
AAAGAAATAG GCTGGGGAGC AACATGCCTT ACAGAAGATT TGGAATTTAC AATGAAGCTG
GCCCTGAACA ACTACAAAGT CGCATGGGCT CACAACGCTG TGGTTTATGA CGAAAAACCC
ATTACATTAA AGCAGTCATG GAACCAGCGT AAAAGGTGGA TGCAGGGTCA TGCCGACTGT
GCAAGCCGAT ATTTGGGCCC GTTGTTTAAA AAAGCCTTCA GGGAAGGAGA TTTAATAGCT
TTTGACTGCG CGGTTTACTT GTTTCAGCCC ATAAGGCTGG TTTTTATCGG GCTGATAACA
ATAATGATGT GGATTCAAAC CGTTTTCCCT GAATCCCCTT TTTATAATCT TAAGTATGTT
TTTCCCACAG AAGTATGGTC CGTGTTCGTA ACGCTGCAGT TTCTCTATGG TCCTTTGGTG
GTGCTTTCGG AGAAAAAATT CAATCTCAAG GTGCTTTACG GCTTTTTGAT TTATCCTTTT
TACTGCATTA CGTGGATACC AATCACCATA CAGGGATTCA TGAGTAAAAA CAATAAAGAC
TGGAGCCACA CTCAGCATTC AAGGAAAATA AGCATATCCG ATCTTGAAAA GGCATAA
 
Protein sequence
MLSQLNGIIY GTTQIMQVVI FIAGCYFFGI SIFGWIKRRE TSPKEYVPQK KFALIVAAHN 
EEAVIGHIVD SLFRQNYPRN LFDVYVVADN CTDRTAEIAE EHGAIVYKRY NNSARGKGYA
LEWMFEKIYN MEEKYDAISV FDADNLVSAN YLLEMNKQLC KGHKVVQGYV DSKNPFDSWI
TLSYSIAFWL SNRIFQLPRY YLGLSCGLCG TGFCISVDVL KEIGWGATCL TEDLEFTMKL
ALNNYKVAWA HNAVVYDEKP ITLKQSWNQR KRWMQGHADC ASRYLGPLFK KAFREGDLIA
FDCAVYLFQP IRLVFIGLIT IMMWIQTVFP ESPFYNLKYV FPTEVWSVFV TLQFLYGPLV
VLSEKKFNLK VLYGFLIYPF YCITWIPITI QGFMSKNNKD WSHTQHSRKI SISDLEKA
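
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below is a minimal, dependency-free illustration of how the GC content and the protein translation could be recomputed from the gene sequence. The helper names (clean, gc_fraction, translate) are hypothetical. The codon table is the standard genetic code, which is assumed to be adequate here because translation table 11 differs from it only in the set of permitted start codons, and this gene starts with ATG.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCTATCAC AATTAAACGG CATTATATAC"))  # MLSQLNGIIY
```

Passing the full gene sequence to translate and gc_fraction in the same way allows the result to be compared with the Protein sequence block and the GC content field.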