Gene Cthe_0314 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0314 
Symbol 
ID4808532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp396564 
End bp397742 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content38% 
IMG OID640105725 
Productglycosyltransferase 28-like protein 
Protein accessionYP_001036745 
Protein GI125972835 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTCGAA GCCTCCCGAT TGCCCTGGCT TTGGCTGAGG CCGGATATGA AATAAAATAT 
TTGGGTTATG ACATGGCCAA AGAATATATG AAAAAAGCCG GGATAGAAGA ACTGTGCCCT
GAGTTCAGCA TAAGCGATAT TAAAAAGGGA AGTCCCAACC CGTACTGGAA CACGGCAGAA
GAATTCTGGT CGATGATTGG TTATGGCAAC ATGCCGTGGG TTGAAAGAAA AGTCGATGAA
TTAATAAATT TGTTAAAACA ATTTTTCCCC GATTACATAC TGTCCGACCT GGGCATTTTA
GCATGTCTTG CTGCAAGAAT AACGGGAATT CCATTAATTG CGATAAACCA GAGCTGTTAT
CATCCAAATG TAAAATTAAA ATGGTGGGAA AACAATTATG AAGCCGAAAA CTATAAAGAT
AAAGACAGTC TTTTAAATAA ACTGAATGCA TTTCTAAAGA AAAAAGGCGC ACAGCAATTA
AATACTTTTA CGGAAATATT TACAGGAAGG CTTACAATCA TTCCCGGTTT CTATGATTTT
GATCCGATAC CGAATCTTGA AAAATATAAT ACCCATTATG TAGGGCCTGT TCTGTATACT
CCAAAGGAAA ATGTTTCCGA AAAGCTTTTA AAACTTTTTG ACGCCGATCA ACCGATAATC
TTTTGCTATA CGGCAAGGTT CTATGATAAT GTGGGAGAAA GCGGGAAAGC AATTTTCGAT
AATATGATTA AAATTGCCGA TAAAATAGAT GCCTCCATTA TTATTTCGAC AGGGAATAAA
AAGGATGAAT TGCTTGCCTT GGATATTGCG TCAAAGGAAT TGAAAAGCGG CAAAGTCAGT
ATCGTTGATT ACGTGCCTTT GGATATGGCT TATGAAAAGT CTGACCTGGT GATTCACCAC
GGCGGTCATG GAAGCTGTCT TGCACAATTT TACTATGGTG TTCCTTCTGT CATAATACCT
ACTCATACTG AACGGGAATA TAACGCAAGA ATGTGTGAAA AACTGCATGT TGGCAAAATG
CTCCCCAGAA GAGAATTAAA CAGTGCAAAT TTGAAGAATT GCATCAATGA TGTGCTAAAT
GACATTACTT ATAAGAAAAG TGTCCGGGAT TGGAAAGAGA AAGTGTCCGG TGATTTTAAC
AACCTTGATA AAGTGGTAAA ACTCGTTGAT TCGCTGTAA
 
Protein sequence
MSRSLPIALA LAEAGYEIKY LGYDMAKEYM KKAGIEELCP EFSISDIKKG SPNPYWNTAE 
EFWSMIGYGN MPWVERKVDE LINLLKQFFP DYILSDLGIL ACLAARITGI PLIAINQSCY
HPNVKLKWWE NNYEAENYKD KDSLLNKLNA FLKKKGAQQL NTFTEIFTGR LTIIPGFYDF
DPIPNLEKYN THYVGPVLYT PKENVSEKLL KLFDADQPII FCYTARFYDN VGESGKAIFD
NMIKIADKID ASIIISTGNK KDELLALDIA SKELKSGKVS IVDYVPLDMA YEKSDLVIHH
GGHGSCLAQF YYGVPSVIIP THTEREYNAR MCEKLHVGKM LPRRELNSAN LKNCINDVLN
DITYKKSVRD WKEKVSGDFN NLDKVVKLVD SL