Gene Cthe_1244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1244 
Symbol 
ID4809749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1507662 
End bp1508834 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content35% 
IMG OID640106667 
Productglycosyl transferase family protein 
Protein accessionYP_001037669 
Protein GI125973759 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AGTATGGTGA GCAAAAATTG AATAAAGAGT TAGTTTCAGT AAGCATATGC 
GTTTACAATG GCGAAAAATA TATAGAATCT TCTATAAAAA GCGCCCTTGC GCAAACATAT
CAAAACATAG AAATAATAGT AATTGATGAT GGTTCGACGG ACAGGACGGG CGAGATTGTA
AAAAACTATT GTCCCGATGT TAAATATATC TATCAAGAGA ATAAAGGTGT GTCAGAAGCC
AGGAATACAG GACTTAGGCA CTGCAGCGGA AATTATATTG CATGGTTGGA TGCCGATGAC
TTATATTTAC CGGATAAAAT AAAAGAACAG GTTGATTTTT TACAACAGAA TAAAGATATA
GACTGTGTAT ACAATGACGC TTTTTTAATC GATGCTCACG ACAACTTGGT TAAAGTGCTT
AGAAGCGATT ATGGCAATTT GGCTCCAAAT GATTTTTTGG CACAGCTTCT TTTCAGACAA
ACCATTCCCT GTCCGCCAAG TACCTTGTAT AGAAGAAAGT GTTTTGAAAA CCTGCGTTTT
ATTCCCGGCA TGAGGTATGC GGAAGATTAT TGGAGCAGCA TCCAACTGGC CCAAAGATTC
AAATGTGGAT ATTTGCCTAA AATCCTCTAC AAATACAGAA GGCATGACTC CAACCTGACC
AATAACAAAG AAAAACAAGA AGAAATGGAA ATCAAAGTAG TTAAAAGTCT TGGAATTGAT
AAAATAAAAG ATATCGTAGA AAAGTCTTCT TATCCTGAGC ATGAAAAGCT TTTATTGCTT
GGTAAAATTT TTATCAAAAT CAGTGAATAC GAGGAAGCAT GTAAAGCCTT GGAAAAAATC
CAAGTCCCGG ACTATATTCA GGACAGAAAA ACGAAATTTT TAAAATACTT TTACCTGGGA
AATGTAAACT ATTTGACAAA AGAATATAAC AAAGCAAAAT TTTGCTACGA AAAATCGCTC
CGAACAGACC CCGGCAAAGC AGAAGCATAT AACAATTTGG GTGCGGCATT ATATCACTTG
TCAGAAACTG AGGAAGCACT TGAAAATTTT AATAAAGCAC TCGCTTTGAA AAAAGAATAT
CTCGACCCTC AGAACAACTT AAAAAATATA AAAACAGGCG GGGATTTAAA AATTACAATC
CGGGAGTTGA GAGAAAATTT GATGGTTTAT TAG
 
Protein sequence
MKRKYGEQKL NKELVSVSIC VYNGEKYIES SIKSALAQTY QNIEIIVIDD GSTDRTGEIV 
KNYCPDVKYI YQENKGVSEA RNTGLRHCSG NYIAWLDADD LYLPDKIKEQ VDFLQQNKDI
DCVYNDAFLI DAHDNLVKVL RSDYGNLAPN DFLAQLLFRQ TIPCPPSTLY RRKCFENLRF
IPGMRYAEDY WSSIQLAQRF KCGYLPKILY KYRRHDSNLT NNKEKQEEME IKVVKSLGID
KIKDIVEKSS YPEHEKLLLL GKIFIKISEY EEACKALEKI QVPDYIQDRK TKFLKYFYLG
NVNYLTKEYN KAKFCYEKSL RTDPGKAEAY NNLGAALYHL SETEEALENF NKALALKKEY
LDPQNNLKNI KTGGDLKITI RELRENLMVY