Gene Cthe_1136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1136 
Symbol 
ID4810804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1348949 
End bp1350409 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content32% 
IMG OID640106558 
ProductTPR repeat-containing serine/threonin protein kinase 
Protein accessionYP_001037561 
Protein GI125973651 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAA AAGCAGTTAA TTTTCATCCG CAAGTAGATA TGAGTATAGG GGATAAGTAT 
TTGCTGAAAG AATTTATTGA TGAAGGTTCC TTTGGCTATG TATGGAAAGC AGTAAACCTT
GAAAATAGGC AAACAGTGGC GCTTAAAATA CCCAAAGACC AGGAACGGGG AGACAACACC
TTATCGGAAG GAAAAGAATT CATCGGAAGT CATCACCCAA ATGTTATTTC TATATACTGG
ATGGGACGTG TAGATGGAGT TTTTCTAATT GAAATGGAGT ACTTTAATGG GCATAAACTA
TCAGACGAAT TATGTGAAAC GGGATTTAAG AGTCCTAGAA CATTTGAAGA AATATACAAC
TTGTTTTTTC AAATATTGGA TGGTGTAGAA TATATACATT CAAAACATAT TTGCCATGGA
GATATTAAGC CTCAAAATAT ATTAATAGAT GGGAAAATAG CCAAAATAAC CGATTTCGGT
ACAAGTAAAC TGATAGAAGA TTTATTTATA AAAACAATTG ATGGTGGAGG TACCTGGGCT
TATATGGCTC CAGAAGTGGC TGGGTCTAAT CGTAGATATC TTAATTCAGA TATATATTCA
TTGGGAGTGC TTTTATATAA GTTTTTAACT GGCAGGACTC CTCATGAAAC TGCAAATCAA
TTAATTAATA ATATACCTTA TCCAAAACCA AGAGAGATAA ATAATAATAT ACCTGAATCG
GTTGAAAGAA TTATAATGAA ATTACTCAAA AGAAACCCTG ATGAAAGGTA TCAGAATATA
AGCGAAATAA AAAGAGATTT AGAAGAAGCA TTAAGAAGTG AAGATAGAAA TATTATTTCT
TACAATGAAC GGGCTGAAGT AAAATACGAG GATACTGATT GGATAGAAAG AGTAATTCGA
TATTATAAGA ATAATGAATT TGATAAGGCA GAACTACTGC TAAAAACGGA GTACGAGAAT
GGAAATAAAT CTGCTGATGT ATTGTATCAT ATTGCTTATA CATATTTTCA GCAGGGAAGA
TATTTTGAGA GTATGGATGT GATAAAAGAT ATTGATATTA CACAAGTAGA GGATATCAGA
CAAGAGGCAT TAGAAGATAA TCTTCTTTAC TTAAAAGGGA AATTGTTTTT TGAACTTAAA
AAATATGAGG AAGCAGTTAA AGTATATGAA AAACTCGTAT CAAGGAATCC AGATGACTTG
AATTATAGAT ATAAATTGGC TTGTGCTTAT GGCTTAAATG ATGAACAGGA GAAATCCATA
GAAATTCTGG AGGATATAAA TAAAAAGACT CCTGGGATGC TCTATATAGT AAAAAAACTT
GGACATGCAT ATGACCAGAT AAAAGACTTT AAAAAGGCAA GGGCTTATTT TAATTATGCC
ATACGATTAG ACCCGAGCGA TACAATAATT AGGAATAGGC TGGAGGAATA TAGCAAATAT
TTTAACTATC TTGGATATTA A
 
Protein sequence
MKRKAVNFHP QVDMSIGDKY LLKEFIDEGS FGYVWKAVNL ENRQTVALKI PKDQERGDNT 
LSEGKEFIGS HHPNVISIYW MGRVDGVFLI EMEYFNGHKL SDELCETGFK SPRTFEEIYN
LFFQILDGVE YIHSKHICHG DIKPQNILID GKIAKITDFG TSKLIEDLFI KTIDGGGTWA
YMAPEVAGSN RRYLNSDIYS LGVLLYKFLT GRTPHETANQ LINNIPYPKP REINNNIPES
VERIIMKLLK RNPDERYQNI SEIKRDLEEA LRSEDRNIIS YNERAEVKYE DTDWIERVIR
YYKNNEFDKA ELLLKTEYEN GNKSADVLYH IAYTYFQQGR YFESMDVIKD IDITQVEDIR
QEALEDNLLY LKGKLFFELK KYEEAVKVYE KLVSRNPDDL NYRYKLACAY GLNDEQEKSI
EILEDINKKT PGMLYIVKKL GHAYDQIKDF KKARAYFNYA IRLDPSDTII RNRLEEYSKY
FNYLGY