Gene Cthe_0286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0286 
Symbol 
ID4808504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp353826 
End bp355100 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content39% 
IMG OID640105698 
Productresponse regulator receiver sensor signal transduction histidine kinase 
Protein accessionYP_001036718 
Protein GI125972808 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase
[COG3437] Response regulator containing a CheY-like receiver domain and an HD-GYP domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.136894 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAC CGGTATACAC AATTCTGATT GTAGATGACA ACGAAAATAA TTTGTTTTCT 
TTGAGAACTC TTATCGAAGA ACATATAAAT GCAGATGTAA AAGAAGCTAA TTCGGGAGAA
AAGGCATTAA AAATTTTGTT TAAAGAGAGA GTGGATCTTA TCATTCTTGA TATTCAAATG
GAAGGAATGG ACGGCTTTGA GCTCGCTTCG ATAATAAAAA AGAGAAAAAA GACAAGCAGT
ATACCTATAG TTTTTCTGAC TGCTTCCTAT ATCGGTGACG AGTTTCAAAG AAGAGGATTT
GAGATTGGTG CTGTGGATTA CCTGACAAAG CCTATTGACG AATACCAGCT TATTAACAGA
ATTAATGTCT ACCTGAAAAT GATAGAAAAA GAAAGGACTA TGAACATACT TCTTGAAAAA
AGGGTAAGGG AGCAGACAGA AGAATTAAGG GCTGCAAAAG AAGCAGCAGA AGCGGCGAAT
GAAGCCAAAA GCATTTTTCT TGCCAATATA TCTCATGAGC TTAGGACTCC CATCAACATC
CTGTACAGCA CAACACAGAT AATTAATTCA TATCTCAATG AGGACAAGGT TCTTGACAGA
GAAAAAATTC GAAGTAAGAT AGCCATGCAG CAGCAAAACT GCTATCGTCT GTTGAGGCTT
GTCAACAATC TCATTGACAT TACCAAAATA GATTCAGGTT ATTTTGAACT TAAATTCTCT
CGTTGCAATA TAGTTGAAGT GGTTGAAAAT ATTACTTTAT TGGTTGTGGA ATATGCCAAA
AACAAAGGGG TCTCCCTCAT ATTTGACACC GATGTGGAAG AAAAGATCAT TTCCTGCGAC
CAGAATGCAA TGGAGCGAAT AATATTAAAC CTTTTGTCCA ATGCGATAAA ATTCACGCCG
AGGGGAGGAT CTATAAAGGT TGAGGTGAAA GACTGCGGCA AGACTGTTGC AATAAGTGTG
AAAGATACCG GAATAGGAAT CCAGGAGGAT AAACTGGAAA TGATTTTTGA AAGGTTCAAG
CAGGTGGATA ACCTTTTGAC CAGAAAAAAT GAGGGAAGCG GTATTGGTTT GAGCCTGGTC
AAATCACTGG TGGAACTGCA CGGCGGAAAG ATCAGTGTAA AGAGTGAGTA CAACAGGGGA
AGCGAGTTTA CGGTTGAACT TCCCGCGGAT CTGGAAAACG GGGAAAATCC TTCAATGGAT
GCGGCGGACA GAAAAGAAGA AAACGAAAAC AAGCAGCACA ATGTGCATAT AGAATTTTCT
GATATATACT ATTGA
 
Protein sequence
MQEPVYTILI VDDNENNLFS LRTLIEEHIN ADVKEANSGE KALKILFKER VDLIILDIQM 
EGMDGFELAS IIKKRKKTSS IPIVFLTASY IGDEFQRRGF EIGAVDYLTK PIDEYQLINR
INVYLKMIEK ERTMNILLEK RVREQTEELR AAKEAAEAAN EAKSIFLANI SHELRTPINI
LYSTTQIINS YLNEDKVLDR EKIRSKIAMQ QQNCYRLLRL VNNLIDITKI DSGYFELKFS
RCNIVEVVEN ITLLVVEYAK NKGVSLIFDT DVEEKIISCD QNAMERIILN LLSNAIKFTP
RGGSIKVEVK DCGKTVAISV KDTGIGIQED KLEMIFERFK QVDNLLTRKN EGSGIGLSLV
KSLVELHGGK ISVKSEYNRG SEFTVELPAD LENGENPSMD AADRKEENEN KQHNVHIEFS
DIYY