Gene Cthe_0287 details


Gene Information

Locus tag: Cthe_0287
Symbol:
ID: 4808505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 355145
End bp: 358627
Gene Length: 3483 bp
Protein Length: 1160 aa
Translation table: 11
GC content: 40%
IMG OID: 640105699
Product: GAF sensor hybrid histidine kinase
Protein accession: YP_001036719
Protein GI: 125972809
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
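
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; neither assumption is stated in the record. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 355145
END_BP = 358627


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp, "bp")
    print(protein_length(length_bp), "aa")
```

Under these assumptions the results agree with the listed Gene Length (3483 bp) and Protein Length (1160 aa).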


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTGC TGAAAAATGT AAGAATTCAA CACAGAATGT GGGCATTGTT TGCCGTTTGG 
CTTATATTGT ATGTCATTTT TGCCGTAGCT GCTTACAGAA ATGTAAGTGA GATTGGGCAA
GTATCCGAAG ACATTTATAC CCAGTCGCTG AAGACGTCAA ATGCCGCCAG AGAGGCAAGG
GTGGCAATAA TGAAAATACA AAGAGGAATA AAGGAGATTT TGCTGTCTGA CAATCCTGAT
AATGTCTATT ACGAATTGGA AAAAATCAGG GAATTGGATA ATCAGGTTTT GGAAAACTTT
GAAATAATAA AATCCAATTC CGGCGGTAAT GAAGAAATAG AGAAGCTTGT AAATGAATCA
ACAGAAATAT TTGACGAATG GAGAAAAACC CGGGGGGAAA TTGCGGCTCT GATACAGCAA
AACAAACATT TACAAGATAC AACTGCTCTT ACAGTAAAAA ATAATGAGTT TGTTGAAAAA
TTGGAGAAAT GCCTGGACAG TCTGGATAAA ATTGAAGAGG ATGAAGCGCA AAATCTTATA
GACCGATCCG CCAAAATACA GAAATACCAT AAAAACAGTA TATTCTATTG GACCGGGATA
ATAGTGTTTA CTTCGGTTGT TCTGTTTACC ATTGTGATAC GGAGCATAAT GTGGCCCGTT
TCATATTTGC AGCATATAAT GCGAACCAGC GCAAATACCA GAGAGATTAT TGAAGCTGAA
CTTCCCGGGA AAAATGAATT GGTTAACATG GCCAATTACT ACAACAAGCT TGTAAGAAAT
CTAAAGGAGC AGTTCTGGAT AAAAGACAAT TGCAGTGCTC TAAATGAGGA ACTTGCCGGC
AAATTTGATC TTAAGGAAAT TACGGATAAA GCGGTAACTT TTTTGGCAAG GATTTTGGAT
GCAGGAAACG GTGTGCTTTA TCTTTATAAT GATGAGGATA AAAAGCTGTA TTTGAATTCA
TCCTATGCCT TTACCGAAAG GGACCGTCTG TCAAACCAAT ATGAACTGGG AGAAGGGATA
ATAGGTCAGG TGGCCCTTGA AAGAAAGCCG ATTCTTTTAA AAAATGTCAA AAGGCAGGAA
GCACTGATTA CCACGGGCAC CGTAACTGAA GCACCTCTTA ATGTTTATGC CGTGCCGCTC
TTATATCAAG GTCATCTTTA TGGCGTCTTG GAACTGTCAT CTTTTGAACC TTTTGTGGAA
TTAAAACAGG AATTGATGAA CGAGGCTGCC AAGATTGTAT CCACGTATTT ATATACCGCC
GTACAAAATA ACAGGATTGT AAACCTCTTG AAAATTACAG AGGCTGCAAA ACAAGAGGCG
GGCAGGAAAG CAAGCGAGCT GGAAGAGGCC AACAGGGTGC TGGAAGAGCA GCAAATTCTT
CTGCAAAAAC AGACTGAAGA GCTTCAGCAA ACCAATATCC AGCTTGAATA TCAGCAGCAG
AAACTTCAGC AGCAGAGTGA GGAATTGCAG CAGACCAATT CCCAGCTTGA GGAACAACAA
CAGCTTCTTG AAGAACAGGC AAGGCTTTTG AATATTAAAA ATGAAGATCT TGAAAGAACA
ACCCGGGAGC TTAGATTAAG AACTGAAGAA TTGGAAAAGG TCAATAAATA CAGATCAGAG
TTTCTGGCCA ATATGTCCCA TGAGCTCAGG ACTCCGCTCA ATTCAATTAT TCTCCTTTCC
AAAATGCTTG CACGAAAGAA AATAAATGAG TTTGATGAAA AAGATATGGA GAAGATTGAG
GTAATCAATA AAGCCGGTCA GGAACTTTTG CGTTTAATTA ATGATGTTTT GGATCTTTCC
AAAGTTGAAT CGGGCAAAAT GAATTTGAAT ATTTTCACTT TCCATTCCAG TGAACTGATG
GAGGACTTAA GGCAGATGTT TGAAAGCTCT GCAAAAGAGA AAAATATATC CTTTTTTGTG
GAGGATTCAA TAAACTCCAT GTTGGTCGGA GACAGGGATA AAATTTCCCA GATACTGAGA
AATTTCCTTT CCAATGCGTT TAAATTCACT TCAGAGGGTT CTGTAATTTT AAAGGCGGAA
ATGGACAAGG ACAGAGAAAA CGATCTCATA TTCTCGGTAA CCGACACCGG AATCGGTATT
CCTGAAGAAC ACATTTCTGA TATATTCCAG GAGTTTCGGC AGGTGGACGG AACAATATCA
AGAAAGTATG GGGGGACCGG ACTTGGTTTG TCCATATCAA AGAAACTTGC CGACCTTATG
AAAGGAGAAA TAGAAGTCCG AAGCAAAATT GGAGAGGGAA GTACGTTTTT CCTCAAACTG
CCCGGTCTTG TTTGCAAAAA AGAGGATGGC AGGGAAAAGG TGACGGTAAA GAAAGACATG
GACAGGAAAA TCTCAACGGA TCGAAAAAAT GACAAGGTAA TACTGGTTAT TGAAGACAAT
GAGAATTTTG CAAAGTACCT GAAAGAAATA AATGACGGCA TGGGATTTGA CACCATTATC
GCAGGCAGCG GAAAGGAAGG CCTGGAGACA GCCAAAAACA CCAGGGTGGA CGGAATTTTG
CTGGACATTA TGCTTCCTGA TATGAGTGGA ATTGAAGTGC TAAGAGAGCT AAAAAGCATT
GTGGAGCTTG GAAAAATTCC GGTACATGTA ATTTCCGACC TTGAAGAAGA TGACGTTTCG
CAAATTATTG CGTATTTGTT GAATTATTCC GCACGGCACC CCGAACGGAT TATGATTGTC
GGAGGGGATG CAATCTGGCA GAGTACCATC AAAAAGCCGT TTGAAGAAAG AAATATAACA
ACAAAAACGG TATCGACTGA GGAAGAGGTA AAGTTCGAAT TTAAAAAGGA AATGTATGAT
GCTGTGATTT TGAATATGGA ATTGGATGAC AAAGATGTTG CAGGCATTTG CGAATATATA
GCTGAGCAAA ATGTGGACAT TCCTCTGATT GTTTACGCCG AAAAAAGTCT TTCACAAGAA
CGGCAGAAGG AAATAAAAAA ATATGCAGAT AAAATAATTA TTAAGACAGC GGATTCAAAA
GAAAGGTTTC AGGATGAACT GACTGCTTTC CTTAAAGGAA TTAAAGAAAA CATGGACAAT
GGCTGCTATT TGTTTTCGAA AGTAAAAAAA GAATATGCGT TGAATTTGGA CGGAAAAACC
ATTATGATTG TCGATGATGA CCACAGAAAT GTTTTTGTCC TGGCAGCGGC TCTTGAAGAT
TACGGAGCCA ACATCATAGC TGCGGAAAAC GGTAAAATGG CTCTTGAATT GCTTAGGACC
AATAAAGTAG ACTTAATATT GATGGATATT ATGATGCCGG TGATGGACGG ATATGAGACT
ATCAGAGCGA TAAAGGGAGA TAAAGAGCTA AAAGGTATTC CGGTTATCGC AGTTACGGCA
AAGTCTCTAA AATCCGACAA GGAAAAGTGC ATCGAAGCCG GAGCGGATGA CTATATTTCA
AAACCTGTGG ATTACGATGT CCTGATACGG CTTATAAAAG CCTGGACAGC CAAAGAAAAC
TAA
 
Protein sequence
MKLLKNVRIQ HRMWALFAVW LILYVIFAVA AYRNVSEIGQ VSEDIYTQSL KTSNAAREAR 
VAIMKIQRGI KEILLSDNPD NVYYELEKIR ELDNQVLENF EIIKSNSGGN EEIEKLVNES
TEIFDEWRKT RGEIAALIQQ NKHLQDTTAL TVKNNEFVEK LEKCLDSLDK IEEDEAQNLI
DRSAKIQKYH KNSIFYWTGI IVFTSVVLFT IVIRSIMWPV SYLQHIMRTS ANTREIIEAE
LPGKNELVNM ANYYNKLVRN LKEQFWIKDN CSALNEELAG KFDLKEITDK AVTFLARILD
AGNGVLYLYN DEDKKLYLNS SYAFTERDRL SNQYELGEGI IGQVALERKP ILLKNVKRQE
ALITTGTVTE APLNVYAVPL LYQGHLYGVL ELSSFEPFVE LKQELMNEAA KIVSTYLYTA
VQNNRIVNLL KITEAAKQEA GRKASELEEA NRVLEEQQIL LQKQTEELQQ TNIQLEYQQQ
KLQQQSEELQ QTNSQLEEQQ QLLEEQARLL NIKNEDLERT TRELRLRTEE LEKVNKYRSE
FLANMSHELR TPLNSIILLS KMLARKKINE FDEKDMEKIE VINKAGQELL RLINDVLDLS
KVESGKMNLN IFTFHSSELM EDLRQMFESS AKEKNISFFV EDSINSMLVG DRDKISQILR
NFLSNAFKFT SEGSVILKAE MDKDRENDLI FSVTDTGIGI PEEHISDIFQ EFRQVDGTIS
RKYGGTGLGL SISKKLADLM KGEIEVRSKI GEGSTFFLKL PGLVCKKEDG REKVTVKKDM
DRKISTDRKN DKVILVIEDN ENFAKYLKEI NDGMGFDTII AGSGKEGLET AKNTRVDGIL
LDIMLPDMSG IEVLRELKSI VELGKIPVHV ISDLEEDDVS QIIAYLLNYS ARHPERIMIV
GGDAIWQSTI KKPFEERNIT TKTVSTEEEV KFEFKKEMYD AVILNMELDD KDVAGICEYI
AEQNVDIPLI VYAEKSLSQE RQKEIKKYAD KIIIKTADSK ERFQDELTAF LKGIKENMDN
GCYLFSKVKK EYALNLDGKT IMIVDDDHRN VFVLAAALED YGANIIAAEN GKMALELLRT
NKVDLILMDI MMPVMDGYET IRAIKGDKEL KGIPVIAVTA KSLKSDKEKC IEAGADDYIS
KPVDYDVLIR LIKAWTAKEN
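
Both sequences are printed in space-separated blocks of 10, so they need cleaning before any computation. The sketch below strips the whitespace, computes a GC fraction, and translates a DNA string. It is not the database's own method, and the helper names are hypothetical. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs only in which codons may act as starts, which this sketch does not model.

```python
# Standard codon-to-amino-acid assignments, built in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence as printed in the record.
    first_line = (
        "ATGAAACTGC TGAAAAATGT AAGAATTCAA "
        "CACAGAATGT GGGCATTGTT TGCCGTTTGG"
    )
    dna = clean_sequence(first_line)
    print(translate(dna))
    print(round(gc_content(dna), 3))
```

Applied to the first line of the gene sequence, `translate` yields the first 20 residues of the protein sequence, MKLLKNVRIQHRMWALFAVW. Pasting the full gene sequence into `clean_sequence` allows the same comparison against the whole protein and against the listed GC content.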