Gene Cthe_0806 details

Gene Information

Locus tag: Cthe_0806
Symbol:
ID: 4810424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 972830
End bp: 975538
Gene Length: 2709 bp
Protein Length: 902 aa
Translation table: 11
GC content: 39%
IMG OID: 640106223
Product: PAS/PAC sensor hybrid histidine kinase
Protein accession: YP_001037234
Protein GI: 125973324
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase; [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
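
The start, end, gene length and protein length values in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(972830, 975538)
print(length)                     # 2709
print(protein_length_aa(length))  # 902
```

Both results match the Gene Length (2709 bp) and Protein Length (902 aa) fields listed for this gene.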


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTGCA GAGTAAAAAA CGAGGTGAGG GATGATTTGA ATGTGAACCT CGAGGTGATG 
GTGGCAGGAA ATGACGAGGA GTTTGCAGCA TCCCTAAAGA ACGCTCTTTT GGACGTGGGG
TGCAAGCATT CCGTAAAAAG TGCTGCTACA TTTAATGAAT TTAAAAACTA TTTAAGAAAA
AATAATTTTC ATATTGTTTT TTTTGACTGT TCTTCTGACA TTTCTTTTGC CGATGTGATG
GAGAGTTTTG AAAATCTTAA AAAAGAAATG CCCGTTGTTG CAGTTTTGGA AGAAGACGGA
AGTAGGCATG GTTTTGAAAT GATAAAACTG GGAGCCTGTG AGTATCTCGA AAAGAAAGAT
TCCTCAAGGC TAAGATCCAT TATATTTAAA AGATGCCGGG AGCTTGAACG GGAAAGAGAA
CGAATAAGGT TTGAAAAAAG GCTGAAGGAG GAAAATGAAA GACTGTTAAC AACTTTGGAA
AGCATAGGGG ACGGCGTAAT TGTTACCGAC AAGGCGGGTT GTGTAATCAT GATGAACAAA
GCGGCCCAGG AACTGACAGG GTATTTGTCC TGTAATTCAA TTGGGAAACC GCTGTCGGAA
GTTTTTGTGA TAATAAACGG TGTAAACCGT GAGCCTGAGC CCGATCCCTA TGAGAGGGTA
TTGTCCGAGC GGAAGACCGT GGGCCTTAGG AAAAATACCA TGCTGGTTTC AAAAGACGGT
ACTTTAAGGT ATGTTTCGGC AAGCACCGCT CCGATAAAAA GTTCTGACAG GGAAATAATG
GGTGTTGTTG TAGTTTTCAG AGACATAACT AGGATAAGAA AAGCGGAAGA GAAGCTTCAG
AAATTATCCC AGGCAGTGGA GCAAAGTCCC GGTATCATTG TGATGGCAGA CACGCGTGGA
AATATTGAAT ATGTAAATCC CCGATTTACT GAAGTTACAG GATACAGCTT TGATGAGGTA
AAGGGAAAAA GCCTTTTTTT CATGGAGTTA GAAGAAAATG GCGAAGAAAA ATGCAAAGAT
ATGATTCAGG CAGTGTTTTC AGGCAAGGAA TGGAGAAGTG AAATCAGGCA TAAATCAAAA
AACAATGTGG TATTTTGGGA ATATGCATAT TATTCTCCGA TAATCAACTC GGAAGGTGTC
GTAACTAATT ATCTTAAAGT TTCCGAAGAT ATTACTGAAA GAAAGTTAAT GTCGGAAAAC
CTTTTCAAGG CAAAAGAAGC TGCCGAGGCT GCAAACAGGG CAAAAAGCGA ATTTCTTGCA
AACATGAGCC ATGAAATACG GACTCCCTTA AACGGAATAA TCGGGATGAC CAACCTTACT
TTACAGACAG AATTAACTGA TGAGCAAAGG GAAAACCTCA ATATTGTAAA TTCATGTGCG
GAACTGCTTC TTAGGGTTAT AAACGATATT TTGGACTATT CCAAGATAGA AGCCGGCAAA
ATGACTTTAG AAAATGTCAA GTTTGATTTC TTCAACCTTT TGGAAAAGAC ATACAAAGCC
CATATTGTAC AGGCAAACGA AAAAGGACTG AGATTAAGCT ACACTGTGCA AAAAGGCATA
CCGAGGATTT TGGTGGGGGA TCCCGGACGG CTTCAGCAGG TTTTAAACAA CCTGATTTCC
AATGCCGTAA AATTTACTGA TATCGGCGAA GTAAGAATAG ATGTGGATAT TATTGAAAAA
AGAGATGATT CTGTAAAGTT GAAATTTACT GTGTCAGATA CGGGAATAGG TATAGACGGT
GACAAGATGA ACATGCTGTT TAAAAGTTTC AGCCAGGTTG ACAGCTCCAT TACCCGAAAA
TATGGAGGTA CAGGCTTGGG ACTTGCAATT TCAAAACAGC TTGTAGAGAT GATGGGTGGC
GAAATCTGGG TAGAGAGTCA AAAAGGGAAA GGAAGCACTT TCTATTTTAC TGCAGGTTTT
AAAAGAAAAG CCAAAAGCAG TGTTAAGGCA GAGTGTACTT CAAATGCCGA TGAGCGGGTA
CTTGGGAGGA AACTCAACAT TCTTCTTGCC GAAGATGACA AGGTCAATCA GCAGGTTATA
GCAGGCATGC TGAAAAAAGA GCAATGTTCT CTTACAATTG CTGAAAATGG TTTTGAGGCA
ATTAAACTCT TCGAAGAAAA TGAGTTTGAC CTTATACTGA TGGATGTCCA AATGCCGGAA
ATGGATGGTA TTGAAGCGAC AAAAAGAATA AGAAAAATGG AAGAAGGAAC TTTCAGGCAC
ATTCCCATTA TTGCGGTTAC TGCATTTGCT TTTGAGAATG ACAGGGAGAA GATTTTGGAA
GCCGGAATGG ATGATTACAT TTCAAAACCC TTTTCCTTTG ATGACATTTA TGCAGCAATA
AACAGAGTGC TGTACAAAGA TGAGCAATCG AGTGATGATA TGCAAAAAGA GAAAGAAGAT
GAAAAAGGAA GTTTGGATCC GGATGAAAAA GTTAATGAAA CTTTATCTGA AAGTGGAACT
GTAAATGAAA CGGGTCCGGA TTATAGTTTG AAAGATATAA TGGAACGACT GGATCTTTCC
ATACTGAATG GAAACTTTAC GGCAATTGAC AAGAGTGCTT TGGCAGTAAA AGAATTTGCA
CAAAACAGAG GATTTGACAA AGTAAAAAAT CTTGCGTTCA AAATACAACT TTCTGCCAGG
AAAAATCAGC TAGAGGATGT CAAAAAACTG TATGAGGTGC TAAAGCAGGA AATCAGCAAA
TTTAATTAA
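
The gene sequence is printed in blocks of 10 bases, six blocks per row, so the spaces and line breaks have to be stripped before any counting. The following is a minimal sketch of how the length and a GC percentage could be computed from such a block; the helper names are hypothetical, and the code is not claimed to reproduce the exact method behind the record's GC content field.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(block.split()).upper()


def gc_percent(block: str) -> float:
    """Percentage of G and C bases in a (possibly space-separated) sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence as displayed in this record.
first_row = "ATGGATTGCA GAGTAAAAAA CGAGGTGAGG GATGATTTGA ATGTGAACCT CGAGGTGATG"
print(len(clean_sequence(first_row)))  # 60
```

Passing the full gene sequence block to `gc_percent` gives a value that can be compared with the GC content field above.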
 
Protein sequence
MDCRVKNEVR DDLNVNLEVM VAGNDEEFAA SLKNALLDVG CKHSVKSAAT FNEFKNYLRK 
NNFHIVFFDC SSDISFADVM ESFENLKKEM PVVAVLEEDG SRHGFEMIKL GACEYLEKKD
SSRLRSIIFK RCRELERERE RIRFEKRLKE ENERLLTTLE SIGDGVIVTD KAGCVIMMNK
AAQELTGYLS CNSIGKPLSE VFVIINGVNR EPEPDPYERV LSERKTVGLR KNTMLVSKDG
TLRYVSASTA PIKSSDREIM GVVVVFRDIT RIRKAEEKLQ KLSQAVEQSP GIIVMADTRG
NIEYVNPRFT EVTGYSFDEV KGKSLFFMEL EENGEEKCKD MIQAVFSGKE WRSEIRHKSK
NNVVFWEYAY YSPIINSEGV VTNYLKVSED ITERKLMSEN LFKAKEAAEA ANRAKSEFLA
NMSHEIRTPL NGIIGMTNLT LQTELTDEQR ENLNIVNSCA ELLLRVINDI LDYSKIEAGK
MTLENVKFDF FNLLEKTYKA HIVQANEKGL RLSYTVQKGI PRILVGDPGR LQQVLNNLIS
NAVKFTDIGE VRIDVDIIEK RDDSVKLKFT VSDTGIGIDG DKMNMLFKSF SQVDSSITRK
YGGTGLGLAI SKQLVEMMGG EIWVESQKGK GSTFYFTAGF KRKAKSSVKA ECTSNADERV
LGRKLNILLA EDDKVNQQVI AGMLKKEQCS LTIAENGFEA IKLFEENEFD LILMDVQMPE
MDGIEATKRI RKMEEGTFRH IPIIAVTAFA FENDREKILE AGMDDYISKP FSFDDIYAAI
NRVLYKDEQS SDDMQKEKED EKGSLDPDEK VNETLSESGT VNETGPDYSL KDIMERLDLS
ILNGNFTAID KSALAVKEFA QNRGFDKVKN LAFKIQLSAR KNQLEDVKKL YEVLKQEISK
FN
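
The protein sequence corresponds to the gene sequence translated with translation table 11. As a sketch under stated assumptions: table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in its set of permitted start codons, and this gene starts with ATG, so a plain standard-code lookup is enough here. The names below are hypothetical helpers, not an existing library API.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGGATTGCA GAGTAAAAAA CGAGGTGAGG"))  # MDCRVKNEVR
print(translate("TTTAATTAA"))                         # FN
```

The two outputs match the first ten residues and the last two residues of the protein sequence listed above.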