Gene Cthe_0511 details

Gene Information

Locus tag: Cthe_0511
Symbol:
ID: 4808311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 623795
End bp: 625117
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 34%
IMG OID: 640105924
Product: histidine kinase
Protein accession: YP_001036941
Protein GI: 125973031
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
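
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the terminal stop codon is not counted as a residue; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 623795
end_bp = 625117

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the 1323 bp and 440 aa reported in the record.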


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000775208
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAA AGAAAGATTT AAAAAAGCCA TATCCACTTA AATTAAAAAA ATCCGGAAGA 
AAGCTGCTTG CTGAAAGATA TTACCAAATT CTTAGAAGAA AATATAATTG CCTTTATGAG
ATTTTTAACA GTATTGAACT TCCGATAGTA AGTATATCCT ATCCGAGTTT TAATATAATT
GGAATTAATA ATAAAGCCAA GAAAGATGTA AAGCTTTTGC GTAAATGTGT TGCATTTGTA
AGAGAGGTTA AAAAAGGTAA TAATATATTA AATGCTTTAT CAAAAGTATA TTCTTGTCGA
AACCAAAAAT ATTTAGCTGA AATGCTTTTT ACAAAGTCGC CGGTATATCT TCCTAATGTT
GAAATTGATA ATAATGGCAG AAAAATTTAT TATAAATTAA TGCACCAGCC GATTTTAAAT
TCTCGTAATA AAATAGTAGG TTTTCTGATA ATTCCCATTG ATATAACCCA GGAGATAGAA
CAGAAAAAAT ATATAGAAAA TCTGGCCAGA TTCAAAGATG AGTTTTTGTA CGGCATAACC
CATGAGTTCA AAACTCCTCT GGTAGTTATA AGCTCGGCTT TGCAGGCAAT AGAGGCTCTT
TGCAGAGATG AGCTTACCGA CAGGGTAAAA AAATACTTAA ACAGGATTAA GCAAAATACC
TTCAGGCAAA TAAGGCTTGT AAATAATTTG CTGGATATAG CCAGAATTGA GGCCGGTCAT
ATCAAAATAT TCAAGAGGAA TCTGGATATT GTTGCTTTGA CTCGTCTGAT AACCGAGTCA
GTATCAATGT TTGCCGATTT AAAAGGTGTA CAGCTCTTGT TTTATTCCAA TATTGATAAA
AAGATTATTG CTATTGATGA AGAAAAATAT GAACGGATTA TGCTTAACCT GCTCTCAAAT
GCAATTAAAT TTACCCCCAA AGATAGATCG GTATATGTGG AAGTTTCAGC TAAGGAGGGA
TGTGTGGAGA TAAAAGTCAA AGACAGTGGA ATAGGGATAC CCAAAGATAA AATCAAGACC
ATTTTTGAAA GGTTCGGGCA GGTGGACAGT TCATTGTCAA GAAATGCCGA GGGTACAGGT
ATAGGATTGT CCCTGGTTCG CATGTTTGTA AATGCTATGG GCGGAGAAAT TAGTGTGACG
AGCGAAGAAA ACATTGGCAG TACTTTTACA GTCACTTTGC CTGATGCCGT AACTGAATGC
GGATATGAAA ATGACTGTTT TGACGATTTG AGAGATGAGC GTTTGATACA AGCTATCAAA
ATAGAGTTTT CGGATATATA TTTTGAGGAA AAGGAAGACA AAAAGAAAAG CCTTGAAACT
TAA
 
Protein sequence
MNIKKDLKKP YPLKLKKSGR KLLAERYYQI LRRKYNCLYE IFNSIELPIV SISYPSFNII 
GINNKAKKDV KLLRKCVAFV REVKKGNNIL NALSKVYSCR NQKYLAEMLF TKSPVYLPNV
EIDNNGRKIY YKLMHQPILN SRNKIVGFLI IPIDITQEIE QKKYIENLAR FKDEFLYGIT
HEFKTPLVVI SSALQAIEAL CRDELTDRVK KYLNRIKQNT FRQIRLVNNL LDIARIEAGH
IKIFKRNLDI VALTRLITES VSMFADLKGV QLLFYSNIDK KIIAIDEEKY ERIMLNLLSN
AIKFTPKDRS VYVEVSAKEG CVEIKVKDSG IGIPKDKIKT IFERFGQVDS SLSRNAEGTG
IGLSLVRMFV NAMGGEISVT SEENIGSTFT VTLPDAVTEC GYENDCFDDL RDERLIQAIK
IEFSDIYFEE KEDKKKSLET