Gene Cthe_1287 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1287 
Symbol 
ID4809539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1564633 
End bp1566147 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content40% 
IMG OID640106710 
Productperiplasmic sensor signal transduction histidine kinase 
Protein accessionYP_001037712 
Protein GI125973802 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000512156 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAAAAA CTCTGTTTAG CAAGATTGTT GTCCTGTTTA TTGCCATTTT GCTTGTAAGC 
ACATCCATAA CCGGGGTGAT GCTTTATATT TTTCTTGGAA ATTTTGCCTC GGAGGAAAAG
GAAAAATTAT TAAGTGACAC TGCGGACAGT ATAAACAGTA TGCTGAACGA TTATTTGACT
GCTTATTACA ATAACTATAA CAATCCCATG TTTGCATTCT GGGAGGAAAT ATACCGCAGA
ATGCTGGATA ACGCCCTTGA GAGGGAAAGT CAAAGTACAG GCACTGTCAT ATGGATTGTG
TCCACAGAAG GAGAAATCGG TATTATCAAG GGTAATCAGG CGGTGGTAAG GGAAATAGTC
CAAAAGCTTA CCGACGACAC CGGCAAGATT AAACTGCAAA ACCCGGCACA ATATAAAGAC
GTTATGAGCG GTAGTGTGCC CATGGTCAAA GAAATAGGAG ATTTTTACGG ACTGTTTAAG
GACACGAATG TATCGTGGCT GACTATTGGA AAGCCGTTTA CATATAACGG CAAAATTCTC
GGGGCTGTTT ACCTTCATAC GCCGGTTCCT GAGGTACAGA GGGCAAGAAG CAGTGTTTTT
AAGTTCTTTA TTTTTGCCGT TGTCATTTCC ATAATAATAT CAATAGTACT GATTTATATT
TTTTCACTTA AGCTTTCAAG GCCTTTGAAA AAGATTAACA GTGCTGCAAA GAAAATTGCA
AGCGGAAAAT TTGACGAAAG GCTTGATATT TCATCGGAAG ATGAGATAGG TGAGCTTGCA
AGAAGTTTTA ACAACATGGC GGGAGAGCTT CAGAATCTGG AAAACATGCG AAGAGGTTTT
ATCGCCAATG TGTCCCATGA GCTTAGAACC CCGATGACTT CAATACATGG GTTTATAGAA
GGAATTCTGG ATGGAACCAT TCCTCCGGAA AAGGAAAAGG ATTACCTTTT AATTGTCCGG
GATGAAATAC GAAGGCTCAA CAGGCTGACC ACGGATCTTC TTGACCTTGC AAAAATGGAG
GCCGGTGAAA TCACTATAAA TCCGGTTAAC TTTAATATCA ATGAACTTAT CAGACGATGC
ATTATAAAGC TTGAAAACTT TATAACACAA AAGGACATTG AGGTTGAGGC AAATTTTGAA
GAAGAGGATA TGTATGTAAA AGCGGATATT GACTCCATTG AGAGAGTGCT TATAAATCTT
ATGCACAACG CTGTCAAATT TGTTCAGCAG AACGGAAAAA TCAAAGTATC CACCTCAAGT
TACAAAAACA AGGTACTTGT TTGTGTTGAG GATAACGGCA TAGGAATTGA CAGGAATGAA
ATTGACCTTA TCTGGGAAAG ATTTTACAAG TCGGACAAAT CCAGAAGCAA AGAAAAAGGC
GGTGCCGGCC TTGGACTTGC CATAGTCAGA AACATAATAA ACGACCACAA GCAGGAAATC
TGGGTTGAGA GTGAGGTCGG AAAAGGAACC AAGTTTTATT TTACTTTGGA CAAAGGAAGC
AATGAAAAAG CATAA
 
Protein sequence
MLKTLFSKIV VLFIAILLVS TSITGVMLYI FLGNFASEEK EKLLSDTADS INSMLNDYLT 
AYYNNYNNPM FAFWEEIYRR MLDNALERES QSTGTVIWIV STEGEIGIIK GNQAVVREIV
QKLTDDTGKI KLQNPAQYKD VMSGSVPMVK EIGDFYGLFK DTNVSWLTIG KPFTYNGKIL
GAVYLHTPVP EVQRARSSVF KFFIFAVVIS IIISIVLIYI FSLKLSRPLK KINSAAKKIA
SGKFDERLDI SSEDEIGELA RSFNNMAGEL QNLENMRRGF IANVSHELRT PMTSIHGFIE
GILDGTIPPE KEKDYLLIVR DEIRRLNRLT TDLLDLAKME AGEITINPVN FNINELIRRC
IIKLENFITQ KDIEVEANFE EEDMYVKADI DSIERVLINL MHNAVKFVQQ NGKIKVSTSS
YKNKVLVCVE DNGIGIDRNE IDLIWERFYK SDKSRSKEKG GAGLGLAIVR NIINDHKQEI
WVESEVGKGT KFYFTLDKGS NEKA