Gene Information |
Locus tag | Cthe_1287 |
Symbol | |
ID | 4809539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1564633 |
End bp | 1566147 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106710 |
Product | periplasmic sensor signal transduction histidine kinase |
Protein accession | YP_001037712 |
Protein GI | 125973802 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0642] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
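The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 1564633
END_BP = 1566147
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the table is internally consistent: 1566147 - 1564633 + 1 = 1515 bp, and 1515 / 3 - 1 = 504 aa.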
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000512156 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAAAA CTCTGTTTAG CAAGATTGTT GTCCTGTTTA TTGCCATTTT GCTTGTAAGC ACATCCATAA CCGGGGTGAT GCTTTATATT TTTCTTGGAA ATTTTGCCTC GGAGGAAAAG GAAAAATTAT TAAGTGACAC TGCGGACAGT ATAAACAGTA TGCTGAACGA TTATTTGACT GCTTATTACA ATAACTATAA CAATCCCATG TTTGCATTCT GGGAGGAAAT ATACCGCAGA ATGCTGGATA ACGCCCTTGA GAGGGAAAGT CAAAGTACAG GCACTGTCAT ATGGATTGTG TCCACAGAAG GAGAAATCGG TATTATCAAG GGTAATCAGG CGGTGGTAAG GGAAATAGTC CAAAAGCTTA CCGACGACAC CGGCAAGATT AAACTGCAAA ACCCGGCACA ATATAAAGAC GTTATGAGCG GTAGTGTGCC CATGGTCAAA GAAATAGGAG ATTTTTACGG ACTGTTTAAG GACACGAATG TATCGTGGCT GACTATTGGA AAGCCGTTTA CATATAACGG CAAAATTCTC GGGGCTGTTT ACCTTCATAC GCCGGTTCCT GAGGTACAGA GGGCAAGAAG CAGTGTTTTT AAGTTCTTTA TTTTTGCCGT TGTCATTTCC ATAATAATAT CAATAGTACT GATTTATATT TTTTCACTTA AGCTTTCAAG GCCTTTGAAA AAGATTAACA GTGCTGCAAA GAAAATTGCA AGCGGAAAAT TTGACGAAAG GCTTGATATT TCATCGGAAG ATGAGATAGG TGAGCTTGCA AGAAGTTTTA ACAACATGGC GGGAGAGCTT CAGAATCTGG AAAACATGCG AAGAGGTTTT ATCGCCAATG TGTCCCATGA GCTTAGAACC CCGATGACTT CAATACATGG GTTTATAGAA GGAATTCTGG ATGGAACCAT TCCTCCGGAA AAGGAAAAGG ATTACCTTTT AATTGTCCGG GATGAAATAC GAAGGCTCAA CAGGCTGACC ACGGATCTTC TTGACCTTGC AAAAATGGAG GCCGGTGAAA TCACTATAAA TCCGGTTAAC TTTAATATCA ATGAACTTAT CAGACGATGC ATTATAAAGC TTGAAAACTT TATAACACAA AAGGACATTG AGGTTGAGGC AAATTTTGAA GAAGAGGATA TGTATGTAAA AGCGGATATT GACTCCATTG AGAGAGTGCT TATAAATCTT ATGCACAACG CTGTCAAATT TGTTCAGCAG AACGGAAAAA TCAAAGTATC CACCTCAAGT TACAAAAACA AGGTACTTGT TTGTGTTGAG GATAACGGCA TAGGAATTGA CAGGAATGAA ATTGACCTTA TCTGGGAAAG ATTTTACAAG TCGGACAAAT CCAGAAGCAA AGAAAAAGGC GGTGCCGGCC TTGGACTTGC CATAGTCAGA AACATAATAA ACGACCACAA GCAGGAAATC TGGGTTGAGA GTGAGGTCGG AAAAGGAACC AAGTTTTATT TTACTTTGGA CAAAGGAAGC AATGAAAAAG CATAA
|
Protein sequence | MLKTLFSKIV VLFIAILLVS TSITGVMLYI FLGNFASEEK EKLLSDTADS INSMLNDYLT AYYNNYNNPM FAFWEEIYRR MLDNALERES QSTGTVIWIV STEGEIGIIK GNQAVVREIV QKLTDDTGKI KLQNPAQYKD VMSGSVPMVK EIGDFYGLFK DTNVSWLTIG KPFTYNGKIL GAVYLHTPVP EVQRARSSVF KFFIFAVVIS IIISIVLIYI FSLKLSRPLK KINSAAKKIA SGKFDERLDI SSEDEIGELA RSFNNMAGEL QNLENMRRGF IANVSHELRT PMTSIHGFIE GILDGTIPPE KEKDYLLIVR DEIRRLNRLT TDLLDLAKME AGEITINPVN FNINELIRRC IIKLENFITQ KDIEVEANFE EEDMYVKADI DSIERVLINL MHNAVKFVQQ NGKIKVSTSS YKNKVLVCVE DNGIGIDRNE IDLIWERFYK SDKSRSKEKG GAGLGLAIVR NIINDHKQEI WVESEVGKGT KFYFTLDKGS NEKA
|
| |
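The sequence fields above can be related to the GC content and Translation table entries with a short script. This is a minimal sketch, not the database's own method: it assumes the gene sequence is given 5' to 3' in the coding orientation (the record lists the gene on the minus strand, but the sequence shown starts with ATG), and it builds the codon table from the standard NCBI amino-acid ordering, which translation table 11 shares with the standard code for sense and stop codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE.get(s[i:i + 3], "X") for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCTAAAAA CTCTGTTTAG CAAGATTGTT"
last_blocks = "AATGAAAAAG CATAA"

print(translate(first_blocks))  # MLKTLFSKIV
print(translate(last_blocks))   # NEKA*
```

The two fragments reproduce the first block (MLKTLFSKIV) and the last four residues (NEKA) of the listed protein sequence, followed by the stop codon. Passing the full gene sequence string to `gc_fraction` gives a value that can be compared with the 40% reported in the table.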