Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3068 |
Symbol | |
ID | 4809942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3609155 |
End bp | 3610594 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108492 |
Product | periplasmic sensor signal transduction histidine kinase |
Protein accession | YP_001039457 |
Protein GI | 125975547 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000279575 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTTAA AAAAACGCCT TATTTTTTCG AATGCTGCAA TTATTGTTAT TCCTCTTGGG ATAACATTTG TGGCATCTTT TATTTTTATG TTTGTTTTGG CGAGGATACA TGATGTCGAC CTAAGTTACA ATAATGTAAA AAAGCTTACT CAGATACAAT ACGAGTTTTT TAAAGCAGAG GGGGGATTGC TTAAAAATTC CCCGGAAATA ATTTTGGAAA AAGATTTTCA GCAGTATATT ACCACAAGAC TTGAGAGCAT AGAAGCCGAC ATAGTGGTGC TAAAAGGCCA GGAACGGGTT TTTGAAACTC GCAAGCTCAG TATTATTGAG TTGGAAAGAT GTCTTGAAAA AACCGGCGAC AACTTGTTTA GAAACATTGT TGAGATTCAG GGCAAATCCC ATATGGTAAA AGTAATACCT GTGATATTTA AAAGCGGTGA GGATGGGAAA ATTCTTTTGC TTGTGCCCGC CGTAAATGAC TGGATGACAA CGGAAAAGCT TTTCATATTT TCCGGCGTGG TGTTTGTTCT CAGCTTTATA ATAACAAATA TAGTCATCAT TACTGCTTTT TCAAAGAAAG TTATAACTCC TCTGGGGAAG CTTCAGGCTG CTGCGGGCAA AATAAGCGAA GGCAATCTGG ATTTTGAGAT TATTGAGGAC GGAGATACCC AAATTAGAGA ATTGTGCCGC TCCTTTGAGA AAATGAGGCT TAAGCTTGTG GAGGCAAATT ATACGCAGAA AAAATATGAT GAGAGCAGAA AAATGCTTTT TTCAAGCATA TCTCACGATC TTAAAACTCC TATAACTTCA ATAAAGGGAT ATGTTGAGGG GATATTGGAC GGTGTGGCAA ATACCCCTCA GAAAGTGGAA AAATATTTAA GAACGGTTCA TTCCAAGGCT GTTCACATGG ATAGAATGAT TGATGACCTT CTTTTGTATT CGAGACTGGA TATGCACAAG GTTTCGTTTA ATTTTGAAAA GACGGATGTG CTAAAGTACT TTGAAGATTG CATGTATGAA ATAGATATTG AACTTGAAAA GTCCAATATC AAGGTTGAGC TTCATAACAA CTTGAGAGGA AAGCGTTATA TAATGATAGA CAGGGATCAG GTGCGAAGAG TTGTGATAAA CATAATTGAC AACTCAAGAA AATATATGGA CAAGGAACAG GGGAAAATAG ATATTTTTTT GAGGGAAGCA ACATCAAATG TGGTAATAGA GATAAAAGAC AACGGAGCCG GAATTAGTGA AAGTGATTTG CCCTACATTT TTGACAGGTT CTATCGCGCC GATTCGGCGA GGGATACCAG GAAAGGAAGC GGGCTTGGAC TTGCCATTGC CAAACAAATA ATAGAAGGAC ACGGAGGGAA AATTTGGGCG GTCAGCCGTA TGGGCGAAGG CACGAGTGTG ATGATTTCTC TGAAAAAATA TGAAGGCTGA
|
Protein sequence | MDLKKRLIFS NAAIIVIPLG ITFVASFIFM FVLARIHDVD LSYNNVKKLT QIQYEFFKAE GGLLKNSPEI ILEKDFQQYI TTRLESIEAD IVVLKGQERV FETRKLSIIE LERCLEKTGD NLFRNIVEIQ GKSHMVKVIP VIFKSGEDGK ILLLVPAVND WMTTEKLFIF SGVVFVLSFI ITNIVIITAF SKKVITPLGK LQAAAGKISE GNLDFEIIED GDTQIRELCR SFEKMRLKLV EANYTQKKYD ESRKMLFSSI SHDLKTPITS IKGYVEGILD GVANTPQKVE KYLRTVHSKA VHMDRMIDDL LLYSRLDMHK VSFNFEKTDV LKYFEDCMYE IDIELEKSNI KVELHNNLRG KRYIMIDRDQ VRRVVINIID NSRKYMDKEQ GKIDIFLREA TSNVVIEIKD NGAGISESDL PYIFDRFYRA DSARDTRKGS GLGLAIAKQI IEGHGGKIWA VSRMGEGTSV MISLKKYEG
|
| |