Gene Information |
Locus tag | Cthe_1393 |
Symbol | |
ID | 4809054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1702079 |
End bp | 1703848 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640106817 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_001037818 |
Protein GI | 125973908 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
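The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in the record, though both are consistent with the listed 1770 bp and 589 aa. The function names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1702079
end_bp = 1703848


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1770
print(protein_length(length_bp))  # 589
```

Both results match the Gene Length and Protein Length rows, which suggests the record's fields are internally consistent under these assumptions.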
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000411549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAATTTTTAA ATATTATTTG CTTTTGATTC TTATAATCCT ATCTGTGACA GTCATATTTA TCCCAAAGGT CTCAAGAAAG TTCTACACTC AAGAGGTGGA AAACAAACTT GAGGGAATAG CTTTTTCAAT TGAGTACTAT TTGTTGAATG AAGCAAAGAA CGGCGAAATT GATTTTGACT TTATCGCAAA GGATTATGCC GCAAAGTACA ACCAAAATTC CACCTTTCAG GGTGAGAGTC TAAGAATTAC CTTTATCAGT TATGACGGAA AAGTGTTGGG TGATTCGGAT GCAAATTTTA ATCAAATGGA AAATCACTTG AGCAGAAAAG AGATTCAGGA TGCACTCAAA GGGAATGTCG GTAAAGACAT CCGCAGCAGT AAGACTTTGA AGTTGGATTT GCTTTATATG GCAATTCCTG TGGAGGAATT GAATGTAATT GCCAGAGTTT CTGTTCCCCT TGTTCAAATA AAAAAAATTA ACAGATTGAT TTGGCTCTAT TCAATTTTAA TTTTTATTAT GGCACTTATA ATAACGGTAA TTGTGTCTTT GAGGATAGCA GGACTTGTAA TTCGTCCGTT GAATGATATT ATTTCGGTAT CAAAGGAAAT AACCAACGGC AACTATTCCA GGAGAATCAA GTTAAAATCA AAAGATGAGC TGGGACAGCT TGCCGTCCAT TTTAACAAAA TGGCTTCAAA GCTTGAAAGA ACCATATCTG ATTTGAATAC AAAGAAAATT GAGCTTGAAT CTATTGTGGA GAGTATAACA AACGGAATTG TTGCAGTGGA CGGCAATAAC AAGGTTATAC TGATAAATCC TGCTGCTTTC ACTGTTTTTA ATTTGGATGC CGATGCCGAA ATTTTGGGAG ATGATATTGA AAACCATATT AAAAACAGTC AGATAAATTC TCTTTTAAAA GATGCAATAC AGAAAAATAA GCCGTTGGAG GCTGAAGTTG CAATTGACGG CCGGGTGCTT TTGGTAAACG CTTCACCTAT AAGACCCAAA GACAGCGATA TTGATAATTC GGGAGGAATT GTATTTATTC AGGACATAAC AAAGGTAAGA AAGCTGGAGC AGATTCGAAC TGAATTTGTT TCCAATGTCA CCCATGAACT AAAAACACCG ATAACACCGA TCCGAGGATT CATTGAGACA TTGAAAAACG GTGCTATGAA TAATCCTGTT GTGGCGGAAA GATTTTTGGA AATCATTGAT ATCGAAGCTG AGCGTCTTCA CGAATTGATT AACGACATTT TGCTGCTGTC GGAAATTGAG ACAAAGTTAA AAGATACCAA CTTGGAAATC TTCGATTTAA AATCCATGGT GGATGATGTT TTTAAAGTTA TGCAAAACAT TGCAAAGGAG AAGAAGATCA GCCTGAACAA CAATGTGCGG GATGAAGTGT TGATGAAAGC AAACATAAAC CGCATGAAAC AGTTGATAAT GAATTTAGTG GACAACGGAA TAAAATATAA TGTTCAAAAC GGCTCGGTAT CGGTTGACGG GTACAGGGAG GACGGAAAGG TTGTCATTTC CGTAAAAGAT ACGGGAATAG GAATTCCTTC GGCCCATATA CCGAGAATTT TTGAGCGCTT TTACAGGGTG GACAAGGGAA GGTCAAGGGG AATGGGAGGT ACCGGTCTGG GTCTTTCAAT AGTAAAGCAC ATTGTAAACC TTTATAACGG AGAAATAAAG GTAAATTCTG TTGTGGGGGA AGGCACCGAA TTTATTGTAA AAATTCCGTG CCAACCGTAA
Protein sequence | MKKKIFKYYL LLILIILSVT VIFIPKVSRK FYTQEVENKL EGIAFSIEYY LLNEAKNGEI DFDFIAKDYA AKYNQNSTFQ GESLRITFIS YDGKVLGDSD ANFNQMENHL SRKEIQDALK GNVGKDIRSS KTLKLDLLYM AIPVEELNVI ARVSVPLVQI KKINRLIWLY SILIFIMALI ITVIVSLRIA GLVIRPLNDI ISVSKEITNG NYSRRIKLKS KDELGQLAVH FNKMASKLER TISDLNTKKI ELESIVESIT NGIVAVDGNN KVILINPAAF TVFNLDADAE ILGDDIENHI KNSQINSLLK DAIQKNKPLE AEVAIDGRVL LVNASPIRPK DSDIDNSGGI VFIQDITKVR KLEQIRTEFV SNVTHELKTP ITPIRGFIET LKNGAMNNPV VAERFLEIID IEAERLHELI NDILLLSEIE TKLKDTNLEI FDLKSMVDDV FKVMQNIAKE KKISLNNNVR DEVLMKANIN RMKQLIMNLV DNGIKYNVQN GSVSVDGYRE DGKVVISVKD TGIGIPSAHI PRIFERFYRV DKGRSRGMGG TGLGLSIVKH IVNLYNGEIK VNSVVGEGTE FIVKIPCQP
| |
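The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the Gene sequence value might be normalized and checked against the Gene Length and GC content fields. The helper names are hypothetical, and the example runs on a short toy string rather than the full record sequence.

```python
# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Toy input, not taken from the record.
toy = clean_sequence("ATGGCC GCATAA")
print(len(toy))             # 12
print(gc_percent(toy))      # 50.0
print(ends_with_stop(toy))  # True
```

Passing the full Gene sequence value through `clean_sequence` gives a string whose length and GC percentage can be compared with the 1770 bp and 37% listed in the record.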