Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0828 |
Symbol | |
ID | 3831525 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 858289 |
End bp | 860103 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637828758 |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_429688 |
Protein GI | 83589679 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG5002] Signal transduction histidine kinase |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000197924 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.403449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAGCA GGAGTATTAC CGCCAAGCTC TGGTTGCTCC TGGTTTCCCT GGTTGTCGTC AGCCTCCTGG TCATTGGCAT ATCCCTGCAC GGCCTTCTGG GGAATTACTA CTACCGGCAG CAAGCCCAGA CCATGCTGGA TAAAGGGGAA CTGCTGGCCC GGAGCCTGGC GACCGGTAGC GGCGGTGACC TGGCCGGCCA GGCCGACCTC CTGGGCCGGA TGGCCGGTAC CGGAGTGATG ATCATCGACC GCCAGGGCCT GGTCCTTTCC TGCAGCAGCG AAGCCGGGCC GGGTGACGGG TCCGGTATGG GGATGATGGG CCGGGGCCGC GGTTACGGCC GCATGATGCA CGGTAACTTT CCAGTAACCG GTATGCACCT GGAGGGTGCC GAGGTCCAGC AGGTCCTGGC CGGGAATACC GTCGTGAAAC GGGGTTACCA GCAGGCCTTT AATACCAGTA TGCTGACGGT GGCGGTCCCT ATCAAGACCG GCAATGAAGT TAACGGGGCA GTTATCCTTT TTGCCCCGGA GGCTTCGTTA AGCGCCGCCA TGGGCGCCAT GAGTCGACTG ATCCTTTATG CCGGCCTTGT GGCCGTACTC CTGGCGACCA TCCTGGCCCT CTTTGCCGCC CGCAGGGTGA CCAGGCCTCT GAAAAGCTTA AGCCTGGCGG CCCGGCAAAT GGCCAGGGGT GATTTCAGCG TCCGGGTGCC GGTAGCTTCA GCGGACGAAC TGGGCCAGCT GGCCGGGAGC TTTAATTTTC TGGCCGGGGA ATTGTCCCGG ACGGTGGCCG CCCTCTCCCG GGAGAAGGAA AAGCTGAATA GGGTGGTCCG GGATATGACT GACGGGGTTC TCGCCTTTAC AGCCAGTGGC CGGGTCCTCT TTGCCAACCC TCGGGCGGAA AAGCTCCTGG GTTTACCCTT GTCACCGGGG GCGGAACTGC CGGTCGAACT CCTGGACCCC TTACGGGCGG CAGTGGCCGG GGAGGGAACT ACCGGCGAAA TAAACTGGCA GGAGCGGGTG CTGGCAGTCC GGGCCGCTCC CCTGCAGGAA GAGGACCCCT GCGGGGAGGC AGCGGTGGCC ATACTCCAGG ATATAACCAC CCAGAAAAAG ATGGAGCAGA TGCGCCGGGA GTTCCTGGCC AGCGTCTCCC ATGAATTGCG CACGCCCCTG AGTTTTATCC AGGGTTACGG CGAAGCCCTG GCCGACGGCC TGGCGACGGG TGAGAAGGAA CGCCAGGAGT ATACCGGCAT TATCCTGGCC GAGGCCAACC GCCTGCGCCG CCTGGTAGAT GACCTTTTTG ACCTCAACAA GATGGCTGCC GGGCACCTGC CCCTGGAACT GGCCGAGGTA GACCCCGGGG AACTGGTAAC TGGAGTGGCC AGGAAGTACC AGCCCTTGCT GGCGGAACAC GGTTTAGTCC TGGAGGTGGA GCTCGAGCCC TACCTGCCGC CGGTATGGGC TGACGCCGGG CGTCTGGAGC AGGTCCTGGT CAACCTCCTG GATAACGCCC GGCGACATAC GTCTCCCGGG GGTCGGATTA CCATCAGCGC CGGCCTGGCC GGTAGGGAGT TAAAGATAAG CGTAGCCGAC ACCGGCAAGG GCATCCCGGC AGGGGAACTG CCCTATATCT GGGAGCGATT TTACAAGGTG GACAAATCCC GATCCCGGGG CGATAGTGGC AGCGGCCTGG GCCTGGCCAT CGTTAAGGGC CTGGTAGAAG CCCATGGCGG CCGGGTCGAA GTAGTAAGTG AACCGGGTCG GGGCAGCATC TTTAGCTTTT ATCTGCCGTT GCATATTGAC AGCGAAAATG GATAA
|
Protein sequence | MISRSITAKL WLLLVSLVVV SLLVIGISLH GLLGNYYYRQ QAQTMLDKGE LLARSLATGS GGDLAGQADL LGRMAGTGVM IIDRQGLVLS CSSEAGPGDG SGMGMMGRGR GYGRMMHGNF PVTGMHLEGA EVQQVLAGNT VVKRGYQQAF NTSMLTVAVP IKTGNEVNGA VILFAPEASL SAAMGAMSRL ILYAGLVAVL LATILALFAA RRVTRPLKSL SLAARQMARG DFSVRVPVAS ADELGQLAGS FNFLAGELSR TVAALSREKE KLNRVVRDMT DGVLAFTASG RVLFANPRAE KLLGLPLSPG AELPVELLDP LRAAVAGEGT TGEINWQERV LAVRAAPLQE EDPCGEAAVA ILQDITTQKK MEQMRREFLA SVSHELRTPL SFIQGYGEAL ADGLATGEKE RQEYTGIILA EANRLRRLVD DLFDLNKMAA GHLPLELAEV DPGELVTGVA RKYQPLLAEH GLVLEVELEP YLPPVWADAG RLEQVLVNLL DNARRHTSPG GRITISAGLA GRELKISVAD TGKGIPAGEL PYIWERFYKV DKSRSRGDSG SGLGLAIVKG LVEAHGGRVE VVSEPGRGSI FSFYLPLHID SENG
|
| |