Gene Information |
Locus tag | Moth_2114 |
Symbol | |
ID | 3833265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2209107 |
End bp | 2210930 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637830039 |
Product | histidine kinase |
Protein accession | YP_430949 |
Protein GI | 83590940 |
COG category | [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG3275] Putative regulator of cell autolysis [COG5012] Predicted cobalamin binding protein |
TIGRFAM ID | [TIGR00640] methylmalonyl-CoA mutase C-terminal domain |
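The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Moth_2114.
length_bp = gene_length(2209107, 2210930)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values, the computed lengths should match the listed Gene Length and Protein Length fields. The strand does not affect either length; it only determines which end of the interval holds the start codon.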
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.156393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCCT TTTCCCTGGA AAATCTAATC CAGGCTGTAA TCGACGGCAA CGCTGTCAAG GTCCGGGAAG AAGTAAAAAG GGCCTTGGCA GCGGGGATCG AACCGGCCCG GATCATAACC GATGGCTTTG TGGCAGCCAT GGATGTAGTG GGGGAAAGGT TCGAACGCAA CGAGATTTAC GTAACTGACT TGATTATTAC CGCCCGGGCC ATGCATACCG GCCTCAAGGA ACTTAAGCCC CTGATGCTGG CCGGTAAGGT GCAACCGGTG GGGCGGGCCA TTGTCGGTAC TGTCCAGGGG GACATCCATG ATATCGGCAA GAATCTCCTC GCCATCATGC TCGAGGCATC GGGCTTTGAG GTTATCGACC TGGGCGTGAA CGTAGCCCCC AGCACCTTTG TGGAGGCGGT CATTAAGCAC CGTCCCGATG TCCTCTGCCT CTCGGCCCTC CTTTCTTCTA CCCGCAAGGG TATGGAGGAA ACTATTACTG CCCTGCGGGA GGCCGGCTGG CGGGATAAAG TAAAGGTTGT CGTAGGCGGC ACCCCCCTGA ATGAAAAGAT TGCCGCCAGG ATGGGAGCTG ACGGCTACGC CCCCGATGCC ACGGCGGCTA TACCTCTGAT TAAGAGCCTG ATCGGTGCCG ACCGCAAGCG CCGGGCCGTC CTGGCTCCGG CTACCCTGGA CCTGTTTTTC GGGGAGGGTT CCCTGGAAGA CCTGCAGCGT GCCTTTACCC GGATGACAGG CCTGCATCTG GTCATGGTGG ATGCCGCCGG CCGCCCATTG ACTTCCCTGG GCGGTTTCCT GGAGTGTTCC CGCCATTGCC ACCTGCTTAA GGAAAACCCG GCCAGAGCCC AAGATGTCAC CACCCTCCAG GGCAATTTTA AAGAAGCTTT TATTTATCGC TGTCATGCCG GGTTGGTGGA AATTTCCTAC CCTCTGGCCA ATGAGGATGG GACGGTGGGG GCCGTCCTCT GTGGCCACTG CCTCTTGAGA GGCGACCCTG ACCCGGCCGA TTTGAAGGCA GCCGTCCCAG TCTTATCCCT AACGGACCTG GAAGCTGTTT GCGGTCTTCT CTCCTTTGTA TCCGGCCAGA TCATGCAGCT CAACACCGTT TTACTGGTCA ATAAGGAACT GGAGGACCAG CAGGCGAGCT TCATCCACTT CCTTAAGCGG CAGCACCAGC TGGAACAGGC CCTGAAGGAC GCCGAACTCA AGGCCCTCCA ATCCCAGGTC AACCCCCACT TCCTCTTTAA TTCTTTAAAT ACCGTAGCCC GCCTGGCCCT CCTGGAAGGG GCGGCCAATA CGGAAAAAAT GGTCCGCGCC CTGGCCCGCC TCATGCGCTA CAGCCTCTAC CAGGTCAAGG GAACGGTTTC CCTGGCAGAA GAAATAGCTG CCGTGCGCGA CTACCTTTTT ATCCAGGAAA CGCGGTTTTC AGACCGGGTC CGGAGCCGGG TGAAAGTAGA AGAGGCCGCC ATGCAGGCCC GGCTGCCCTG CATGGTCCTG CAACCCCTGG TGGAGAATGC TATTATCCAT GGCCTGGAAC CCAAGGAGGA GGGAGGGGAA ATCACCGTAT CCGCCCGCCT GGTGGGCGAC CAGGTCCGGG TAGAAATCAA GGATGACGGG GTCGGAATAC CGCCGGAGGT GAAAAAGGCG ATCTTTGACC TGGAAGTCCG GCGGAGCGGT AAAGGCCAGG TAAGCGGCCT GGGGATAGTC AATGTCTACC GGCGCCTGCA GCACCATTTT GGTAGCAACT GCGCCCTGGA TGTAGCCAGT ATGCCGGGAA AAGGTACTTG CGTCCAGCTG 
ACTTTTCCTT ATACTGTGGA TTAG
|
Protein sequence | MASFSLENLI QAVIDGNAVK VREEVKRALA AGIEPARIIT DGFVAAMDVV GERFERNEIY VTDLIITARA MHTGLKELKP LMLAGKVQPV GRAIVGTVQG DIHDIGKNLL AIMLEASGFE VIDLGVNVAP STFVEAVIKH RPDVLCLSAL LSSTRKGMEE TITALREAGW RDKVKVVVGG TPLNEKIAAR MGADGYAPDA TAAIPLIKSL IGADRKRRAV LAPATLDLFF GEGSLEDLQR AFTRMTGLHL VMVDAAGRPL TSLGGFLECS RHCHLLKENP ARAQDVTTLQ GNFKEAFIYR CHAGLVEISY PLANEDGTVG AVLCGHCLLR GDPDPADLKA AVPVLSLTDL EAVCGLLSFV SGQIMQLNTV LLVNKELEDQ QASFIHFLKR QHQLEQALKD AELKALQSQV NPHFLFNSLN TVARLALLEG AANTEKMVRA LARLMRYSLY QVKGTVSLAE EIAAVRDYLF IQETRFSDRV RSRVKVEEAA MQARLPCMVL QPLVENAIIH GLEPKEEGGE ITVSARLVGD QVRVEIKDDG VGIPPEVKKA IFDLEVRRSG KGQVSGLGIV NVYRRLQHHF GSNCALDVAS MPGKGTCVQL TFPYTVD