Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0912 |
Symbol | |
ID | 3831300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 946723 |
End bp | 948561 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637828843 |
Product | serine/threonine protein kinase |
Protein accession | YP_429772 |
Protein GI | 83589763 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [S] Function unknown [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase [COG2815] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0411502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTGGCA AAGTCCTCGA AGGTCGTTAT GAAATAGTCA GCGAACTCGG GGGAGGCGGC ATGGCCAGGG TGTACCGGGG CCAGGATCGC CTGTTGAACC GGAACGTAAC TATTAAGATT TTGCGGGAAC AGTATGCCAG CGATAAAGAG TTTTTAGCCC GTTTTCAGCG GGAGGCCCAG GCCGTAGCCA GTCTCTCCCA TCCCAACGTG GTCAGTATTT ACGACGTTGG CCAGGAAGAT GATCTTCATT ACTTGATTAT GGAATATGTC GAGGGCAGGT CGCTGAAGGA CCTTATTTCC GAGCGAGCCC CACTGCCGCC CCTGGAAGCC ATCGATATTT CCCTGCAGAT CTGTGACGCC CTTGAGCATG CCCATGAAAA CGGTGTTATC CATCGTGATA TCAAACCCCA CAATATCCTT ATTACCCGTA ACGGCAGGGT TAAGGTGACG GATTTCGGCA TTGCCCAGGC TGTCAGCGAG GTTACCATGT CCCAGAGTGG AACCATGATT GGCTCCGTTC ATTACCTGGC TCCCGAACAG GCCCGGGGCG GGGTTATTGG GGCCACGGCC GATATCTATT CCCTGGGCAT CGTCCTCTAC GAGATGTTGA CCGGCGACCT CCCATTTCAC GGCGAAACAC CGGTAGCCGT AGCCCTCAAG CACCTTCAGG AAAACCCCCG GCCTGTGCGC GAATTAAATC CCAATGTACC GCCGGCCCTG GAACGCGTCG TTATGCGAAC CCTGGAGAAA GACCCTGCCC GGCGCTACCC GTCGGCAGCG GCCTTGCGTT CCGACCTGCT GGCCGTAAGA AACGCTCTGG CGGATGCCAC CTTCGCCACC CAGGTTTTGC CGGCCATTGA GACTCCCGAT CCTCCTTCTA CCCTGCCCAA ACCCCGCCGG CGGCCACGGG TCTGGGCGTG GGTGCTAATG GCCCTCTTGT TCCTAGGCCT GGCAGCTGCC GGCCTGTGGG CCGGTTTCCG TTATTACCTG GCAGTAGGCG AGACCCTGGT ACCGTCGGTA GTGGGCCTGC CCGAGGGCCA GGCCCTGGAG CAGCTGGCGG CGGCCGGATT GCGGGGTCAG GTTATAGCCC GGCAGTATGA TGCCAGCGTT CCAGCCGGCC AGGTCATGGC CCAGGACCCC GGCCCCAATC AAAGGGTGCG GCGCGGCCGG GTGGTAGCCC TGACCGTTAG CCAGGGAGCC AGGTTAGTGA GGGTTCCCAG TGTTATCGGT GAAACGGAAC GCAATGCCCG CTTAATACTG GAGAATGCTA ATCTCAAGGT AGCCGCCGAT ACTCTAAAAG TATATCACCC CTCTATTCCG GCAGGTTCCG TTGTTGACCA GAATCCCCCG GCCAATACCC AGCAACCGGA AGGGACAGAA GTCAGGCTGA TTATCAGCAA GGGCCCGGAA CCCCAGTTTA CCACCGCCCC GTCCGTGGTA GGCCTTTCCC TGGCCGAGGC CCAGCAGAAA CTCCTGGAGG CTAAACTGAA ACAGGGCACC CTGACCTATC AGCGGAGCGA TAATCAATTC CCGGGATATA TTATTGCCCA GGACCCCCGG GAGGGGAGCA ATGTTTTGCA GGGAAGCGCC ATAAATTTGG TTGTCAGCCA GGGACCGGGC CCGGTCCAGA AACAGGTGGG GGTAACCATT GACCCGGCCC CTGATGATAA AGACCATGAG GTGCGGATTG TAGTTACCGA TGCCAAGGGT ACTAATGAAG TGCTAAAGAA GAAGCAAAAG ATGGGCCAGC AAATCCAGGC CACCATCAAC TATTTCGGCA AAGGTAAGTT GCAGGTTTTC CGTGACGGCA ACGTTATTTA TGAACAGGAC TTGCAGTAG
|
Protein sequence | MIGKVLEGRY EIVSELGGGG MARVYRGQDR LLNRNVTIKI LREQYASDKE FLARFQREAQ AVASLSHPNV VSIYDVGQED DLHYLIMEYV EGRSLKDLIS ERAPLPPLEA IDISLQICDA LEHAHENGVI HRDIKPHNIL ITRNGRVKVT DFGIAQAVSE VTMSQSGTMI GSVHYLAPEQ ARGGVIGATA DIYSLGIVLY EMLTGDLPFH GETPVAVALK HLQENPRPVR ELNPNVPPAL ERVVMRTLEK DPARRYPSAA ALRSDLLAVR NALADATFAT QVLPAIETPD PPSTLPKPRR RPRVWAWVLM ALLFLGLAAA GLWAGFRYYL AVGETLVPSV VGLPEGQALE QLAAAGLRGQ VIARQYDASV PAGQVMAQDP GPNQRVRRGR VVALTVSQGA RLVRVPSVIG ETERNARLIL ENANLKVAAD TLKVYHPSIP AGSVVDQNPP ANTQQPEGTE VRLIISKGPE PQFTTAPSVV GLSLAEAQQK LLEAKLKQGT LTYQRSDNQF PGYIIAQDPR EGSNVLQGSA INLVVSQGPG PVQKQVGVTI DPAPDDKDHE VRIVVTDAKG TNEVLKKKQK MGQQIQATIN YFGKGKLQVF RDGNVIYEQD LQ
|
| |