Gene Moth_0912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0912 
Symbol 
ID3831300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp946723 
End bp948561 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content58% 
IMG OID637828843 
Productserine/threonine protein kinase 
Protein accessionYP_429772 
Protein GI83589763 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[S] Function unknown
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2815] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0411502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGCA AAGTCCTCGA AGGTCGTTAT GAAATAGTCA GCGAACTCGG GGGAGGCGGC 
ATGGCCAGGG TGTACCGGGG CCAGGATCGC CTGTTGAACC GGAACGTAAC TATTAAGATT
TTGCGGGAAC AGTATGCCAG CGATAAAGAG TTTTTAGCCC GTTTTCAGCG GGAGGCCCAG
GCCGTAGCCA GTCTCTCCCA TCCCAACGTG GTCAGTATTT ACGACGTTGG CCAGGAAGAT
GATCTTCATT ACTTGATTAT GGAATATGTC GAGGGCAGGT CGCTGAAGGA CCTTATTTCC
GAGCGAGCCC CACTGCCGCC CCTGGAAGCC ATCGATATTT CCCTGCAGAT CTGTGACGCC
CTTGAGCATG CCCATGAAAA CGGTGTTATC CATCGTGATA TCAAACCCCA CAATATCCTT
ATTACCCGTA ACGGCAGGGT TAAGGTGACG GATTTCGGCA TTGCCCAGGC TGTCAGCGAG
GTTACCATGT CCCAGAGTGG AACCATGATT GGCTCCGTTC ATTACCTGGC TCCCGAACAG
GCCCGGGGCG GGGTTATTGG GGCCACGGCC GATATCTATT CCCTGGGCAT CGTCCTCTAC
GAGATGTTGA CCGGCGACCT CCCATTTCAC GGCGAAACAC CGGTAGCCGT AGCCCTCAAG
CACCTTCAGG AAAACCCCCG GCCTGTGCGC GAATTAAATC CCAATGTACC GCCGGCCCTG
GAACGCGTCG TTATGCGAAC CCTGGAGAAA GACCCTGCCC GGCGCTACCC GTCGGCAGCG
GCCTTGCGTT CCGACCTGCT GGCCGTAAGA AACGCTCTGG CGGATGCCAC CTTCGCCACC
CAGGTTTTGC CGGCCATTGA GACTCCCGAT CCTCCTTCTA CCCTGCCCAA ACCCCGCCGG
CGGCCACGGG TCTGGGCGTG GGTGCTAATG GCCCTCTTGT TCCTAGGCCT GGCAGCTGCC
GGCCTGTGGG CCGGTTTCCG TTATTACCTG GCAGTAGGCG AGACCCTGGT ACCGTCGGTA
GTGGGCCTGC CCGAGGGCCA GGCCCTGGAG CAGCTGGCGG CGGCCGGATT GCGGGGTCAG
GTTATAGCCC GGCAGTATGA TGCCAGCGTT CCAGCCGGCC AGGTCATGGC CCAGGACCCC
GGCCCCAATC AAAGGGTGCG GCGCGGCCGG GTGGTAGCCC TGACCGTTAG CCAGGGAGCC
AGGTTAGTGA GGGTTCCCAG TGTTATCGGT GAAACGGAAC GCAATGCCCG CTTAATACTG
GAGAATGCTA ATCTCAAGGT AGCCGCCGAT ACTCTAAAAG TATATCACCC CTCTATTCCG
GCAGGTTCCG TTGTTGACCA GAATCCCCCG GCCAATACCC AGCAACCGGA AGGGACAGAA
GTCAGGCTGA TTATCAGCAA GGGCCCGGAA CCCCAGTTTA CCACCGCCCC GTCCGTGGTA
GGCCTTTCCC TGGCCGAGGC CCAGCAGAAA CTCCTGGAGG CTAAACTGAA ACAGGGCACC
CTGACCTATC AGCGGAGCGA TAATCAATTC CCGGGATATA TTATTGCCCA GGACCCCCGG
GAGGGGAGCA ATGTTTTGCA GGGAAGCGCC ATAAATTTGG TTGTCAGCCA GGGACCGGGC
CCGGTCCAGA AACAGGTGGG GGTAACCATT GACCCGGCCC CTGATGATAA AGACCATGAG
GTGCGGATTG TAGTTACCGA TGCCAAGGGT ACTAATGAAG TGCTAAAGAA GAAGCAAAAG
ATGGGCCAGC AAATCCAGGC CACCATCAAC TATTTCGGCA AAGGTAAGTT GCAGGTTTTC
CGTGACGGCA ACGTTATTTA TGAACAGGAC TTGCAGTAG
 
Protein sequence
MIGKVLEGRY EIVSELGGGG MARVYRGQDR LLNRNVTIKI LREQYASDKE FLARFQREAQ 
AVASLSHPNV VSIYDVGQED DLHYLIMEYV EGRSLKDLIS ERAPLPPLEA IDISLQICDA
LEHAHENGVI HRDIKPHNIL ITRNGRVKVT DFGIAQAVSE VTMSQSGTMI GSVHYLAPEQ
ARGGVIGATA DIYSLGIVLY EMLTGDLPFH GETPVAVALK HLQENPRPVR ELNPNVPPAL
ERVVMRTLEK DPARRYPSAA ALRSDLLAVR NALADATFAT QVLPAIETPD PPSTLPKPRR
RPRVWAWVLM ALLFLGLAAA GLWAGFRYYL AVGETLVPSV VGLPEGQALE QLAAAGLRGQ
VIARQYDASV PAGQVMAQDP GPNQRVRRGR VVALTVSQGA RLVRVPSVIG ETERNARLIL
ENANLKVAAD TLKVYHPSIP AGSVVDQNPP ANTQQPEGTE VRLIISKGPE PQFTTAPSVV
GLSLAEAQQK LLEAKLKQGT LTYQRSDNQF PGYIIAQDPR EGSNVLQGSA INLVVSQGPG
PVQKQVGVTI DPAPDDKDHE VRIVVTDAKG TNEVLKKKQK MGQQIQATIN YFGKGKLQVF
RDGNVIYEQD LQ