Gene Moth_0510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0510 
Symbol 
ID3831812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp528115 
End bp530007 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content55% 
IMG OID637828444 
Productputative serine protein kinase, PrkA 
Protein accessionYP_429383 
Protein GI83589374 
COG category[T] Signal transduction mechanisms 
COG ID[COG2766] Putative Ser protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGATAT TGCAACGGTT GGAGAAATAC CGCCAGGAAG CCAGGAAGCT TCACTGGGAA 
GGGACTTTTG CTGAGTATTT AAAAATGGTG ATAGCCAATC CCAAACTGGC GCGGCTCGCC
CATGCCCGTA TATATGACAT GATTGCCGGT TACGGGGTGG AGGAGATCGA TGGGGTTAAG
CACTACCGGT TCTTTGAAGG GGAGATCTTT GGGCTGGAGC GAACCCTGGA AAAACTGGTG
GAGGAGTATT TCCATTCCGC AGCCAGGCGC CTGGACGTAC GCAAGCGGAT CCTGCTCCTT
ATGGGTCCGG TCAGCGGGGG TAAGTCGACC ATTGTAACTA TGTTAAAACG GGGCCTGGAG
AAATACAGCC AGACCGACGC CGGCGCCCTC TATGCCATTA AGGGCTGCCC CATGCATGAA
GAGCCCCTGC ACCTGATTCC CCGAGAACTG CGGCCCGAGT TACAAAAGGA ATATGGTATT
TATATCGAAG GCAACCTGTG TCCCGTTTGC CAGTTAATGG TTGAAGAGAA ATATGAAGGC
CGGGTAGAAG AGGTCCCGGT GGAACGCATC ATCCTTTCCG AGGAAAAGCG GGTCGGTATC
GGCACCTTCA GTCCCTCGGA CCCTAAATCC CAGGATATTG CCGAACTCAC CGGTAGTATC
GACTTCTCAA CCATCGCCGA GTACGGTTCC GAGTCCGACC CCCGGGCCTA CCGTTTTGAC
GGGGAACTCA ATAAGGCCAA CCGGGGAATG ATGGAGTTCC AGGAAATGCT GAAATGCGAT
GAAAAATTCC TTTATAACCT CCTCAGCCTT TCCCAGGAGG GCAATTTCAA GGCCGGCCGC
TTTGCCTTGA TCTCAGCCGA CGAGATGATC ATTGCCCACA CCAACGAAGC GGAGTACCGG
GCCTTTATCA GCAACCCCAA GAACGAGGCC CTGCAGTCCC GCATCATGGT CATTCCCATC
CCCTATAACC TGAAGGTCAG GGAAGAGGTC AAGATCTACC AGAAGCTCAT TCGCCAGAGT
GATATCGACG TTCATATCGC TCCCTATGCC CTCCAGGCGG CCGCCATCTT CTCCATCCTC
TCGCGCCTGA AGGAATCCAA GAAGCAGGGC ATGGACCTGT TAAAGAAGAT GAAACTCTAT
GACGGCGAAG ATGTGGAGGG CTTCAAGCAA AAGGACGTTC TGGAGCTGAT GAACGAGGCC
GAGGCCGAGG GTATGAGCGG CGTCGACCCC CGCTACGTCA TCAACCGTAT CTCCAGCGCC
CTCATTACCG CCGACACCCG CTGCATCAAC GCCCTGGACA TCCTGCGGGC TTTAAAGGAC
GGCCTGGACC AGCACCCCTC CATAACCAGG GAAGAGAAGG AGCGCCTCAT CAATTTTATC
GCCATGGCGC GCCAGGAATA CGACGAGTAC GCCAAAAAGG AAGTCCAGCG GGCCTTCGTC
TATTCCTACG AAGAATCCGC CCGGGCCCTC TTCAACAACT ACCTGGATAA CGTCGAGGCC
TTTGTCAATA AAACCAAGGT TCGCGACCCC ATCACCGACG AGGAACTGGA CCCCGATGAA
AAGCTCATGC GTTCCATTGA GGAGCAGATC GGCGTCACCG AGAATGCCAA AAAGTCCTTC
CGGGAGGAGA TTCTCATCCG CCTCTCGTCC TACGCCCGCA AGGGGAAGAC CTTTGATTTC
AATTCCCACG AGCGCCTGCG GGAGGCCATC GAGAAGAAGC TCTTCGCCGA CATGAAGGAT
ATCGTCAAGA TAACTACCTC TACCCGTACG CCGGATCCGG AGCAGCTGAA ACGCATTAAC
GCCGTCATCG ACCGCCTGAT TTCCCAGCAC GGTTATTGCC CGATCTGCGC CAACGAACTC
TTGAAATACA CCGGCAGCCT GTTGAACCGT TAA
 
Protein sequence
MEILQRLEKY RQEARKLHWE GTFAEYLKMV IANPKLARLA HARIYDMIAG YGVEEIDGVK 
HYRFFEGEIF GLERTLEKLV EEYFHSAARR LDVRKRILLL MGPVSGGKST IVTMLKRGLE
KYSQTDAGAL YAIKGCPMHE EPLHLIPREL RPELQKEYGI YIEGNLCPVC QLMVEEKYEG
RVEEVPVERI ILSEEKRVGI GTFSPSDPKS QDIAELTGSI DFSTIAEYGS ESDPRAYRFD
GELNKANRGM MEFQEMLKCD EKFLYNLLSL SQEGNFKAGR FALISADEMI IAHTNEAEYR
AFISNPKNEA LQSRIMVIPI PYNLKVREEV KIYQKLIRQS DIDVHIAPYA LQAAAIFSIL
SRLKESKKQG MDLLKKMKLY DGEDVEGFKQ KDVLELMNEA EAEGMSGVDP RYVINRISSA
LITADTRCIN ALDILRALKD GLDQHPSITR EEKERLINFI AMARQEYDEY AKKEVQRAFV
YSYEESARAL FNNYLDNVEA FVNKTKVRDP ITDEELDPDE KLMRSIEEQI GVTENAKKSF
REEILIRLSS YARKGKTFDF NSHERLREAI EKKLFADMKD IVKITTSTRT PDPEQLKRIN
AVIDRLISQH GYCPICANEL LKYTGSLLNR