Gene Moth_0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0940 
Symbol 
ID3832941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp973428 
End bp974621 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content56% 
IMG OID637828870 
Productacetate kinase 
Protein accessionYP_429799 
Protein GI83589790 
COG category[C] Energy production and conversion 
COG ID[COG0282] Acetate kinase 
TIGRFAM ID[TIGR00016] acetate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000914313 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATCC TGGTCTTAAA CTGCGGCAGC TCATCGGTCA AATATCAGCT TTTTGATATG 
GAAGACGAGA GTGTTCTGGC TAAAGGACTG GTGGAAAGGA TTGGTATCGA CGGTTCCGTC
CTGACCCATC GCCCGGCGGG CAAAGAAAAA CTGGTGCGTG AAACAGAAAT CCCCGATCAT
AAAGTGGCTA TCCGCCTGTG CCTGGAAGCC CTGACCGACC CCCATTACGG GGTTATCAAA
GACTACAGCG AAATCGGAGC CATCGGTCAT CGTATCGTCC ACGGTGGTAC TTTTCCCCAT
TCGGTCCTGG TAGATGCCTC CACTAAAAAG GCCATTAGTG AACTGGAGGT TCTGGCACCC
CTCCATAATG GCCCGGCCCT ACGGGGTATC GAGGCCTGTG AAGCCATCCT GCCCGGCACC
CCCCAGGTAA CGGCTTTTGA TACGGCCTTT CACCAGGGTA TGCCGGATTA CGCCTATACT
TACAGCCTGC CTTATGAACT CTGCCAGAAG CACCTCATTC GCCGCTACGG CGCTCACGGT
ACCTCCCACC AGTATGTTGC CCTGCGGGCG GCGGCCATAG TTGGTAAGCC CCTGGAGGAA
TTGAAGGTTA TTACCTGCCA CCTGGGTAAC GGCTCCAGTA TTACTGCTAT TAAAAACGCT
AAATCATACG ACACCAGCAT GGGCTTCACC CCCCTGGCAG GTTTAACCAT GGGTACCCGT
TGCGGTGATA TTGATCCGGC CATCGTACCC TTCCTGATGG AAAAAGAGGG CTATACCCCG
GCGGAGATGG ACCAGGTGAT GAACCGCCGG TCAGGGGTCT TGGGAGTCTC CGGCCTCAGC
AGCGACTTCC GGGATATTGA AGCCGCCATG GCTGAGGGTA ATGATCGCGC TCGCCTGGCC
TGGGAGGTTT TCGTCCATAG CGCCAAAAAA TATATTGGCG CTTACGCTGC CCTTTTGAAC
GGCCTGGATA TCTTGGTCTT TACAGCCGGC CTGGGGGAAA ACTCCATCGC CGCCCGGGAA
GCCATATGCC GGGACATGGA CTACCTGGGT ATAAAGATTG ACCCCGAGAA AAACCAGGTC
CGGGGCCAGG AAAGGGAGAT CACGGCCGCC GGAGCTAGGG TGCGCACCTT TGTTATCCCC
ACCAATGAAG AATTAATGAT TGCTCGCGAT ACCCTGGCCC TCGTCCAGGC TTGA
 
Protein sequence
MKILVLNCGS SSVKYQLFDM EDESVLAKGL VERIGIDGSV LTHRPAGKEK LVRETEIPDH 
KVAIRLCLEA LTDPHYGVIK DYSEIGAIGH RIVHGGTFPH SVLVDASTKK AISELEVLAP
LHNGPALRGI EACEAILPGT PQVTAFDTAF HQGMPDYAYT YSLPYELCQK HLIRRYGAHG
TSHQYVALRA AAIVGKPLEE LKVITCHLGN GSSITAIKNA KSYDTSMGFT PLAGLTMGTR
CGDIDPAIVP FLMEKEGYTP AEMDQVMNRR SGVLGVSGLS SDFRDIEAAM AEGNDRARLA
WEVFVHSAKK YIGAYAALLN GLDILVFTAG LGENSIAARE AICRDMDYLG IKIDPEKNQV
RGQEREITAA GARVRTFVIP TNEELMIARD TLALVQA