Gene Moth_0887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0887 
Symbol 
ID3831428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp919289 
End bp922087 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content65% 
IMG OID637828817 
ProductATPase, E1-E2 type 
Protein accessionYP_429747 
Protein GI83589738 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0474] Cation transport ATPase 
TIGRFAM ID[TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC
[TIGR01517] plasma-membrane calcium-translocating P-type ATPase
[TIGR01523] potassium and/or sodium efflux P-type ATPase, fungal-type 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.377259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCAC GACCATGGTA CCAGTTGAGT CCTGAAGAAG CCTTAATGGT CCTGGGGGTC 
GACGCCGGCC GGGGTCTGGC GACAATAGAG GCCCGGCGGC GCCTGGAAGA AAAGGGGCCC
AACCAGCTCC AGGCCCGGCC GGGGGTCCCG CCGTACAGGT TATTTCTGTC CCAATTTCAG
GACCTCATGG TCCTGGTCCT CCTGGCGGCC ACGGCCGTAT CAGCCTTCCT GGGGGAAGTG
GCCGATGCCA TTACCATCGT CGCCATCGTC ATCATCAACG CCATCCTGGG CTTTATCCAG
GAATACCGGG CCGAACGTTC CCTGGAGGCT TTGAAGGAGA TGGCGGCGCC CGAGGCCAGG
GTGCGGCGGG ACGGCGAGAT CCGCCGGGTA CCGGCGCGGG AGATAGTGCC CGGCGACATC
CTTCTCCTGG AAAGCGGCGA CCGGGTGGCC GCCGACGCCC TGGTCCTCAC CGGCAGCAAC
CTCCAGGCCG ATGAAGCGGC CCTGACGGGC GAGTCGACCC CGGTGCCCAA GGCGCCCGGG
GCCCTGGCGG GCGAGGTGGC CCTGGGCGAC CGGCGTAATA TGGTGCACCA GGGAACGGTG
ATTACCGGCG GCCGGGGCGT GGCTGTGGTG GTGGCTACCG GCATGGCTAC GGAGTTCGGC
AAGATCGCCG GCCTGCTCCA GGAAGTGGAG GCCGAGGAGA CCCCCCTGCA AAAACGCCTG
GCCGCCCTGG GCCGCTGGCT GGTCCTGGCC TGTCTGGCCA TCTGCATAGC TGTGGTGGTT
GCCGGTACCT TACGCGGGGA AGACCTCTAC GGCATGTTCC TGGCGGGCGT CAGCCTGGCG
GTGGCAGCCA TCCCCGAGGG CCTGCCGGCC ATCGTCACCG TCTGCCTGGC CCTGGGGGTG
CAGAAAATGG TACAGCGCCA GGCTATTATC CGCAAACTGC CGGCTGTAGA AACTCTGGGT
TGTGCCACGG TCATCTGCTC GGACAAAACC GGCACCCTGA CCCGGAACCA GATGACGGTG
CGTCGGGTCT GGGTGGGAGG CCGCAGCCTC CAGGTAAGCG GGACGGGCTA TAACCCCCGG
GGGGAATACC AGGAGAAGGG CCGGCGGGCA GCCGTCACCG GGGACCTGAA GATGCTTCTG
ACCATAGCCG CCCAGTGCAA CAACGCCCAG CTGCAAAAGG CAGGCCTTAC CATCGGCGGC
TGGCTGCGCG GGGGCGGGGG TAAAAGGGAT AAAGGTGAGA AGAAGGGCGG CGGTATTTTC
GGTAAGCTTT TCGACGGCCG CGACGGCGGC GAATGGACCA TCAGCGGCGA CCCAACGGAA
GGCGCCCTGC TGGTGGCGGC CGCCAAGGGC GGTCTCTGGC GGGAACGGCT GGAACGGGAG
GAACCCCGGC TAGCCGAGAT CCCCTTTGAT TCCGACCGCA AGCGTATGAG CGTCATCTGC
CGGGTGGGTA AAGGCCTCAG GGCCTATGTC AAGGGCGCCC CGGACGTCAT CTTCGACCTC
TGCGACACCA TCCTCCTGGA CGGGCAGGTC GTGCCCCTGG ATGCCGCCCG GCGCCGGGAG
ATCCAGGAGG AGAATGAAGC CATGGCCAGC AGGGCCCTGC GGGTCCTGGC CGTGGCCTAC
CGCGACCTGG AGCCCGGGAC GGACCTTCAG GCGGCAGCAG TGGAGAAAAA CCTGGTCCTG
GTGGGGCTCA TAGGTATGAT CGACCCGCCC CGGACCGAGG CGGCGGCGGC GATCCAGGTC
TGCCGCCAGG CGGGAATCAA GGTGGTCATG ATTACCGGCG ACCATCAGGT CACCGCCCGG
GCCGTGGCCC GGGAACTGGG ACTGCCGGCA GGGGAGGGTG AGGTCTTAAA CGGCCAGCAG
CTGGAAGCCA TGGACGATGC CGACCTCGCC CGGATGGCGC CGGGGGTCAA TGTCTACGCC
CGGGTGGCGC CCCACCATAA GCTGCGCCTG GTGCGGGCCC TGAAGGCCAG CGGCCATATC
GTGGCCATGA CCGGGGACGG CGTTAACGAC GCTCCGGCCA TTAAAGAGGC CGACATCGGC
ATCGCCATGG GGCAGAGCGG TACCGATGTA ACCCGGGAGG CCGCCGCCAT GATCCTGGCC
GACGATAATT TCGCCACCAT CGTGGCCGCC ATCGAAGAGG GACGTGGTAT CTATGACAAC
GTCCGTAAAT TTATCCGTTA CCTGCTCTCG TGTAATATCG GCGAGGTAAT GACCATGTTC
GTGGCCGTCA TCAGCGGCCT GCCCCTGCCC CTGCTGCCCA TCCAGATCCT CTGGATGAAC
CTGGTGACCG ACGGCCTGCC GGCCATGGCC CTGGGGATAG ACAATAAAGA ACCGGGCCTC
ATGAAACGGC CGCCCCACCC GCCGGGGGAA AGCGTCTTTG CCCGCGGCCT GGGTAAGGCC
ATGGCCTTCC TGGGTCTGCA GATAGGCCTG GCTACCCTGG GTGTCTTTAT CCTGGGCTTA
TACCTGGGCG ACGGCGATCT CATCACCGCC CGCACCCTGG CCTTTACCAC CCTGGTCATG
GCCCAGCTCT TCGCCGTCTT TGAGTGCCGC TCGGAACACC TGTCCCCCTT TGCGGTAGGT
TACTTCTCCA ACCCCTACCT GGTAATGGCG GTGGCCGCTT CTCTGGCCAT GCAGCTTCTG
GTGCTCTACC TGCCGCCCCT CCAGGTTGTT TTTAAAACAG TTCCTTTGAA TCTTTTCCAC
TGGGGCGTTA TCTTGCTGGC CGCCGGCTGG CGCACCCTCC TGGGGGGTGT TAATTACTAC
CTGGTGGCCC AGGTCCGGCG CCTGGTTTGG GAGCGTTAA
 
Protein sequence
MQSRPWYQLS PEEALMVLGV DAGRGLATIE ARRRLEEKGP NQLQARPGVP PYRLFLSQFQ 
DLMVLVLLAA TAVSAFLGEV ADAITIVAIV IINAILGFIQ EYRAERSLEA LKEMAAPEAR
VRRDGEIRRV PAREIVPGDI LLLESGDRVA ADALVLTGSN LQADEAALTG ESTPVPKAPG
ALAGEVALGD RRNMVHQGTV ITGGRGVAVV VATGMATEFG KIAGLLQEVE AEETPLQKRL
AALGRWLVLA CLAICIAVVV AGTLRGEDLY GMFLAGVSLA VAAIPEGLPA IVTVCLALGV
QKMVQRQAII RKLPAVETLG CATVICSDKT GTLTRNQMTV RRVWVGGRSL QVSGTGYNPR
GEYQEKGRRA AVTGDLKMLL TIAAQCNNAQ LQKAGLTIGG WLRGGGGKRD KGEKKGGGIF
GKLFDGRDGG EWTISGDPTE GALLVAAAKG GLWRERLERE EPRLAEIPFD SDRKRMSVIC
RVGKGLRAYV KGAPDVIFDL CDTILLDGQV VPLDAARRRE IQEENEAMAS RALRVLAVAY
RDLEPGTDLQ AAAVEKNLVL VGLIGMIDPP RTEAAAAIQV CRQAGIKVVM ITGDHQVTAR
AVARELGLPA GEGEVLNGQQ LEAMDDADLA RMAPGVNVYA RVAPHHKLRL VRALKASGHI
VAMTGDGVND APAIKEADIG IAMGQSGTDV TREAAAMILA DDNFATIVAA IEEGRGIYDN
VRKFIRYLLS CNIGEVMTMF VAVISGLPLP LLPIQILWMN LVTDGLPAMA LGIDNKEPGL
MKRPPHPPGE SVFARGLGKA MAFLGLQIGL ATLGVFILGL YLGDGDLITA RTLAFTTLVM
AQLFAVFECR SEHLSPFAVG YFSNPYLVMA VAASLAMQLL VLYLPPLQVV FKTVPLNLFH
WGVILLAAGW RTLLGGVNYY LVAQVRRLVW ER