Gene Moth_1349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1349 
SymbolhppA 
ID3831907 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1394222 
End bp1396240 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content53% 
IMG OID637829285 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_430205 
Protein GI83590196 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.528352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCTTT TAGCACCACT GACAGGGATA GTGGCCTTGC TTTTTGCCTT TTACCTTACT 
AATAAAATTA ATCGGTCAGA TCCCGGGAAT CCACGGATGC AGGAAATCGC TGTCGCCATC
CATGAAGGCG CCATGGCCTT CCTTATGCGG GAATACCGAA CTTTAATCTT TTTTGTCCTT
GGCATGACGG CCTTGATTGT AGTTGCCGGC TTTATGACTA GGGGTGCTGA AAGCATGCAA
CCGGCAACAG CTATTGCCTA TGTCGCCGGT ACCCTTTGTT CTATTGGCGC CGGGTACATA
GGTATGCAGG TTGCCACTAG GGCCAACGTC CGCACGGCCA ATGCCGCCCG TCACAGTTCC
AATGCCGCCC TGGACATAGC CTTTTCCGGC GGTTCGGTTA TGGGGATGGC GGTGGTAGGT
CTGGGCCTTT TGGGTCTCGG GATTATTAAT TATGTTTTTA AAAACCCGAG TATAGTTAAT
GGTTTCGCCC TGGGGGCCAG TTCTATAGCC CTTTTTGCCA GAGTCGGTGG CGGTATATAC
ACCAAGGCCG CCGATGTCGG TGCGGACTTG GTCGGCAAAG TAGAAGCAGG TATTCCTGAA
GACGATCCCC GGAACCCAGC CGTTATTGCT GATAACGTTG GTGACAATGT AGGCGACGTC
GCCGGTATGG GTGCTGACCT CTTTGAGTCT TATGTCGGCT CGATTATTTC TGGTATTGCC
CTGGCGGCAG CCCTGAATAT TCCTAATGGC ACCTTGGTGC CTCTAATGAT TGCTGCTATC
GGCATTGTAT CCTCCATTCT CGGTGCCTTT TTCGTGAAAA CGGGCGAAGG GGCCAATGCC
CAGAAGGCTC TTAATACCGG TACCATGGTG GCAAGTATCC TGGCCATTGT TGGTACTTTC
CTGGCTACTA GGTTATTACC GGCTCACTTC ACCGCTGGCT CTATGAGTTA CACTTCTACA
GGCGTATTTG CAGCCACCAT CGCCGGCCTC ATCGCCGGAG TCTTGATTGG CCGGATTACC
GAGTATTATA CTTCAGGGGA TTACGAACCT GTAAAAGAGA TCGCCAAGGC TTCCCAAACC
GGTACAGCTA CCAACATTAT TGAAGGCTTA AGCACCGGTA TGCTGAGCAC AGTTTTGCCT
ATCCTGGTCA TCGTCATTGC CATTATCGCT TCTTACCGTT TCGCCGGCCT TTATGGTATT
GCTATGGCGG CGGTTGGCAT GCTCTCGACC ACCGGCACTA CTGTGGCCGT TGATGCTTAT
GGTCCTATCG CCGATAATGC TGGCGGTATT GCCGAAATGG CGGAACTGGA CCCAAAGGTC
CGTAAGATAA CCGACGCCCT GGACTCCGTC GGCAACACAA CGGCTGCCAT TGGTAAGGGT
TTTGCTATCG GTTCGGCGGC GCTGACGGCC CTGGCCTTAT TCTCGGCCTA TACAGCTGCT
GCCAGGATTA CCGCCATTGA CCTGACCGAC CCCAAAGTAG TCGGCGGTCT CTTCATAGGT
GGTATGCTGC CGTTTCTTTT TGCTGCCTTA ACTATGAAAG CGGTGGGTAG GGCTGCTTTC
CAAATGATTG AAGAAGTACG CCGCCAGTTC AAATCGATTC CCGGCTTAAT GGAAGGCAAG
GCCCGGCCGG ACTACGCCCG CTGCGTGGCT ATTAGCACCG GAGCCGCTAT TAAGGAAATG
ATTGTTCCCG GCCTACTCGC CGTTCTGGTA CCCCTGGCCG TGGGTCTCAT CCCCGGCCTG
GGTAAGGAGG CCTTGGGTGG CCTTCTCGCC GGCGCCACGG TGACGGGTTT CTTGATGGCC
GTCATGATGG CTAATGCCGG TGGTGCCTGG GATAATGCCA AAAAGTATAT TGAGGGCGGC
CAGTACGGCG GTAAGGGTTC ACCGGCCCAC GCTGCCGCCG TCAATGGAGA TACAGTGGGT
GATCCCTTTA AGGATACCTC TGGCCCGGCC ATGAACATTC TTATTAAGCT AATGACCATT
GTCTCCTTGG TTTTTGCCCC CTTATTTATG CAGCTTTAG
 
Protein sequence
MELLAPLTGI VALLFAFYLT NKINRSDPGN PRMQEIAVAI HEGAMAFLMR EYRTLIFFVL 
GMTALIVVAG FMTRGAESMQ PATAIAYVAG TLCSIGAGYI GMQVATRANV RTANAARHSS
NAALDIAFSG GSVMGMAVVG LGLLGLGIIN YVFKNPSIVN GFALGASSIA LFARVGGGIY
TKAADVGADL VGKVEAGIPE DDPRNPAVIA DNVGDNVGDV AGMGADLFES YVGSIISGIA
LAAALNIPNG TLVPLMIAAI GIVSSILGAF FVKTGEGANA QKALNTGTMV ASILAIVGTF
LATRLLPAHF TAGSMSYTST GVFAATIAGL IAGVLIGRIT EYYTSGDYEP VKEIAKASQT
GTATNIIEGL STGMLSTVLP ILVIVIAIIA SYRFAGLYGI AMAAVGMLST TGTTVAVDAY
GPIADNAGGI AEMAELDPKV RKITDALDSV GNTTAAIGKG FAIGSAALTA LALFSAYTAA
ARITAIDLTD PKVVGGLFIG GMLPFLFAAL TMKAVGRAAF QMIEEVRRQF KSIPGLMEGK
ARPDYARCVA ISTGAAIKEM IVPGLLAVLV PLAVGLIPGL GKEALGGLLA GATVTGFLMA
VMMANAGGAW DNAKKYIEGG QYGGKGSPAH AAAVNGDTVG DPFKDTSGPA MNILIKLMTI
VSLVFAPLFM QL