Gene Emin_0829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0829 
SymbolhppA 
ID6262584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp911480 
End bp913828 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content48% 
IMG OID642611307 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_001875721 
Protein GI187251239 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00399457 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTGGTATG GAATGAGCGG CCTGCACCTG TATGAACAAT ACGCTATATT CGGCGTTTTG 
GGAGTTGCGG TGTTAGGTCT TTTTTATGCC TTGTTTTTAA AAAGACAAGT TATGGCCCAC
CCTGCAGGGG ACGCGAAAAT GCAAGAGGTT TGGGGCGCTA TAAAAGAAGG CGCGAACGCT
TACTTAAACA AACAGTTCAA AGCGATTGTA CCTTTAATTG TTATTTTGAC AATATGTCTT
TTCCTTTCAG TATATATTGT CCCGGTGACT GCCGAAGCTA TGAAACGCTT TGAAGGTCTA
TCCCCCGAAA AAATAAAGTT AATTATCGCT TTCGGGCGCG CAGGTTCTTT TATTTTGGGA
TCAGTCTTTT CGCTTTTAGT GGGACAAATA GGTATGCGTA TAGCAGTTGC CGCCAACGTG
CGCGTGGCGG ACGCTTCGCG CAGATCTTTT GGCGAATCGC TTAAAATAGC TTACCGCGCG
GGTACCGTTA CAGGTATGCT TACCGACGGT TTAGGCTTAT TTGGCGGTAC GATAATATTT
GTTATTTTCG GTATTTCCGC GCCGGACGCT TTGTTGGGTT TCGGTTTTGG CGGTACTCTT
GTAGCTCTCT TTATGAGGGT AGGCGGCGGT ATTTATACCA AAGCCGCTGA CGTAGGCGCC
GACTTGGTGG GTAAAGTTGA AGCGGGCATT CCGGAAGACG ACCCCAGAAA CCCCGCCGTG
GTAGCCGACT TAGTAGGCGA TAACGTGGGC GACTGCGCGG GTATGGCGGC CGACATTTTT
GAATCCTACG AAGTTACCAT TGTGTCAGGC CTTATTTTAG GTCTTGCGCT TTGGCATATT
ACAGGCAACT ATGAATGGAT AGTTTACCCC CTTATCGTAC GCGGTATCGG CGTAATGTGC
TCTATCATAG GTACTTACGT TGTGCGTGAT CACGGCGGCA AAGGCGACGC GATGAGCGCC
ATTTTCAGAG GATATTTTAC TTCAGCCGTT ATTTCCATAA CGCTTTTTGC CGTATTAGCG
TATTTTTATA TGAGGGACAT TCCCGGCGGT TGGTGGAGAC CATGGGTAGC CGTTTCCGTC
GGTGTAATTT TAGCTATGGC CATTGACAGA CTTACCGAGC ACTTTACGGG TACGGAAGGC
GCGCCTGTTA AAGAAGTTAA ACGTTCCACG TCTACAGGTT CCGCCACAAC CATATTAAGC
GGTTTCGCCG TAGGTTTGGA ATCTTCGGTA TGGTCGGTAG TCGTTATCGC TATAACGATA
TTTATTTCAA TCGTTGTTTT CGGCTCAATT GAAGGTTTGA CGGGGGCCGC TAAATTTAAC
TTTATTCTTT ACGGCGTAGC TATGACCGGT ATCGGCATGC TCACCTTAAC GGGTAATAAC
GTGGCTATGG ACTCCTTTGG CCCCATTGCG GATAACGCTA ACGGCATAGG CGAAATGGCC
TGGCACGGCA AAACGGATGA AGAAACAAAA AAAGCCCGCC AAATCATGGC TGACTTAGAC
GGCGTAGGCA ACACAACAAA AGCTATTACG AAAGGCGTGG CCATAAGCTC CGCTGTTATA
GCGGCAGTGT CTTTGTTCGG CTCTTACATG GTTGACGTAA GCAATGTTCA GGTAATTATT
AATAACGCCT GGCAGAACGC CGGGCTTGAC CAGGCGATGG TGCTTTTAAA AGACGTGGGT
ATAGTTGTTT CCAACCCGTT AGTCTTTGTA GGCATGTTGC TTGGCGGCGC GGTGCCTTGG
CTGTTCTCTT CTTTCGCTAT TAACTCCGTA ACACGCGCGG CTTCATTAAT TGTGCACGAA
GTAAGAAGAC AGTTCGGCTT AGGTATTTTG GAAGGCAAAG CCAAACCCGA TTATGATAAA
GTTGTAAGAA TTTCAACAGC CGCTGCCCAG AAAGAACTTG TTAATCTGGC ACTTTTAGGC
GTTATTTCCC CTATTCTTGT GGGCTTGACG CTGGGTGTTG AAGCTTTGGG CGGTTTCCTT
GCCGGTATTA TTTTATCGGG CCAGCTGCTT GCCGTGTTTA TGTCTAACGC GGGTGGTGCT
TGGGATAATG CCAAAAAACT TATTGAGGAT GAACCTTCCG ACCCCGCGAA CAATACGGGT
AAAGGGTCAG AACGCCATAA AGCCAGCGTA GTAGGCGATA CGGTGGGCGA TCCTCTTAAA
GATACGGCGG GCCCGGCTCT TAACCCTATG ATTAAGGTTG TTAACCTTGT ATCGGTTATC
GTGGCGCCTG TTATCGTTAC TTATAATAAC GAATATACAA AATTAGGCGT GTGGGGCTGG
TGCTTAGCGG CGGCGCTGCT CGCTATCTTA ACTTGGGCTA TTGTAAGAAG TAAAGCAAGA
GTTGAGTAA
 
Protein sequence
MWYGMSGLHL YEQYAIFGVL GVAVLGLFYA LFLKRQVMAH PAGDAKMQEV WGAIKEGANA 
YLNKQFKAIV PLIVILTICL FLSVYIVPVT AEAMKRFEGL SPEKIKLIIA FGRAGSFILG
SVFSLLVGQI GMRIAVAANV RVADASRRSF GESLKIAYRA GTVTGMLTDG LGLFGGTIIF
VIFGISAPDA LLGFGFGGTL VALFMRVGGG IYTKAADVGA DLVGKVEAGI PEDDPRNPAV
VADLVGDNVG DCAGMAADIF ESYEVTIVSG LILGLALWHI TGNYEWIVYP LIVRGIGVMC
SIIGTYVVRD HGGKGDAMSA IFRGYFTSAV ISITLFAVLA YFYMRDIPGG WWRPWVAVSV
GVILAMAIDR LTEHFTGTEG APVKEVKRST STGSATTILS GFAVGLESSV WSVVVIAITI
FISIVVFGSI EGLTGAAKFN FILYGVAMTG IGMLTLTGNN VAMDSFGPIA DNANGIGEMA
WHGKTDEETK KARQIMADLD GVGNTTKAIT KGVAISSAVI AAVSLFGSYM VDVSNVQVII
NNAWQNAGLD QAMVLLKDVG IVVSNPLVFV GMLLGGAVPW LFSSFAINSV TRAASLIVHE
VRRQFGLGIL EGKAKPDYDK VVRISTAAAQ KELVNLALLG VISPILVGLT LGVEALGGFL
AGIILSGQLL AVFMSNAGGA WDNAKKLIED EPSDPANNTG KGSERHKASV VGDTVGDPLK
DTAGPALNPM IKVVNLVSVI VAPVIVTYNN EYTKLGVWGW CLAAALLAIL TWAIVRSKAR
VE