Gene Phep_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1001 
SymbolhppA 
ID8252095 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1171405 
End bp1173711 
Gene Length2307 bp 
Protein Length768 aa 
Translation table11 
GC content46% 
IMG OID644934655 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_003091284 
Protein GI255530912 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTTT TACAAAACAA TTTAATTTAT CTGATCCCTG CAATGGGGCT GATTGGCATT 
CTTGTAATGG CAGTTAAAAG CGCCTGGGTG AACAAACAGG ATGCGGGTGA TAAGAACATG
CAGGAGCTGG CCGGATATAT AGCCGATGGC GCCATGGCTT TTTTAAAAGC TGAATGGAGG
GTACTCAGCA TTTTTGTGGT GTTTACTGCA GCATTGCTGG CCTATTCAGG AACCATACAC
GAGGTAAATG GTGTAGCCTT ACATTCCAGC TGGATCATTG CTATCGCATT TATTATTGGT
GCTGTATTTT CTGCAACGGC AGGATATATA GGAATGAAGG CCGCAACAAA AGCTAATGTC
CGTACTACCC AGGCAGCAAG GACCAGCCTA AAGCAGGCTT TAAAAGTATC TTTTACAGGC
GGTACCGTAA TGGGACTTGG TGTAGCAGGT TTGGCCATAC TGGGTTTGGG CGGCCTCTTT
ATTGTTTTTC TTAAGGTATT TAATGTAGTT GAGCCCAACA GCACAGAAAT GAAAACTGCC
ATAGAAGTAC TCACAGGATT TTCACTTGGG GCAGAATCTA TTGCTCTATT TGCCCGTGTA
GGTGGCGGTA TTTATACCAA AGCTGCTGAT GTTGGGGCCG ATCTGGTAGG TAAAGTTGAA
GCCGGAATTC CGGAAGATGA TGTACGTAAC CCTGCTACCA TTGCGGATAA CGTAGGTGAT
AACGTAGGCG ATGTAGCTGG TATGGGTGCC GATCTGTTCG GCTCGTATGT AGCTACTATT
CTGGCAACGA TGGTGTTGGG ACAGGAAATT GTGGTCGATA AATTAAATGG AATCGCAGTT
GACAACCTGA ACGGATTTTC ACCGGTACTG CTCCCAATGG TCATCTGTGG CTTAGGCATT
CTTTTTTCTA TAGTAGGTAC CTGGTTTGTG CGCATTAAAG GTGAAGACTC CAATGTGCAA
ACGGCCTTAA ATTTAGGCAA CTGGGGCTCA ATTGTAATTA CAGCCATTGC TTCTTATTTT
GTAGTTACGG CCATGCTTCC TGAACATCTG CACCTGCGTG GTGTTAATTT CAGTAGTCTG
GATGTATTTT ATTCCATTAT AGTTGGCTTA GTAGTGGGTA CTTTAATGAG TATCATCACC
GAATACTATA CGGCAATGGG CAAAGGGCCG GTAAATTCAA TTATCCAGCA GTCGGGTACC
GGTCATGCCA CCAATATCAT CGGTGGTCTG TCTGTAGGGA TGAAATCTAC CGTTGCCCCT
ATTCTTGTTC TGGCCGGTGG TATCATTTTT TCTTATGCCT TTGCAGGTTT ATATGGGGTT
GCCATTGCAG CTGCCGGTAT GATGGCTACC ACTGCCATGC AGCTGGCTAT TGATGCTTTC
GGGCCAATAG CAGATAATGC TGGTGGTATA GCCGAAATGA GCCAGTTGCC ACCTGAAGTA
CGCGAACGTA CAGATAACCT GGATGCTGTG GGTAATACTA CGGCTGCTAC GGGTAAAGGT
TTTGCCATCG CTTCTGCCGC CTTAACTTCA CTGGCCTTAT TCGCGGCATT TGTGGGCGTT
GCAGGCATTT CGGCAATAGA CATTTACAAA GCGCCTGTAC TTGCCGGTTT ATTTGTAGGT
GCAATGATCC CTTTCATCTT TTCGGCTTTA TGTATTGCAG CCGTTGGTAA AGCTGCGATG
GATATGGTAC AGGAAGTACG CCGTCAATTC CGCGAAATTC CGGGTATTAT GGAATATAAG
GCAAAACCTG AATATGAAAA ATGTGTAGCC ATTTCAACCA AAGCCTCTAT CCGGGAAATG
ATGCTGCCGG GCGCAATCGC ACTATTGGTA CCTATTCTGG TTGGTTTTGG TTTTAAAGGA
GTTTTCCCAT CAGTAAGCTC AGCCGAAATA CTGGGTGGCC TGCTGGCTGG CGTTACGGTT
TCTGGCGTTT TAATGGGTAT TTTCCAGTCT AATGCCGGCG GAGCCTGGGA CAACGCCAAG
AAATCATTCG AAAAAGGCGT AGAGATCAAT GGTGAGATGC ATTATAAAAA ATCTGAACCC
CATAAAGCTT CAGTAACAGG TGATACCGTT GGAGACCCTT TTAAAGATAC TTCAGGCCCG
TCCATGAATA TTCTGATCAA ACTGATGTCG ATCGTCTCGC TGGTGATTGC TCCTTATATT
GCAGTAACCG GTACCGGGGC CACGGTAAAG CTAACTGCAC CGGTAACACC AACCGAGCTG
TCCGTTAAAC CGACAAACGG TTTAAAAGTT GTCCGGGTTA CGCAAAATGT AACAGCTATT
AAAAAAGCAG CCATTAAAAA TTTCTAG
 
Protein sequence
MEFLQNNLIY LIPAMGLIGI LVMAVKSAWV NKQDAGDKNM QELAGYIADG AMAFLKAEWR 
VLSIFVVFTA ALLAYSGTIH EVNGVALHSS WIIAIAFIIG AVFSATAGYI GMKAATKANV
RTTQAARTSL KQALKVSFTG GTVMGLGVAG LAILGLGGLF IVFLKVFNVV EPNSTEMKTA
IEVLTGFSLG AESIALFARV GGGIYTKAAD VGADLVGKVE AGIPEDDVRN PATIADNVGD
NVGDVAGMGA DLFGSYVATI LATMVLGQEI VVDKLNGIAV DNLNGFSPVL LPMVICGLGI
LFSIVGTWFV RIKGEDSNVQ TALNLGNWGS IVITAIASYF VVTAMLPEHL HLRGVNFSSL
DVFYSIIVGL VVGTLMSIIT EYYTAMGKGP VNSIIQQSGT GHATNIIGGL SVGMKSTVAP
ILVLAGGIIF SYAFAGLYGV AIAAAGMMAT TAMQLAIDAF GPIADNAGGI AEMSQLPPEV
RERTDNLDAV GNTTAATGKG FAIASAALTS LALFAAFVGV AGISAIDIYK APVLAGLFVG
AMIPFIFSAL CIAAVGKAAM DMVQEVRRQF REIPGIMEYK AKPEYEKCVA ISTKASIREM
MLPGAIALLV PILVGFGFKG VFPSVSSAEI LGGLLAGVTV SGVLMGIFQS NAGGAWDNAK
KSFEKGVEIN GEMHYKKSEP HKASVTGDTV GDPFKDTSGP SMNILIKLMS IVSLVIAPYI
AVTGTGATVK LTAPVTPTEL SVKPTNGLKV VRVTQNVTAI KKAAIKNF