Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0762 |
Symbol | hppA |
ID | 5055821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 679248 |
End bp | 681404 |
Gene Length | 2157 bp |
Protein Length | 718 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468321 |
Product | membrane-bound proton-translocating pyrophosphatase |
Protein accession | YP_001153000 |
Protein GI | 145590998 |
COG category | [C] Energy production and conversion |
COG ID | [COG3808] Inorganic pyrophosphatase |
TIGRFAM ID | [TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.364765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAAT ACGCGATACT CGGAGTAGTG GTGGGGATGT TGGGGGTTTT ATACGCAGTG TACCTAGCGA GGTGGGTTCT GAAACAAGAC CCCGGAACGG AGAAAATGCG TTTCATATCA CAGGCAATAG CCACAGGCGC GAGAGCCTAC CTCTTCAGGC AGTACAGAAC CCTCGCGGTG TTGCTCGCGG TGCTCGCCGT GTTGATCCTA CTAGCGATAG ATGTGCCCCG CGGCACCATG GGCCTCACAG CCTTGGGCTT CGTCGTAGGC GCCCTCGGCT CTATGATAGC AGGCTACCTG GGCATGTACG TGACGACGAG ATCCGCCTCG CGCGTGGCGC AAGCGGCGGC CACCGGGGGC ATGGGCAAGG CGTTGCAGGT ATCGTGGCGT GCCGGCGCCG TCATGGGCCT CTCCCTTGCC AGCATAGCGC TTCTCCTGAT ATCCGGCTTC TACCTCGTAT TCAAGGCCGT CACCGAAGAA TGGGCAGTCC CCCTAGTGGC CCTCGGCTTC GGCGCCTCCC TCGTCACCTT ATTTATGAGA GTCGGCGGCG GCATATATAC AAAGGCGGCG GACCTCGGCG CCGATTTGGT GGGGAAGGTG GAGGCCGGCA TACCTGAAGA CGACCCCCGC AACCCTGGCG TCATCGCCGA CAACGTAGGC GACAACGTAG GCGACGTCGC CGGCATGGCC GCCGACGTCT ACGAGTCCTA CATCGTCACC GTGACCGCCG CCATATTCCT CGCCGCGATA CTCCACCTAC CGGCGCAGTT CATAGAGGCG ATAATCCTCT TCGCCACGCT AGCCCTCGTA GCCACCTTCG CGGGAGTGAA CATGTTAAAG ACGACCGGGG TGAAGCACCC GCTTTCCTCT ATCAGTACGG CCATCTACGC TACCATCGCG CTGTCCATAG TGCTCTTCTT CGCCGGGTCG TTCGCCCTTG GTCTTGACAT AACAAAGGCA CTGGCCCTAG CGGCGGCCAC CTCGCTGGGC GCGGCGATAG CACCGCTTGT TGTGAAGATA ACGGACTACT TCACCTCCTA CAACTACAAC CCAGTCAAGA GGATAGCCGA GCAGGCTAAG ATCAGCCCCG CTACTGTGAT AATTACAGGA TACGGCGTCG GGCTGATGAG CGCCATACCG GTAATAGCCG TCATCGCGAC GGTGCTGGGC ATCTCGTACA TGATAGGTTA CTACACAGTG CCCGTCGAGG GCTTCGGCGA GCTTTCAAAA TACCTGGCCG GGATATTCGG CACGGCTATG GCCAGCGTCG GCCTACTGGT GGTGGCCGGG ATAATAATAA CCGCCGACTC CTACGGCCCC GTCTCCGACA ACGCAGGAGG CGTAGTGGAG ATGGCTGGGC TCCCAGACGA GGTGAGGGAA ATAACAGACG TCTTGGACTC CGTGGGCAAC ACCACAAAGG CCACCACCAA GGGATACGCC ATAGCCAGTG CCGCGCTGGC CGCGTTGGTC CTCTTCATAG CCCTCATATT CGAGATAGTC TCATCGGCGA CCACACTCCT GCACAAAAAC CTAGTCGACG TGATGAGGGA GAGCCTCTCC GTGTTAAACG TGATCAATGC CAACGTCCTC ATAGGCGCCT TCATAGGAGT CTCAATAGTC TACTTCTTCA GTAGCCGCAC TCTTGAGGCT GTGGGCAAGA CCGCCATGGA GATCGTGGAG GAGATCCGCC GCCAGTTCAG AGAGAAGCCG GGAATATTGG AGTGGAAGGA AACCCCCGAC TACGCCCGCG TCGTCGATAT AGCCACCAGG AGAGCCCTCG GCGAGTTCCT AGTGCCCGGC CTAGCCGCCA TCATAGTACC CCTACTCACA GGACTTCTCC TCGGCTGGAA CGCCCTCGCC GGTCTCATCA TGGGCGCCAT AGTAGCCGGA GTCCCCAGGG CATTGCTCAT GGCCAACGCC GGCGGCGCCT GGGACAACGC CAAGAAATAC ATCGAAATCC AAGGCCTCAA AAAGACGGAG CAACACAAAG CAGCTGTAAT TGGCGATACT GTAGGGGACC CATTCAAAGA TACGACAGGC CCATCTCTTA ACCCCTTGAT TAAGGTGCTA AATACGCTTT CAGTCGTGTT TGCGTACGCT ATCGTATTCA CCAACATAGC GCTTGGAATA TTCCCATTTG GATTGCTACC CCTCTAA
|
Protein sequence | MPEYAILGVV VGMLGVLYAV YLARWVLKQD PGTEKMRFIS QAIATGARAY LFRQYRTLAV LLAVLAVLIL LAIDVPRGTM GLTALGFVVG ALGSMIAGYL GMYVTTRSAS RVAQAAATGG MGKALQVSWR AGAVMGLSLA SIALLLISGF YLVFKAVTEE WAVPLVALGF GASLVTLFMR VGGGIYTKAA DLGADLVGKV EAGIPEDDPR NPGVIADNVG DNVGDVAGMA ADVYESYIVT VTAAIFLAAI LHLPAQFIEA IILFATLALV ATFAGVNMLK TTGVKHPLSS ISTAIYATIA LSIVLFFAGS FALGLDITKA LALAAATSLG AAIAPLVVKI TDYFTSYNYN PVKRIAEQAK ISPATVIITG YGVGLMSAIP VIAVIATVLG ISYMIGYYTV PVEGFGELSK YLAGIFGTAM ASVGLLVVAG IIITADSYGP VSDNAGGVVE MAGLPDEVRE ITDVLDSVGN TTKATTKGYA IASAALAALV LFIALIFEIV SSATTLLHKN LVDVMRESLS VLNVINANVL IGAFIGVSIV YFFSSRTLEA VGKTAMEIVE EIRRQFREKP GILEWKETPD YARVVDIATR RALGEFLVPG LAAIIVPLLT GLLLGWNALA GLIMGAIVAG VPRALLMANA GGAWDNAKKY IEIQGLKKTE QHKAAVIGDT VGDPFKDTTG PSLNPLIKVL NTLSVVFAYA IVFTNIALGI FPFGLLPL
|
| |