Gene Information |
Locus tag | Nmul_A0739 |
Symbol | hppA |
ID | 3786563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 858935 |
End bp | 860968 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810821 |
Product | membrane-bound proton-translocating pyrophosphatase |
Protein accession | YP_411438 |
Protein GI | 82701872 |
COG category | [C] Energy production and conversion |
COG ID | [COG3808] Inorganic pyrophosphatase |
TIGRFAM ID | [TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase |
|
|
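The length fields in the table above are consistent with the listed coordinates: with 1-based inclusive coordinates, gene length is End bp minus Start bp plus one, and for an unspliced CDS the protein length is the codon count minus the stop codon. A minimal sketch of that check follows. The function names are illustrative, not part of any database API, and the inclusive-coordinate convention is an assumption that happens to match the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(858935, 860968)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

The strand field does not enter this calculation, since the interval length is the same on either strand.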
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00661783 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACTG GTTTAATCAT CGCAATTGGT TGCGCGATGG CGGCACTCCT TTATGGAGTG GTTTCGATAA GATGGATTGT AGCGTTGCCA GCAGGCAACG AACGCATGCG GGATATTGCT ACGGCGATTC AGCAAGGCGC CTCGGCTTAC CTGAACCGCC AATACACCAC CATCAGCATA GTGGGTGTTA TTCTGCTGAT GGCAATCTTT CTTGCTCTGG GCTGGCAGAC AGCCGTCGGG TTTGCTCTGG GCGCTTTCCT GTCGGGGCTG ACCGGGTATA TCGGCATGAA CGTATCGGTG CGTGCCAACG TACGCACGGC GGAAGCAGCG CGGCACGGCC TCAATGCCGC CCTGGATGTT GCCTTCAAGG GAGGGGCCAT TACCGGTATG CTGGTGGTTG GTCTGGGTTT GCTGGGCGTA GCCGGCTACT TTGCCCTTCT GATCGGCATG GGCGCAAGTG AATCGCAAGC CACTCACGCC CTCGTGGGAG TTGCCTTCGG TAGCTCGCTG ATCTCCATTT TTGCACGTCT CGGCGGCGGA ATTTTCACCA AGGGCGCCGA CGTCGGCGCG GACCTGGTCG GCAAGGTGGA AGCCGGCATC CCTGAGGATG ATCCACGCAA CCCTGCAGTA ATTGCCGACA ACGTAGGCGA TAACGTGGGC GACTGTGCCG GGATGGCTGC AGACCTGTTT GAAACCTACG CCGTTACCAT TATCGCGACC ATGCTGCTGG GCGGCTTGCT CATCACCGAC GCCGGCCCCA ACGCGGTGCT CTATCCACTG GTATTGGGGG GCGTTTCGAT TATTGCTTCC ATCATTGGCT GTTACTTTGT CAAGGCACGT GAAGGCGGCA AAATCATGAA TGCGCTTTAC CGTGGTTTGG CAGTTGCCGG CGGGCTGGCG GCAATTGCCT ATTATCCCAT CACGACCATC ATGCTCGGCG AGGGCGTAAT GATCGAGGGA AAGCTGGTTA CCTCGACCAG TCTTTACCTC TCCGTACTGG TTGGCCTGGC GCTCACCGCT GCAATGGTGT GGATCACGGA GTACTACACT TCAACTGAAT TCAAACCGGT ACGCTCCATT GCCGAAGCTT CCAGCACCGG TCACGGCACC AACGTCATTG CCGGTCTGGG TATTTCAATG AAGGCAACTG CCTGGCCGGT TGTTGTCGTA TGTCTTTCCA TCTGGATCAC ATACGAACTG GCAGGCCTGT ATGGCATTGC CATCGCCGCC ACATCGATGC TTTCCATGGC CGGAATCATC GTCGCGCTGG ATGCTTACGG TCCCATCACG GATAATGCTG GGGGCATTGC CGAAATGTCC GGTCTGCCTT CGGAAGTACG AGACATCACC GATCCCCTCG ATGCCGTGGG CAACACCACC AAGGCCGTGA CCAAAGGCTA TGCGATCGGC TCTGCCGGTC TGGCTGCGCT GGTGCTGTTC GCCGACTACA CCCATGCACT CTCGAGTGGC GGCAAGAGCG TAAACTTTGA TTTGTCCGAT CACATGGTCA TCATCGGCCT GTTCCTCGGG GGCATGGTTC CCTACCTGTT TGGCGCCATG GCCATGGAAG CCGTCGGCCG TGCCGCCGGT TCGGTAGTAG TGGAAGTCCG CCGCCAGTTC AAGGAAATCC CTGGAATCAT GGAAGGAACA GCCAGGCCCG ACTACTCGCG TGCAGTGGAT ATGGTGACAA GAGCGGCGAT CAAGGAAATG ATCCTTCCCT CCCTGCTTCC GGTTGCCGTT CCCCTGATCG TCGGCCTCAT GCTGGGTCCG GTTGCTCTCG GCGGGGTACT GATCGGTGCG 
ATCATTACAG GCATTTTCGT GGCAATTTCG ATGACTGCCG GGGGCGGTGC CTGGGATAAC GCCAAGAAAT ACATTGAAGA TGGCCATTTC GGTGGAAAAG GTTCGGAAGC GCATAAGGCA GCCGTTACAG GTGATACAGT GGGCGATCCT TACAAGGATA CTGCGGGTCC AGCCGTGAAT CCGCTCATCA AGATCATGAA TATCGTGGCG CTCCTGATTG TGCCGTTGTT GTAA
|
Protein sequence | MSTGLIIAIG CAMAALLYGV VSIRWIVALP AGNERMRDIA TAIQQGASAY LNRQYTTISI VGVILLMAIF LALGWQTAVG FALGAFLSGL TGYIGMNVSV RANVRTAEAA RHGLNAALDV AFKGGAITGM LVVGLGLLGV AGYFALLIGM GASESQATHA LVGVAFGSSL ISIFARLGGG IFTKGADVGA DLVGKVEAGI PEDDPRNPAV IADNVGDNVG DCAGMAADLF ETYAVTIIAT MLLGGLLITD AGPNAVLYPL VLGGVSIIAS IIGCYFVKAR EGGKIMNALY RGLAVAGGLA AIAYYPITTI MLGEGVMIEG KLVTSTSLYL SVLVGLALTA AMVWITEYYT STEFKPVRSI AEASSTGHGT NVIAGLGISM KATAWPVVVV CLSIWITYEL AGLYGIAIAA TSMLSMAGII VALDAYGPIT DNAGGIAEMS GLPSEVRDIT DPLDAVGNTT KAVTKGYAIG SAGLAALVLF ADYTHALSSG GKSVNFDLSD HMVIIGLFLG GMVPYLFGAM AMEAVGRAAG SVVVEVRRQF KEIPGIMEGT ARPDYSRAVD MVTRAAIKEM ILPSLLPVAV PLIVGLMLGP VALGGVLIGA IITGIFVAIS MTAGGGAWDN AKKYIEDGHF GGKGSEAHKA AVTGDTVGDP YKDTAGPAVN PLIKIMNIVA LLIVPLL
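The gene sequence as listed starts with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The relationship between the two sequence fields can be sketched as a plain codon-by-codon translation. This is a hedged illustration, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the sketch relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code. It does not handle the alternative start codons that table 11 permits, which is acceptable here only because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGAGTACTG GTTTAATCAT CGCAATTGGT"))
```

Run on the first 30 bases, this yields the first ten residues of the listed protein sequence. Passing the full gene sequence should, under the same assumptions, reproduce the full 677-residue protein.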