Gene Nmul_A0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0739 
SymbolhppA 
ID3786563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp858935 
End bp860968 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content57% 
IMG OID637810821 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_411438 
Protein GI82701872 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00661783 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACTG GTTTAATCAT CGCAATTGGT TGCGCGATGG CGGCACTCCT TTATGGAGTG 
GTTTCGATAA GATGGATTGT AGCGTTGCCA GCAGGCAACG AACGCATGCG GGATATTGCT
ACGGCGATTC AGCAAGGCGC CTCGGCTTAC CTGAACCGCC AATACACCAC CATCAGCATA
GTGGGTGTTA TTCTGCTGAT GGCAATCTTT CTTGCTCTGG GCTGGCAGAC AGCCGTCGGG
TTTGCTCTGG GCGCTTTCCT GTCGGGGCTG ACCGGGTATA TCGGCATGAA CGTATCGGTG
CGTGCCAACG TACGCACGGC GGAAGCAGCG CGGCACGGCC TCAATGCCGC CCTGGATGTT
GCCTTCAAGG GAGGGGCCAT TACCGGTATG CTGGTGGTTG GTCTGGGTTT GCTGGGCGTA
GCCGGCTACT TTGCCCTTCT GATCGGCATG GGCGCAAGTG AATCGCAAGC CACTCACGCC
CTCGTGGGAG TTGCCTTCGG TAGCTCGCTG ATCTCCATTT TTGCACGTCT CGGCGGCGGA
ATTTTCACCA AGGGCGCCGA CGTCGGCGCG GACCTGGTCG GCAAGGTGGA AGCCGGCATC
CCTGAGGATG ATCCACGCAA CCCTGCAGTA ATTGCCGACA ACGTAGGCGA TAACGTGGGC
GACTGTGCCG GGATGGCTGC AGACCTGTTT GAAACCTACG CCGTTACCAT TATCGCGACC
ATGCTGCTGG GCGGCTTGCT CATCACCGAC GCCGGCCCCA ACGCGGTGCT CTATCCACTG
GTATTGGGGG GCGTTTCGAT TATTGCTTCC ATCATTGGCT GTTACTTTGT CAAGGCACGT
GAAGGCGGCA AAATCATGAA TGCGCTTTAC CGTGGTTTGG CAGTTGCCGG CGGGCTGGCG
GCAATTGCCT ATTATCCCAT CACGACCATC ATGCTCGGCG AGGGCGTAAT GATCGAGGGA
AAGCTGGTTA CCTCGACCAG TCTTTACCTC TCCGTACTGG TTGGCCTGGC GCTCACCGCT
GCAATGGTGT GGATCACGGA GTACTACACT TCAACTGAAT TCAAACCGGT ACGCTCCATT
GCCGAAGCTT CCAGCACCGG TCACGGCACC AACGTCATTG CCGGTCTGGG TATTTCAATG
AAGGCAACTG CCTGGCCGGT TGTTGTCGTA TGTCTTTCCA TCTGGATCAC ATACGAACTG
GCAGGCCTGT ATGGCATTGC CATCGCCGCC ACATCGATGC TTTCCATGGC CGGAATCATC
GTCGCGCTGG ATGCTTACGG TCCCATCACG GATAATGCTG GGGGCATTGC CGAAATGTCC
GGTCTGCCTT CGGAAGTACG AGACATCACC GATCCCCTCG ATGCCGTGGG CAACACCACC
AAGGCCGTGA CCAAAGGCTA TGCGATCGGC TCTGCCGGTC TGGCTGCGCT GGTGCTGTTC
GCCGACTACA CCCATGCACT CTCGAGTGGC GGCAAGAGCG TAAACTTTGA TTTGTCCGAT
CACATGGTCA TCATCGGCCT GTTCCTCGGG GGCATGGTTC CCTACCTGTT TGGCGCCATG
GCCATGGAAG CCGTCGGCCG TGCCGCCGGT TCGGTAGTAG TGGAAGTCCG CCGCCAGTTC
AAGGAAATCC CTGGAATCAT GGAAGGAACA GCCAGGCCCG ACTACTCGCG TGCAGTGGAT
ATGGTGACAA GAGCGGCGAT CAAGGAAATG ATCCTTCCCT CCCTGCTTCC GGTTGCCGTT
CCCCTGATCG TCGGCCTCAT GCTGGGTCCG GTTGCTCTCG GCGGGGTACT GATCGGTGCG
ATCATTACAG GCATTTTCGT GGCAATTTCG ATGACTGCCG GGGGCGGTGC CTGGGATAAC
GCCAAGAAAT ACATTGAAGA TGGCCATTTC GGTGGAAAAG GTTCGGAAGC GCATAAGGCA
GCCGTTACAG GTGATACAGT GGGCGATCCT TACAAGGATA CTGCGGGTCC AGCCGTGAAT
CCGCTCATCA AGATCATGAA TATCGTGGCG CTCCTGATTG TGCCGTTGTT GTAA
 
Protein sequence
MSTGLIIAIG CAMAALLYGV VSIRWIVALP AGNERMRDIA TAIQQGASAY LNRQYTTISI 
VGVILLMAIF LALGWQTAVG FALGAFLSGL TGYIGMNVSV RANVRTAEAA RHGLNAALDV
AFKGGAITGM LVVGLGLLGV AGYFALLIGM GASESQATHA LVGVAFGSSL ISIFARLGGG
IFTKGADVGA DLVGKVEAGI PEDDPRNPAV IADNVGDNVG DCAGMAADLF ETYAVTIIAT
MLLGGLLITD AGPNAVLYPL VLGGVSIIAS IIGCYFVKAR EGGKIMNALY RGLAVAGGLA
AIAYYPITTI MLGEGVMIEG KLVTSTSLYL SVLVGLALTA AMVWITEYYT STEFKPVRSI
AEASSTGHGT NVIAGLGISM KATAWPVVVV CLSIWITYEL AGLYGIAIAA TSMLSMAGII
VALDAYGPIT DNAGGIAEMS GLPSEVRDIT DPLDAVGNTT KAVTKGYAIG SAGLAALVLF
ADYTHALSSG GKSVNFDLSD HMVIIGLFLG GMVPYLFGAM AMEAVGRAAG SVVVEVRRQF
KEIPGIMEGT ARPDYSRAVD MVTRAAIKEM ILPSLLPVAV PLIVGLMLGP VALGGVLIGA
IITGIFVAIS MTAGGGAWDN AKKYIEDGHF GGKGSEAHKA AVTGDTVGDP YKDTAGPAVN
PLIKIMNIVA LLIVPLL