Gene Nmar_1231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1231 
SymbolhppA 
ID5774371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp1128591 
End bp1130630 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content39% 
IMG OID641316875 
Productmembrane-bound proton-translocating pyrophosphatase 
Protein accessionYP_001582565 
Protein GI161528739 
COG category[C] Energy production and conversion 
COG ID[COG3808] Inorganic pyrophosphatase 
TIGRFAM ID[TIGR01104] vacuolar-type H(+)-translocating pyrophosphatase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATCT CTGAGATTCT GCCATTTATC GCAGGAATTG CATCATTTTT AGTTGCTGGT 
GGATTAGTTG CTTGGATTTC AAAGCAACCT GCTGGAACAA AAGAGATGAT GGATATCTCA
AATGCTGTCA AAGTTGGCGC TGCAGCATTT CTGAAAAGAG AGATGAAAAT TATCATTCCT
GTTGCAATTG CATTAACTGT AATTATTGGA GCATTCCTTA CTCCTTCAAA CGGTCTTGCA
TTTGCAGTTG GCGCAGCATT ATCGGCTGTT GCAGGAATTA TCTCATTAAA AATTACAGTA
AAAGCAGCAG TAAGAGCTGC ACATCTAAGC AATGACGGAC TAGGAAAAAC ATTTGCCATG
GCCTTTAGAG GCGGTGCAAC CGTTGGTCTT GCAGTACCTG CAATGGCATT AATGGCAATT
GCTGGTTTGT ATGTAATTTA TCCAGACCCA ATTACAATTG CAGGTGTTGG AATTGGAGCA
AGTCTTATTG CATTATTTAT CAGAATTGGT GGTGGTATCT TCACAAAAGC TGCGGATATG
GGAGCAGACT TGGTAGGAAA AGTTGAAGCA AATATTCCTG AAGATGATCC TAGAAACCCT
GCAACTATTG CAGACAACGT AGGAGATAAC GTTGGAGATG CAGCTGGAAT GGGTTCTGAT
GTTTACGAAT CTTATATTGT TACAATTTTG GCAGCATTAC TAATTGCAGC ATTAATTGGT
GCACCAAACT ATTTCCTTTA TCCAATCTTA GTCGGTTCAT CTGGAATGAT TGCATCAATC
ATTGGTGTTG TTATTGTAGG TTCAAAAGGT GTTACGGATG TAATGAAACC GCTTAATCGT
TCGTTTTATG TTTCAGCAGC AATTGCAATC GCATTAAACT ATGTGTTTAT CACTCAATTC
ATTGAACAAA ACCAAGCAGC ATATGCTTTG TTTGGAACTA CCGTAATTGG AGTAATTTTG
GTTCCTGTGA TTCAAAAAAT TACTGATAGA TATACTAGTT ACCAACACGG TCCTGTTAAT
GAAATTGCAG ACTCTGCAAA ATGGGGATAT GCATCATTAA CCTTGATGGG AATTATCAAA
GGCATGCAAT CAACTGGGCC ATTCATGATT GCATTAGTTG TTGCAATTAT CATATCTTAT
AGCATCGCAG CTGCAGCAGC TCCTGAAGGT GCAGACCCAG TACTATATGG CATATTTGGA
ACTTCTCTAA CTGCTATGGC AATGTTGAGT CTTGCAGGAA TTGTTCTAAG TATAGATGCA
TTTGGTCCAA TTGCTGATAA TGCAGGCGGT ATTGTTGAGA TGACTGGAAT GGGAGAAGAA
AATCGTAAAG TTACAGATGA AATTGATGCT GTTGGAAATA CAACTAAAGC AGTTACAAAA
GGATTTGCTA TTGCCAGTGC TGCATTAGCT GCATTAGCAA TGATTCAAGC ATTCCAATTT
GAAGCAGCAC ATATCTTTGA AGGCGTATTA GATTTAGATT ATAGTTTAAC AAATCCTGCA
ATCATTGTTG GTCTTTTGGT GGGTGGATTA ATTCCATTCA TCATCACTGG TCAGTTAATT
AACGGTGTAT CTCGTGCTGC TGGAAAGATG GTAGATGAGG TTAGACGTCA ATTCAAATCT
GATCCTGAAA TTTTAACTGG TAAATCAAAA CCTGATTATG CAAAGTGTGT AGATATTGCA
ACTGTTGCTT CTATCCGTGA ACTTTGGAAA CCAGCAATTG TTGCAATTAT AGCTCCAATT
ATTCTTGGTA TAATATTAGG TCCAACTGCC GTGGCTGGAT TACTTATGGG TTCTGTTGTA
ACTGGCATTC TCCTTGCTTA TCACTTGGCT AACACTGGTG GTGCATGGGA TAACGCAAAG
AAACTTGTTG AAATGAAAGG TGAGAAAGGT TCTGAAGTAC ACAAAGTAGC AGTCGTTGGT
GATATCATTG GTGACCCATA CAAAGACACT GCAGGTCCAG CTCTTAACAC AGTAATCAAA
CTACTTAATA CAATTGCAAT AGTGTTTGTA TCTGCATTTG TAGCAATACT TGCAATCTAA
 
Protein sequence
MEISEILPFI AGIASFLVAG GLVAWISKQP AGTKEMMDIS NAVKVGAAAF LKREMKIIIP 
VAIALTVIIG AFLTPSNGLA FAVGAALSAV AGIISLKITV KAAVRAAHLS NDGLGKTFAM
AFRGGATVGL AVPAMALMAI AGLYVIYPDP ITIAGVGIGA SLIALFIRIG GGIFTKAADM
GADLVGKVEA NIPEDDPRNP ATIADNVGDN VGDAAGMGSD VYESYIVTIL AALLIAALIG
APNYFLYPIL VGSSGMIASI IGVVIVGSKG VTDVMKPLNR SFYVSAAIAI ALNYVFITQF
IEQNQAAYAL FGTTVIGVIL VPVIQKITDR YTSYQHGPVN EIADSAKWGY ASLTLMGIIK
GMQSTGPFMI ALVVAIIISY SIAAAAAPEG ADPVLYGIFG TSLTAMAMLS LAGIVLSIDA
FGPIADNAGG IVEMTGMGEE NRKVTDEIDA VGNTTKAVTK GFAIASAALA ALAMIQAFQF
EAAHIFEGVL DLDYSLTNPA IIVGLLVGGL IPFIITGQLI NGVSRAAGKM VDEVRRQFKS
DPEILTGKSK PDYAKCVDIA TVASIRELWK PAIVAIIAPI ILGIILGPTA VAGLLMGSVV
TGILLAYHLA NTGGAWDNAK KLVEMKGEKG SEVHKVAVVG DIIGDPYKDT AGPALNTVIK
LLNTIAIVFV SAFVAILAI