Gene Nmul_A0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0434 
Symbol 
ID3785902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp481487 
End bp483010 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content56% 
IMG OID637810510 
ProductPpx/GppA phosphatase 
Protein accessionYP_411134 
Protein GI82701568 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGAAT ACTCCACACT TGCGGCGGTA GATCTCGGTT CCAACAGCTT TCGTTTGCAA 
GTCGCGCGAG TGGAAGGCAG ACAGCTTTAT CCGCTGGACA ATCTGCGTGA AATGGTTCGT
CTTGCCGCTG GTCTTACTCG CGACAAGCGC CTGGATGAAG ATTCCCAGGC ACGCGCCCTG
GCCTGTCTCA CGCGTTTCAG TGAACGCCTG CGCGGTTTCC CTCCTCATGC TGTACGTGCG
GTTGCGACCA ACACTCTCAG GGTGGCGAAA AATGCGGCTG TATTTCTCAA AAAAGCCGAG
GCAGCGATGG GTTTTCCCAT TGAAGTCATC TCGGGCCACG AAGAGGCGCG CCTCATCTAT
CTGGGCGTTG CGCACGGCCT TCCTGCTTCT CCTGACGCTC GCCTGGTAAT GGACATCGGG
GGCGGCTCTA CTGAATTTAT CATTGGCCGC AGGCTGAAGC CGGTCAAGCT GGAAAGCCTC
TATATGGGTT GCGTCAGCTA TAGCCTGCGC TTTTTTCCAG GCGGCAGGAT CAGCAGGGAG
GCAATGAACC GTGCCGAACT ATCGGCCCGC AGCGAGATTC AGGCGATCGC AAAGGAATTT
TCCAGTGAAC ACTGGCAGCT CGCATATGGA TCCTCCGGTA CGGCACGCGC GTTAGGCGAT
ATCATCCAGT TGAACCAGCT CGGCAGTGGA AGCGGCAACG GAGAGATTAC GCGGGAAGGG
CTGGAAAATT TTCGTAATCA TCTGCTCAAG GTGGACGATA TCAAAAAACT CGACCTCGCG
GGTATCAAGA CCGATCGGGC GCCTGTCATA GCCGGCGGTT TCGCCATCAT GTCCGCCGCG
TTCGCGGAGC TGGGAATTTC CCGGATGGCG CAAGGCATGG GTGCCTTACG TCAGGGCGTG
CTGTACGATC TGCTGGGGCG CTTCCATAAG CATGATATGC GCGAAGTAAC GGTCAGGCAG
TTCATGCGCC GGTACCATGT GGATGGCGCG CAGGCGGGGC GGGTGGAATC GACTGCACTT
TTGCTCGGAG AACAGTTGCT GGCGGCCTTT CCCTGTGAGG GAGAAGAACA TCTGCAAGTT
CTCTCATGGG CTGCCCGGCT GCATGAAGTA GGTATTTCGG TCGCCCACTC CGGTTACCAC
AAGCATTCTG CATATATCCT GGGTAATGCC GATATGCCGG GTTTTTCCCA AAGGGAGCAG
GAGCGCTTAA GCATGCTTGT GCTCGCCCAC CGGGGCGATA TCGGCAAAGC GCGTGGAAAC
ATGATAGAGC GGGCTGATTT CGCCCTGCTG TTTGCGCTTC GCCTCGCGGC ATTGTTTCAT
CGCAGCCGCT GTGAGACGGC ACTTCCGAGG CTCGAGGTCA GCCTTCGAGG CAAGGAATTC
AGTTTATATC TTGAAAGAAA GTGGCTGGAA GGCAATCCCT TGACATATAA CGCGTTGCTT
GGCGAAATTG AACAGTGGGA TGCCCTTGGT TTTCGTTTTG GCATGGCCGG AGCCGACGGA
AGCAAGTTGT CCGCATCTGT ATAA
 
Protein sequence
MPEYSTLAAV DLGSNSFRLQ VARVEGRQLY PLDNLREMVR LAAGLTRDKR LDEDSQARAL 
ACLTRFSERL RGFPPHAVRA VATNTLRVAK NAAVFLKKAE AAMGFPIEVI SGHEEARLIY
LGVAHGLPAS PDARLVMDIG GGSTEFIIGR RLKPVKLESL YMGCVSYSLR FFPGGRISRE
AMNRAELSAR SEIQAIAKEF SSEHWQLAYG SSGTARALGD IIQLNQLGSG SGNGEITREG
LENFRNHLLK VDDIKKLDLA GIKTDRAPVI AGGFAIMSAA FAELGISRMA QGMGALRQGV
LYDLLGRFHK HDMREVTVRQ FMRRYHVDGA QAGRVESTAL LLGEQLLAAF PCEGEEHLQV
LSWAARLHEV GISVAHSGYH KHSAYILGNA DMPGFSQREQ ERLSMLVLAH RGDIGKARGN
MIERADFALL FALRLAALFH RSRCETALPR LEVSLRGKEF SLYLERKWLE GNPLTYNALL
GEIEQWDALG FRFGMAGADG SKLSASV