Gene Nmul_A0530 details

Gene Information

Locus tag: Nmul_A0530
Symbol:
ID: 3784519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 606028
End bp: 607542
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 55%
IMG OID: 637810612
Product: peroxidase, putative
Protein accession: YP_411230
Protein GI: 82701664
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2837] Predicted iron-dependent peroxidase
TIGRFAM ID: [TIGR01413] Dyp-type peroxidase family
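
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the gene ends in a single stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 606028
end_bp = 607542

gene_length = end_bp - start_bp + 1   # inclusive span on the replicon
codon_count = gene_length // 3        # includes the terminal stop codon
protein_length = codon_count - 1      # the stop codon is not translated

print(gene_length, protein_length)    # 1515 504
```

Both results agree with the Gene Length (1515 bp) and Protein Length (504 aa) fields.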


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.970895
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGAAT CAAGCGACTT TGAATTCGAT GATCTTCAGG GGCTGCTGCG CTTTGGATAT 
GGGAAGCTGA CGGATACCTC TTTCCTGCTG CTGAATATCG TAAATGCCGA TCTGGCCAAA
AAATGGCTGG AGACAGCACC CATCAGCAGC GCCGTCACGC AGGATCCTCC GCCAGAAACG
GCGCTGCAGA TTGCCTTTTC CGTGGAGGGT CTGCGTGTCC TCGGACTGGA GGAATCCGTT
GTCGAGGGCT TTTCCGATGA GTTCATCGTG GGTATGGCAG GAGATGAAAG CCGTTCCCGA
CGCCTGGGTG ATATCGGCAG CAATGCGCCC CAGCAGTGGA AATGGGGAGG CAATGCGGCG
CAGGTACCCC ATGTCCTGTT ACTGCTTTAC GCCACGGAAG GAGGGATCGG CGCCTGGCAA
AAAACCATAG AAGGCGAACA TTTCTCCAAC GCATTTGAGT TGCTGCAGGA ATTACCTACG
CTTTATATCG GTGACATCGA GCCTTTCGGA TTTCCGGACA GCATCAGCCA GCCAACCGTG
GACTGGATGC AGCGGCAAAG CACGGATACT CACGAGCGCG ATCGTTATTC CAACTTGCTG
GCTCCTGGAG AAATCGTACT TGGCTATCGC AATGAATATG GGCAGTACAC TGCGCGCCCG
CTCATCGATC CCCGGAAAGA CAAGCTTGCA ACCATGTTGC CTGGTGCCGA GGATGATCCC
CCCCTAAAGG ATTTCGGCCG TAATGGCACT TATCTGGTGC TTCGGCAACT GGGCCAGGAT
GTTCCGGGTT TTTGGCAGTT TCTTGACCAG GTGGCGGATC ACATACCGGA AAAACGCGAG
CAACTTGCCG CAGCGATGGT GGGCCGGGAA CTCGACGGCA CGCCCTTGGT TCCGGCCCAC
ATTCCCGGCA TTTCGCGCAA AGAATATGAG AACGATTTTA CTTATGAGCA GGACCCAAAG
GGGAACTATT GCCCCCTCGG CGCCCATGTG CGCCGGTCCA ATCCGCGTAC AGGCGATTTG
CCCACGAGTG CATGCGGTCC CGTAGACCGG TTGATCAAAA TACTGGGTTT TGGTCAAAGA
CCGGATGAAG ATCTGGTCGC CTCTTCCCGT TTTCATCGTT TGCTGCGGCG TGGTCGCACT
TATGGACCCG CCCTTGCACC GAAAGACGCT GTCGAACGCC ATGCGCCTGT CGCCGAACGG
GGAATACAGT TTATCTGCCT CGTGGGCAAC ATATCGCGGC AATTCGAATT TGTTCAGAAT
GCATGGACCA TGAACAGTAA GTTCGATGGC GTGCAAGATG AAAAAGATCC CATGCTGGGG
AATCGCGAAC CCTTGATGAG CGGGGAGAGC ACTGATCATT TCAATTCTCC TGATCCGGCC
GGGCCGATGC GGACAACCTG TCATTTGCCG CAATTTGTTA CCACCCTTGG GGGGGGATAT
TTTTTTATGC CCGGGCTGCG CGCACTCAAA TACATTTCCG CCTTACCTGC TAACAGGACT
GGTAGCCCAT CATGA
 
Protein sequence
MKESSDFEFD DLQGLLRFGY GKLTDTSFLL LNIVNADLAK KWLETAPISS AVTQDPPPET 
ALQIAFSVEG LRVLGLEESV VEGFSDEFIV GMAGDESRSR RLGDIGSNAP QQWKWGGNAA
QVPHVLLLLY ATEGGIGAWQ KTIEGEHFSN AFELLQELPT LYIGDIEPFG FPDSISQPTV
DWMQRQSTDT HERDRYSNLL APGEIVLGYR NEYGQYTARP LIDPRKDKLA TMLPGAEDDP
PLKDFGRNGT YLVLRQLGQD VPGFWQFLDQ VADHIPEKRE QLAAAMVGRE LDGTPLVPAH
IPGISRKEYE NDFTYEQDPK GNYCPLGAHV RRSNPRTGDL PTSACGPVDR LIKILGFGQR
PDEDLVASSR FHRLLRRGRT YGPALAPKDA VERHAPVAER GIQFICLVGN ISRQFEFVQN
AWTMNSKFDG VQDEKDPMLG NREPLMSGES TDHFNSPDPA GPMRTTCHLP QFVTTLGGGY
FFMPGLRALK YISALPANRT GSPS
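
The gene and protein sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before use. The following is a minimal sketch, not part of the original record, that strips the block spacing and translates DNA codon by codon. It assumes that the codon-to-amino-acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs mainly in the start codons it permits); the helper names `clean_sequence` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to print 10-residue blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as printed above (excerpts only).
first_blocks = clean_sequence("ATGAAAGAAT CAAGCGACTT TGAATTCGAT")
last_blocks = clean_sequence("GGTAGCCCAT CATGA")

print(translate(first_blocks))  # MKESSDFEFD
print(translate(last_blocks))   # GSPS
```

The two excerpts translate to the first ten and last four residues of the protein sequence listed above; the final line is in frame because the 1500 bases preceding it are a multiple of three.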