Gene Nmul_A0448 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0448 
Symbol 
ID3785916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp497199 
End bp498200 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content54% 
IMG OID637810524 
Productcytochrome-c peroxidase 
Protein accessionYP_411148 
Protein GI82701582 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTGC GGGCATTAAT AACGTCGCTG CTAGCTGTTG GAACGATTGT CGCTCCGCCT 
GCCCAGGCGG CTGCCGCCGA CGAACCCATA AAACCGATCG AGGCAGCCAA GCCCAAGAAT
GAAAACAAGG TGGAACTGGG CAAAATGCTT TTTTTTGATC CCCGTCTTTC CAAATCCGGC
TTCATCTCGT GCAACTCCTG TCACAACCTG AGTATGGGGG GATCCGACAA TCTCCCCTCA
TCCATTGGCC ACAAATGGCA CCAGGGTCCG ATCAATTCGC CCACGGTATT GAATTCCAGC
CTGAGTCTGG CCCAATTCTG GGACGGTCGC GCCAAGGACC TGAAAGATCA GGCGGGCGGT
CCCATTGCCA ATCCGGGGGA AATGGCATTC AGCCATGAAT TGGCAGTGGG CGTGCTGCAA
TCCATTCCCC AGTACAGGGC GCGCTTCAAG CAGATATACA GTTCGGACAA GGTCGATATA
GGCATGGCAA CGGACGCGAT CGCTGCCTTC GAAGAAACAC TGGTAACGCC GGATTCCCGT
TTCGACAAAT GGCTCAAAGG CGACAAGAAC GCCATCAACA AGACGGAACT CGAAGGGTAC
AAACTGTTCA AGGACGCGGG CTGCACAGGT TGTCACAACG GACCGGCCGT AGGCGGGGCA
TCGTTTCAGA AAATGGGCGT ACTTGAACCC TATAAAACCC AGAGCAAGGC TGAAGGCCGT
TTTGCCGTAA CCGGCAAAGA GGAGGACCGC CTGTTCTTCA AAGTGCCTAC ATTGCGAAAT
GTGGAATTGA CCTACCCCTA TTTCCATGAC GGGGCCGCGG CAACCCTGGA AGACGCGGTA
AATACCATGG GCCGGATACA ATTGGGGCGT AATTTCACCA AGGACGAAAA TGCCAAAATC
GTGGCATTTC TGAAGACATT GACCGGCAAA CAACCCCATC TCACCTTGCC TATTCTCCCC
CCCTCGAGCA AGGATACACC CAAACCTCAT CCGTTCGATT GA
 
Protein sequence
MILRALITSL LAVGTIVAPP AQAAAADEPI KPIEAAKPKN ENKVELGKML FFDPRLSKSG 
FISCNSCHNL SMGGSDNLPS SIGHKWHQGP INSPTVLNSS LSLAQFWDGR AKDLKDQAGG
PIANPGEMAF SHELAVGVLQ SIPQYRARFK QIYSSDKVDI GMATDAIAAF EETLVTPDSR
FDKWLKGDKN AINKTELEGY KLFKDAGCTG CHNGPAVGGA SFQKMGVLEP YKTQSKAEGR
FAVTGKEEDR LFFKVPTLRN VELTYPYFHD GAAATLEDAV NTMGRIQLGR NFTKDENAKI
VAFLKTLTGK QPHLTLPILP PSSKDTPKPH PFD