Gene Nmul_A0679 details

Gene Information

Locus tag: Nmul_A0679
Symbol:
ID: 3784056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 777961
End bp: 779508
Gene Length: 1548 bp
Protein Length: 515 aa
Translation table: 11
GC content: 52%
IMG OID: 637810761
Product: cytochrome c peroxidase
Protein accession: YP_411378
Protein GI: 82701812
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
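
The coordinates and lengths listed above agree with each other if the start and end positions are read as 1-based and inclusive (an assumption; the record does not state the convention). A minimal sketch of that arithmetic, with hypothetical helper names `gene_length` and `protein_length`:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
start_bp, end_bp = 777961, 779508

length = gene_length(start_bp, end_bp)
print(length)                  # 1548
print(protein_length(length))  # 515
```

The 1548 bp span corresponds to 516 codons, that is 515 amino acids plus one stop codon, which matches the reported protein length.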


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAAA ATATACATAT CCTTATCCTG AGTACAGGAT TCCTCAGCTT GAGTGCGGGA 
ATGAATGTTG TCTTGGGTCA GGACGATCAC GCGAAGAATT CCGCAAATCG TGCCCCGCAT
GATGCGGTTG AAGATGCCCA GACCCCTCGC CGCGTCTCTG CCACCCGTCC GGAGCAGAAA
AAAGACGAAC ACGATCGTTC CAATATATTT GATGCAAAGA ATGCAATGCC TTCCTCGCAA
GCTTTTGGGA ATCAGCCTGA CAACGGAAAG GTTATGGGAT TTGATTTTTC CCGGGATGCG
TTCAACGCCA AGAAGCCGAT GCAAACCTTT GAGGAAATCA TGAAAGCGGA TATGGCTGAG
CGACCAAAGG TAATGCAAAC GCAGCGTGAA TTGCTGCAGC GGCGGTATGA TTTGAAGCCT
CGGCTGGATA AGGAGGCAAA AATGTCGCGC GGCAAGCCGT TGGCGGTGGG GCCCACGGCA
CGCTTATCGT CCGGCATGAC GTGGCAAAGA CTCGCCGGAA TCGCGCCTGA AGAGATAAAG
AGCAAAAACG CATTTCCGTA TCCAGCCCTT CCGCATCCCA AGCAATCCAC GGGAGGGCAG
GTGTTTCCCC AAATGCAGAT CGATATGTTT CCTCGGCTGC AAAGGTTTGA TGTCGATTTC
GATTTGCCCG AGGCATTCTT GCCGGAGTTT CCACCAGCGA TATTTCTCCA GAATCGACCT
GAGTTAGGGG ATGTGTCGCG TGGAGAGGTG GTTTCCATCA ATAATTTTCA CAGGTTATTC
AAGGACATCC TGACCCCCGT ACAGCTCAAC GGGTTGCAGA TGTTACTCAC GCCACTGCCT
CAGGAAGAAT TTAATCCGAC GGATGATCGT AAAAGTAACG AACCGAGTCT TGGAGTAGCT
TGCCTCGATT GCCATGTCAA CGGCCACACG ACAGGGCAAT TTCATCTGAG TCCCGATATC
CGCCCGGAAG AACGCCGATT CCGTCTCGAC ACGGTAAGCC TGCGCGGTAT GTATAACCAG
CAGATACACG GTTCGAAACG TAGCTTGCGT TCAGTGGAAG ACTTCACGGA GTTCGAGTTC
CGGACTGCTT ATTTCAATGG CGATCCGATC CGGGCAATGA AAAAAGGCTT TCCCGTATTT
GACCGGATTC ATGTAAGTCA TATGGCACAG CTTCAGAACA TGTTCGATTT CCCGCCTGCG
CCAAAACTCA AGAGTGACGG GAGACTTGAT CCGGCCAAAG CCACGGAAAA GGAAAGGCGC
GGGGAGAAAC TCTTCTTCGG AAAGGGGCAG TGCGCAAGCT GCCACCAGCC TCCCGCATTC
CTGGATCACC AAATGCATGA TCTGAAGCTC GAACGCTTCC TGAATGAACC GGGGGATGGG
CCGATAAAGA CTTTCACACT GCGGGGGATC AAGGACAGTC CGCCATACCT GCATGATGGC
CGTTTGCTGA CCCTTGAAGA TACAGTCGAG TTTTTCAATC TGGTACTGGA GCTCAAGCTC
ACAGACGAGG AGAAGTCCGA TCTGGTCGCG TATATGCGTC AACTGTAA
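
The gene sequence is displayed in space-separated blocks of 10 bases, so the blocks have to be joined before any per-base statistic is computed. The sketch below shows one way a GC percentage like the one in the record could be recomputed. The helper names `clean_sequence` and `gc_content` are hypothetical, and only the first displayed line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGATGAAAA ATATACATAT CCTTATCCTG AGTACAGGAT TCCTCAGCTT GAGTGCGGGA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq), 1))
```

Passing the full 1548 bp sequence through the same two functions would give the whole-gene value to compare with the reported GC content.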
 
Protein sequence
MMKNIHILIL STGFLSLSAG MNVVLGQDDH AKNSANRAPH DAVEDAQTPR RVSATRPEQK 
KDEHDRSNIF DAKNAMPSSQ AFGNQPDNGK VMGFDFSRDA FNAKKPMQTF EEIMKADMAE
RPKVMQTQRE LLQRRYDLKP RLDKEAKMSR GKPLAVGPTA RLSSGMTWQR LAGIAPEEIK
SKNAFPYPAL PHPKQSTGGQ VFPQMQIDMF PRLQRFDVDF DLPEAFLPEF PPAIFLQNRP
ELGDVSRGEV VSINNFHRLF KDILTPVQLN GLQMLLTPLP QEEFNPTDDR KSNEPSLGVA
CLDCHVNGHT TGQFHLSPDI RPEERRFRLD TVSLRGMYNQ QIHGSKRSLR SVEDFTEFEF
RTAYFNGDPI RAMKKGFPVF DRIHVSHMAQ LQNMFDFPPA PKLKSDGRLD PAKATEKERR
GEKLFFGKGQ CASCHQPPAF LDHQMHDLKL ERFLNEPGDG PIKTFTLRGI KDSPPYLHDG
RLLTLEDTVE FFNLVLELKL TDEEKSDLVA YMRQL
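
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python translation, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in the permitted start codons). The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence from this record.
first_line = "ATGATGAAAA ATATACATAT CCTTATCCTG AGTACAGGAT TCCTCAGCTT GAGTGCGGGA"
print(translate(first_line))  # MMKNIHILILSTGFLSLSAG
```

The output matches the first two blocks of the protein sequence above. Applying `translate` to the complete gene sequence allows the same comparison over all 515 residues.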