Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0679 |
Symbol | |
ID | 3784056 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 777961 |
End bp | 779508 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637810761 |
Product | cytochrome c peroxidase |
Protein accession | YP_411378 |
Protein GI | 82701812 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAA ATATACATAT CCTTATCCTG AGTACAGGAT TCCTCAGCTT GAGTGCGGGA ATGAATGTTG TCTTGGGTCA GGACGATCAC GCGAAGAATT CCGCAAATCG TGCCCCGCAT GATGCGGTTG AAGATGCCCA GACCCCTCGC CGCGTCTCTG CCACCCGTCC GGAGCAGAAA AAAGACGAAC ACGATCGTTC CAATATATTT GATGCAAAGA ATGCAATGCC TTCCTCGCAA GCTTTTGGGA ATCAGCCTGA CAACGGAAAG GTTATGGGAT TTGATTTTTC CCGGGATGCG TTCAACGCCA AGAAGCCGAT GCAAACCTTT GAGGAAATCA TGAAAGCGGA TATGGCTGAG CGACCAAAGG TAATGCAAAC GCAGCGTGAA TTGCTGCAGC GGCGGTATGA TTTGAAGCCT CGGCTGGATA AGGAGGCAAA AATGTCGCGC GGCAAGCCGT TGGCGGTGGG GCCCACGGCA CGCTTATCGT CCGGCATGAC GTGGCAAAGA CTCGCCGGAA TCGCGCCTGA AGAGATAAAG AGCAAAAACG CATTTCCGTA TCCAGCCCTT CCGCATCCCA AGCAATCCAC GGGAGGGCAG GTGTTTCCCC AAATGCAGAT CGATATGTTT CCTCGGCTGC AAAGGTTTGA TGTCGATTTC GATTTGCCCG AGGCATTCTT GCCGGAGTTT CCACCAGCGA TATTTCTCCA GAATCGACCT GAGTTAGGGG ATGTGTCGCG TGGAGAGGTG GTTTCCATCA ATAATTTTCA CAGGTTATTC AAGGACATCC TGACCCCCGT ACAGCTCAAC GGGTTGCAGA TGTTACTCAC GCCACTGCCT CAGGAAGAAT TTAATCCGAC GGATGATCGT AAAAGTAACG AACCGAGTCT TGGAGTAGCT TGCCTCGATT GCCATGTCAA CGGCCACACG ACAGGGCAAT TTCATCTGAG TCCCGATATC CGCCCGGAAG AACGCCGATT CCGTCTCGAC ACGGTAAGCC TGCGCGGTAT GTATAACCAG CAGATACACG GTTCGAAACG TAGCTTGCGT TCAGTGGAAG ACTTCACGGA GTTCGAGTTC CGGACTGCTT ATTTCAATGG CGATCCGATC CGGGCAATGA AAAAAGGCTT TCCCGTATTT GACCGGATTC ATGTAAGTCA TATGGCACAG CTTCAGAACA TGTTCGATTT CCCGCCTGCG CCAAAACTCA AGAGTGACGG GAGACTTGAT CCGGCCAAAG CCACGGAAAA GGAAAGGCGC GGGGAGAAAC TCTTCTTCGG AAAGGGGCAG TGCGCAAGCT GCCACCAGCC TCCCGCATTC CTGGATCACC AAATGCATGA TCTGAAGCTC GAACGCTTCC TGAATGAACC GGGGGATGGG CCGATAAAGA CTTTCACACT GCGGGGGATC AAGGACAGTC CGCCATACCT GCATGATGGC CGTTTGCTGA CCCTTGAAGA TACAGTCGAG TTTTTCAATC TGGTACTGGA GCTCAAGCTC ACAGACGAGG AGAAGTCCGA TCTGGTCGCG TATATGCGTC AACTGTAA
|
Protein sequence | MMKNIHILIL STGFLSLSAG MNVVLGQDDH AKNSANRAPH DAVEDAQTPR RVSATRPEQK KDEHDRSNIF DAKNAMPSSQ AFGNQPDNGK VMGFDFSRDA FNAKKPMQTF EEIMKADMAE RPKVMQTQRE LLQRRYDLKP RLDKEAKMSR GKPLAVGPTA RLSSGMTWQR LAGIAPEEIK SKNAFPYPAL PHPKQSTGGQ VFPQMQIDMF PRLQRFDVDF DLPEAFLPEF PPAIFLQNRP ELGDVSRGEV VSINNFHRLF KDILTPVQLN GLQMLLTPLP QEEFNPTDDR KSNEPSLGVA CLDCHVNGHT TGQFHLSPDI RPEERRFRLD TVSLRGMYNQ QIHGSKRSLR SVEDFTEFEF RTAYFNGDPI RAMKKGFPVF DRIHVSHMAQ LQNMFDFPPA PKLKSDGRLD PAKATEKERR GEKLFFGKGQ CASCHQPPAF LDHQMHDLKL ERFLNEPGDG PIKTFTLRGI KDSPPYLHDG RLLTLEDTVE FFNLVLELKL TDEEKSDLVA YMRQL
|
| |