Gene Information |
Locus tag | Nmul_A0068 |
Symbol | |
ID | 3785792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 73643 |
End bp | 74869 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637810137 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_410769 |
Protein GI | 82701203 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase |
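
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


gene_len = gene_length_bp(73643, 74869)
print(gene_len)                     # 1227
print(protein_length_aa(gene_len))  # 408
```

Both values match the Gene Length (1227 bp) and Protein Length (408 aa) rows of this record.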
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCAA TTATTTCTCC CGCAGCGATT CTCGGTTCCC AGACCCTCCA GCTGAAGTCT CTGCCCCCGC TGAGCCTTTA CATTCACATT CCCTGGTGCA TGAAAAAATG CCCCTATTGC GATTTCAACT CGTATGAAGT TCGAGACCGG AGTGGCGGCA TGCCGGAGGC CGAGTATGTA GCCGCCCTCA TCCGCGACCT GGAAGCTTCA CTGCCCCAGA TATGGGGGCG CAAGGTCATC AGTATTTTTT TCGGGGGTGG AACGCCGAGC CTGTTCAGTC CTCATTCCAT CGATGGAATT CTTGCTGCCG TCAGGGCCTT GCTGCCGCTT GAGCACTTGG CTGAAATAAC GCTGGAGGCC AATCCCGGAA CGTTCGAGGC ACAAAGATTT GCGGATTTCC AGGCTGCCGG CATCAATCGC CTCTCCATCG GCATTCAGAG TTTCAACGCC CGTCACCTTG CATCACTGGG CCGCATACAT GACGGCAAGG ACGCTCGTCG TGCGATCGAA ATCGCGCAAA AGAACTTCGA CAACATCAAT CTCGACCTGA TGTATGGGCT GCCGAACCAG ACGCTGGAAG AAGCGCGGGA GGATATCGAG ACAGCGATTG CTCATGGCGT GCAGCATATT TCCGCCTATC ATCTGGCGCT GGAACCCAAT ACATTGTTTC ATCGTTATCC ACCCTCCCTG CCCGATGATG AGCTGACGGC GGACATGCAG GCAATGATCG AACAAACGCT CGCGCGGGAA GGTTATGCAA ATTATGAAAC ATCGGCCTTT GCCCGACCTG GCCGGGAATC CCGTCACAAC ATGAACTACT GGCTGTTTGG AGATTATCTG GGGATCGGGG CCGGAGCGCA CAGCAAAATA AGCTTTCGGG ACAGGATAGT GCGCCAGATG CGGTACAGGC AGCCGAAGGA GTATCTGATC AAATCGGTCC CTGAAATGGC ATCTGAGCCC CCTGTCATGG AGCAGCACGA AGTAGGACGG AATGATCGCG CGTTTGAATT CGTGATGAAC GCACTACGCC TGACCGACGG GTTTGCACCG CAAATGTTCA TTGAACGCAC AGGACTGGCT CTCACTCACA TCCAGCGCCA GCTCGACGAG GCGGAACGGC GGCAGTTGAT AACGCGGGAT TTTCAGCGTA TTGCGCCTAC TTTCGCAGGC AGGCGTTTTT TGAACGATTT GCTGCAGATT TTCCTTCCCG GGCAAAGCCG CGTATAA
|
Protein sequence | MTAIISPAAI LGSQTLQLKS LPPLSLYIHI PWCMKKCPYC DFNSYEVRDR SGGMPEAEYV AALIRDLEAS LPQIWGRKVI SIFFGGGTPS LFSPHSIDGI LAAVRALLPL EHLAEITLEA NPGTFEAQRF ADFQAAGINR LSIGIQSFNA RHLASLGRIH DGKDARRAIE IAQKNFDNIN LDLMYGLPNQ TLEEAREDIE TAIAHGVQHI SAYHLALEPN TLFHRYPPSL PDDELTADMQ AMIEQTLARE GYANYETSAF ARPGRESRHN MNYWLFGDYL GIGAGAHSKI SFRDRIVRQM RYRQPKEYLI KSVPEMASEP PVMEQHEVGR NDRAFEFVMN ALRLTDGFAP QMFIERTGLA LTHIQRQLDE AERRQLITRD FQRIAPTFAG RRFLNDLLQI FLPGQSRV
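
The relationship between the gene sequence and the protein sequence above can be sketched as follows. This is a minimal illustration, not the pipeline that produced this record: it assumes the sequence is plain A/C/G/T text in space-separated blocks, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGACAGCAA TTATTTCTCC CGCAGCGATT"))  # MTAIISPAAI
```

Passing the full gene sequence to `translate` should reproduce the protein sequence row, and `gc_content` on the same input can be compared with the GC content field. A gene that begins with an alternative start codon would need its first residue set to M explicitly, which this sketch does not do; the gene here starts with ATG, so that case does not arise.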
|
| |