Gene Information |
Locus tag | RPB_3865 |
Symbol | |
ID | 3911669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4420208 |
End bp | 4421557 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637885766 |
Product | coproporphyrinogen III oxidase |
Protein accession | YP_487469 |
Protein GI | 86750973 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases |
TIGRFAM ID | [TIGR00538] oxygen-independent coproporphyrinogen III oxidase |
|
|
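The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4420208
end_bp = 4421557
gene_length_bp = 1350
protein_length_aa = 449


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one, assuming a terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# 4421557 - 4420208 + 1 = 1350 bp
assert span_length(start_bp, end_bp) == gene_length_bp
# 1350 / 3 = 450 codons, minus the stop codon = 449 aa
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions the record is internally consistent: the coordinate span matches the gene length, and the gene length matches the protein length.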
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.955864 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTCTG CCCTCGACAA ATACGCCAAG AGCTCGGTGC CGCGCTACAC CAGCTATCCG ACGGCGCCGC ATTTCGCCAA GGATTTTCCG GAGGCCGTGT ATCGTGGCTG GCTCGCCCAG CTCGACACCG ACGAGCCGAT CTCGCTTTAT CTCCATGTGC CGTTCTGCAA GCAGATGTGC TGGTACTGCG GCTGCAACAT GAAGCTGGCG TCGAAATACG GGCCGGTCGC CGACTATGTC GAGAGCCTGA TCAACGAGAT CGATCTGGTC GCCGATGCGA TGCCCGGCAC GATGCCGGTC CGCCATCTGC ATTTCGGGGG CGGCACGCCG ACGGTGATCG AGCCCGAGGA CCTCGGCGCG ATCATGACGC TGCTGCGCGA GCGCTTCGAA TTCGTGCCCG ATGCGGAGCT CGCGATCGAG AGCGATCCGC GCACCCTGAC CGACGAGATG GTCGCCAAGA TCGGCGAACT CGGCTTCACC CGCGCCAGCT TCGGCGTCCA GGAGTTCGAC CCCAAGGTCC AGGCGGCGAT CAACCGGATT CAGCCGCCGG AGATGGTCGC GCACGCCATG AGCCGCTTCA AGGCGGCCGG CGTCGAGCGG ATCAATTTCG ACCTGATCTA CGGCCTGCCC TATCAGACCG CCGAAGATCT GCGTAGCACC GTCGAGCAGT GCGTCGAGAT GAAGCCCGAC CGCGTCGCGC TGTTCGGCTA CGCCCATGTG CCGTGGGTCG CCAAGAACCA GCGGATGATC CCGGACGATT CGCTGCCGCA GTCGGATCTG CGCGCCGAGC AGGCCGACGC CGCCGCCGAG GCGCTGGTCA AGGGCGGCTA TGTGCGGATC GGCATCGATC ACTTCGCGCT GCCCAAGGAT TCGCTGGCGA TCGCCGCCGC GACCGGCGAA CTGCACCGCA ATTTTCAGGG CTACACCAGC GACGCGGCGC AGACCCTGAT CGGGATCGGC GCCACCTCGA TCGGTCGGAC CCCGAGCGGC TATCTGCAGA ACATCAGCGA GACCGGCGCC TGGGCGCGCG CCGTCGCGGC CGGCCAGTTG CCGGTGGCGC GCGGGCACGC CCTGACCCAG CAGGACAATC TGCGGGCGCA TGTGATCGAA CGGATCATGT GCGACGGCAA GATCGACCTC GCCGCCGCCG GCCGCGCTTT CGGCTGCAGC GATGATTGGT ATGCGCCCGA GCAGGATGCG CTCGCCGAGC TGCAGCGCGA CGGCGCCGTG ATCTGCGACC AGGGCAAGCT GACGCTGACG CCGGACGGCG TCCGGCTGTC ACGCGTGGTC GCCGCGGTGT TCGATACCTA CCTGCGGAAT TCGTCGGTGC GGCATTCCAT CGCCGTCTGA
|
Protein sequence | MSSALDKYAK SSVPRYTSYP TAPHFAKDFP EAVYRGWLAQ LDTDEPISLY LHVPFCKQMC WYCGCNMKLA SKYGPVADYV ESLINEIDLV ADAMPGTMPV RHLHFGGGTP TVIEPEDLGA IMTLLRERFE FVPDAELAIE SDPRTLTDEM VAKIGELGFT RASFGVQEFD PKVQAAINRI QPPEMVAHAM SRFKAAGVER INFDLIYGLP YQTAEDLRST VEQCVEMKPD RVALFGYAHV PWVAKNQRMI PDDSLPQSDL RAEQADAAAE ALVKGGYVRI GIDHFALPKD SLAIAAATGE LHRNFQGYTS DAAQTLIGIG ATSIGRTPSG YLQNISETGA WARAVAAGQL PVARGHALTQ QDNLRAHVIE RIMCDGKIDL AAAGRAFGCS DDWYAPEQDA LAELQRDGAV ICDQGKLTLT PDGVRLSRVV AAVFDTYLRN SSVRHSIAV
|
| |
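The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-percentage calculation; the helper names are hypothetical, and only the first three blocks of the gene sequence are used as a demonstration.

```python
def clean_sequence(text):
    """Remove the whitespace that splits the sequence into display blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First three blocks of the gene sequence, copied from the record above.
fragment = clean_sequence("ATGTCGTCTG CCCTCGACAA ATACGCCAAG")
print(len(fragment), round(gc_percent(fragment), 1))
```

Applying the same two functions to the full gene sequence gives values that can be compared with the Gene Length and GC content fields of the record.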