Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1820 |
Symbol | |
ID | 4022302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2037686 |
End bp | 2040913 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962014 |
Product | cytochrome P450 |
Protein accession | YP_568957 |
Protein GI | 91976298 |
COG category | [P] Inorganic ion transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0369] Sulfite reductase, alpha subunit (flavoprotein) [COG2124] Cytochrome P450 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTCCA CCAACAAGCT CGATCCAATT CCGCATCCGC CGAAGAAGCC GGTGGTCGGC AACATGCTGT CGCTCGATAC GACGGCCCCG GTGCAGCATC TGGTGCGGCT CGCCAAGGAG CTCGGGCCGA TCTTCTGGCT CGACATGATG GGCGCGCCGC TGGTGATCGT GTCGGGTTAC GATCTGGTCG ACGAGATCAG CGACGAGAAG CGGTTCGACA AGGCGGTGCG CGGCGCGCTG CGCCGGGCGC GCGCGGTCGG CGGCGACGGC CTGTTCACCG CCGACACCAA GGAGCCGAAC TGGAGCAAGG CGCACAACAT TCTGCTGACG CCGTTCGGCG GCCGCGCGAT GCAGTCGTAT CACCCGAGCA TGGTCGATAT CGCCGAGCAG CTCGTGAAGA AGTGGGAGCG GCTCAACGCC GACGACGAGA TCGACGTCGT CCACGACATG ACCGCGCTGA CGCTCGACAC CATCGGCCTG TGCGGCTTCG ACTATCGCTT CAATTCGTTC TATCGCCGCG ACTACCACCC CTTCGTGGAA TCGCTGGTGC GCTCGCTCGA GACCATCATG ATGACCCGCG GCCTGCCGCT GGAAAATCTC TGGATGAAGA AGCGGCGGGA GACGCTCGCC GACGATGTCG TCTTCATGAA TGCGATGGTC GACGAAATCA TCGCCGAGCG CCGCAAGGCG TCGGAAAGCG CCGCCGACAA GAAGGACATG CTCGGCGCGA TGTTGGCGGG CGTCGACCGC GCCACCGGCG AGCCGCTCGA CGACGTCAAC ATCCGCTACC AGATCAATAC GTTCCTGATC GCCGGCCACG AGACCACCAG CGGGCTGTTG TCCTGCGCGA TCTACGCGCT GCTGAAGCAT CCCGACGTGT TGCAGAAGGC GTATGACGAG GTCGACCGCG TGCTCGGCTC CGACACCGCC GTCCGGCCGA GCTATCAGCA GGTCAACCAG CTCAGCTACA TCACGCAGAT TCTGAAAGAG ACGCTGCGAA TGTGGCCGCC GGCGCCGGCC TACGGCGTCG CGCCGATCAA GGACGAAGTG ATCGGCGGCA AATATCATCT GAAGCGCGGC ACCTTCGTCA CCGTGCTGGT GCTGGCGCTG CATCGCGACC CGGCGATCTG GGGGCCGAAC CCGGACGCGT TCGATCCGGA GAATTTTTCG CGGGAAGCCG AATCGAAGCG GCCCGCCAAT GCCTGGAAGC CGTTCGGCAA CGGCCAGCGC GCCTGCATCG GCCGGGGCTT CGCGATGCAC GAGGCGGCGC TGGCGCTCGG CATGATCCTG CAGCGCTTCC AGCTGATCGA TCACCAGCGC TATCGCATGG TGCTGAAGGA GACGCTGACG ATCAAGCCCG AGGGCTTCAA GATCAAGGTG CGTCCGCGCA GCGACAAGGA CCGCGGCGAT TTCGTCGCGG CCGGCGCATC GCAAGTTTCG ACGCCGGCTC TGGCCCAGGC CGCGCCGCGC GCGCGTCCGG ACCACAACAC GCCGCTGCTG GTGCTGTACG GCTCCAACCT CGGCACCGCC GAGGAGCTGG CGACCCGCGT CGCCGATCTC GCCGAACTCA ACGGCTTTTC GACGCGGCTC GGTGCGCTCG ATCAATATGT CGGGCACTTG CCGGAAGAGG GCGGCGTGCT GATCTTCACC GCCTCCTACA ACGGCGCGCC GCCGGACAAT GCGACCCAGT TCGTGCAATG GCTGTCTGGC GATCTGCCGA AGGATGCGTT CGCCAAGCTG CGCTACGCCG TGTTCGGCTG TGGCAATCGC GACTGGACCG CGACCTATCA GGCGATCCCG CGGCTGGTCG ACGAGCGGCT CGCCGCGCAT GGCGGCCGCA ACATCTTCCT GCGCGGCGAG GGCGACGCCC GCGACGATCT CGAAGGCCAG TTCGAATCCT GGTTCGCCAA ACTCGGCCCG CTGGCGGTGA AGGAGTTCGG GATCGACGCC AAATTCGCTC GCGCGGTCGA TGATGCGCCG CTGTACCGGA TCGAGCCGGT GGCGCCCGCA GCGGGGAACG CGGTCGCCGC AGCGGGGGGC GCGGTGCCGA TGAAGGTGCT CGCCAATCGC GAGCTGCAGG ATTGCGCCGC CTCGGGGCGC TCGACCCGCC ATATCGAGAT CGCGCTGCCG GAAGGGATCA GCTATCGCGT CGGCGACCAC CTCAGCGTGA TGCCGCGCAA CGATCCGGCG CTGGTCGCCG CCGTCGCGCA GCGGCTCGGC TTTGCGCCGG ATGATCAGAT CAAGCTGCAG GTCGCGCCCG GCCGCCGCGC GCAATTGCCG ATCGGCGAAG CGATTTCGGT CGGCCGCCTG CTCGGCGACT TCGTCGAACT GCAGCAGGTC GCGACCCGCA AGCAGATCGC AGTCATGGCC GAGCACACGC GCTGTCCGCA GACCCGGCCG AAGCTGCAAG CGCTCGCCGG CGGCGATGGC GCTGCCGACG AGGCCTATCG CGCCGGCGTT CTGGCGAAGC GCAAGTCGGT CTATGATCTG ATGCAGGAGC ATCCCGCCTG CGAGTTGCCG CTGCACGCTT ATCTGGAAAT GCTGTCGCCG CTGGCGCCGC GCTACTACTC GATCTCGTCG TCGCCGTTGC GCGATCCGTC GCGCGCCGCG ATCACCGTCG CCGTGGTCGA TGGCCCGGCA TTGTCCGGCC GTGGTCATTA TCGCGGCGTC TGCTCGACCT GGCTCGCCGG CCGAAGCGTC GGCGACACCA TCCACGCCAC GGTGCGCGCG ACCAAAGCAG GTTTCCGCCT GCCCGACGAC GACCGCGTGC CGCTGATCAT GATCGGGCCG GGCACCGGGC TCGCGCCGTT CCGCGGCTTC CTGCAGGAGC GCGCCGCGCG CCAGCAGAAC GGCGCGACGC TCGGTCCGGC GCTGCTGTTC TTCGGCTGCC GACATCCGGC GCAGGACTAT CTCTATGCCG ACGAGCTGCA GGGCTTCGCT GCCGAGGGCG TCGTCGAGCT GCATACCGCG TTCTCGCGCG GCGAGGGGCC CAAGACCTAT GTGCAGCATC TGATTGCCGC GCAGAAGGAT CGGGTGTTCA CGTTGATCGA GCAGGGCGCG ATCATCTATG TCTGTGGCGA CGGCGGCAAA ATGGAGCCCG ACGTCAGGGC GGCGCTGATG GCGATCCATC GCGAGCGCAG CGGCGCCGAT GCTGCGGCGG CGTCGACATG GATCGACGAT CTCGGCGCAT GCAATCGCTA TGTGCTCGAC GTCTGGGCGA GCGCGTAA
|
Protein sequence | MPSTNKLDPI PHPPKKPVVG NMLSLDTTAP VQHLVRLAKE LGPIFWLDMM GAPLVIVSGY DLVDEISDEK RFDKAVRGAL RRARAVGGDG LFTADTKEPN WSKAHNILLT PFGGRAMQSY HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE SLVRSLETIM MTRGLPLENL WMKKRRETLA DDVVFMNAMV DEIIAERRKA SESAADKKDM LGAMLAGVDR ATGEPLDDVN IRYQINTFLI AGHETTSGLL SCAIYALLKH PDVLQKAYDE VDRVLGSDTA VRPSYQQVNQ LSYITQILKE TLRMWPPAPA YGVAPIKDEV IGGKYHLKRG TFVTVLVLAL HRDPAIWGPN PDAFDPENFS REAESKRPAN AWKPFGNGQR ACIGRGFAMH EAALALGMIL QRFQLIDHQR YRMVLKETLT IKPEGFKIKV RPRSDKDRGD FVAAGASQVS TPALAQAAPR ARPDHNTPLL VLYGSNLGTA EELATRVADL AELNGFSTRL GALDQYVGHL PEEGGVLIFT ASYNGAPPDN ATQFVQWLSG DLPKDAFAKL RYAVFGCGNR DWTATYQAIP RLVDERLAAH GGRNIFLRGE GDARDDLEGQ FESWFAKLGP LAVKEFGIDA KFARAVDDAP LYRIEPVAPA AGNAVAAAGG AVPMKVLANR ELQDCAASGR STRHIEIALP EGISYRVGDH LSVMPRNDPA LVAAVAQRLG FAPDDQIKLQ VAPGRRAQLP IGEAISVGRL LGDFVELQQV ATRKQIAVMA EHTRCPQTRP KLQALAGGDG AADEAYRAGV LAKRKSVYDL MQEHPACELP LHAYLEMLSP LAPRYYSISS SPLRDPSRAA ITVAVVDGPA LSGRGHYRGV CSTWLAGRSV GDTIHATVRA TKAGFRLPDD DRVPLIMIGP GTGLAPFRGF LQERAARQQN GATLGPALLF FGCRHPAQDY LYADELQGFA AEGVVELHTA FSRGEGPKTY VQHLIAAQKD RVFTLIEQGA IIYVCGDGGK MEPDVRAALM AIHRERSGAD AAAASTWIDD LGACNRYVLD VWASA
|
| |