Gene RPD_1820 details

Gene Information

Locus tag: RPD_1820
Symbol:
ID: 4022302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2037686
End bp: 2040913
Gene Length: 3228 bp
Protein Length: 1075 aa
Translation table: 11
GC content: 67%
IMG OID: 637962014
Product: cytochrome P450
Protein accession: YP_568957
Protein GI: 91976298
COG category: [P] Inorganic ion transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0369] Sulfite reductase, alpha subunit (flavoprotein)
        [COG2124] Cytochrome P450
TIGRFAM ID:
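
Note: the coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2037686
end_bp = 2040913
reported_gene_length = 3228      # bp
reported_protein_length = 1075   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.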


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTCCA CCAACAAGCT CGATCCAATT CCGCATCCGC CGAAGAAGCC GGTGGTCGGC 
AACATGCTGT CGCTCGATAC GACGGCCCCG GTGCAGCATC TGGTGCGGCT CGCCAAGGAG
CTCGGGCCGA TCTTCTGGCT CGACATGATG GGCGCGCCGC TGGTGATCGT GTCGGGTTAC
GATCTGGTCG ACGAGATCAG CGACGAGAAG CGGTTCGACA AGGCGGTGCG CGGCGCGCTG
CGCCGGGCGC GCGCGGTCGG CGGCGACGGC CTGTTCACCG CCGACACCAA GGAGCCGAAC
TGGAGCAAGG CGCACAACAT TCTGCTGACG CCGTTCGGCG GCCGCGCGAT GCAGTCGTAT
CACCCGAGCA TGGTCGATAT CGCCGAGCAG CTCGTGAAGA AGTGGGAGCG GCTCAACGCC
GACGACGAGA TCGACGTCGT CCACGACATG ACCGCGCTGA CGCTCGACAC CATCGGCCTG
TGCGGCTTCG ACTATCGCTT CAATTCGTTC TATCGCCGCG ACTACCACCC CTTCGTGGAA
TCGCTGGTGC GCTCGCTCGA GACCATCATG ATGACCCGCG GCCTGCCGCT GGAAAATCTC
TGGATGAAGA AGCGGCGGGA GACGCTCGCC GACGATGTCG TCTTCATGAA TGCGATGGTC
GACGAAATCA TCGCCGAGCG CCGCAAGGCG TCGGAAAGCG CCGCCGACAA GAAGGACATG
CTCGGCGCGA TGTTGGCGGG CGTCGACCGC GCCACCGGCG AGCCGCTCGA CGACGTCAAC
ATCCGCTACC AGATCAATAC GTTCCTGATC GCCGGCCACG AGACCACCAG CGGGCTGTTG
TCCTGCGCGA TCTACGCGCT GCTGAAGCAT CCCGACGTGT TGCAGAAGGC GTATGACGAG
GTCGACCGCG TGCTCGGCTC CGACACCGCC GTCCGGCCGA GCTATCAGCA GGTCAACCAG
CTCAGCTACA TCACGCAGAT TCTGAAAGAG ACGCTGCGAA TGTGGCCGCC GGCGCCGGCC
TACGGCGTCG CGCCGATCAA GGACGAAGTG ATCGGCGGCA AATATCATCT GAAGCGCGGC
ACCTTCGTCA CCGTGCTGGT GCTGGCGCTG CATCGCGACC CGGCGATCTG GGGGCCGAAC
CCGGACGCGT TCGATCCGGA GAATTTTTCG CGGGAAGCCG AATCGAAGCG GCCCGCCAAT
GCCTGGAAGC CGTTCGGCAA CGGCCAGCGC GCCTGCATCG GCCGGGGCTT CGCGATGCAC
GAGGCGGCGC TGGCGCTCGG CATGATCCTG CAGCGCTTCC AGCTGATCGA TCACCAGCGC
TATCGCATGG TGCTGAAGGA GACGCTGACG ATCAAGCCCG AGGGCTTCAA GATCAAGGTG
CGTCCGCGCA GCGACAAGGA CCGCGGCGAT TTCGTCGCGG CCGGCGCATC GCAAGTTTCG
ACGCCGGCTC TGGCCCAGGC CGCGCCGCGC GCGCGTCCGG ACCACAACAC GCCGCTGCTG
GTGCTGTACG GCTCCAACCT CGGCACCGCC GAGGAGCTGG CGACCCGCGT CGCCGATCTC
GCCGAACTCA ACGGCTTTTC GACGCGGCTC GGTGCGCTCG ATCAATATGT CGGGCACTTG
CCGGAAGAGG GCGGCGTGCT GATCTTCACC GCCTCCTACA ACGGCGCGCC GCCGGACAAT
GCGACCCAGT TCGTGCAATG GCTGTCTGGC GATCTGCCGA AGGATGCGTT CGCCAAGCTG
CGCTACGCCG TGTTCGGCTG TGGCAATCGC GACTGGACCG CGACCTATCA GGCGATCCCG
CGGCTGGTCG ACGAGCGGCT CGCCGCGCAT GGCGGCCGCA ACATCTTCCT GCGCGGCGAG
GGCGACGCCC GCGACGATCT CGAAGGCCAG TTCGAATCCT GGTTCGCCAA ACTCGGCCCG
CTGGCGGTGA AGGAGTTCGG GATCGACGCC AAATTCGCTC GCGCGGTCGA TGATGCGCCG
CTGTACCGGA TCGAGCCGGT GGCGCCCGCA GCGGGGAACG CGGTCGCCGC AGCGGGGGGC
GCGGTGCCGA TGAAGGTGCT CGCCAATCGC GAGCTGCAGG ATTGCGCCGC CTCGGGGCGC
TCGACCCGCC ATATCGAGAT CGCGCTGCCG GAAGGGATCA GCTATCGCGT CGGCGACCAC
CTCAGCGTGA TGCCGCGCAA CGATCCGGCG CTGGTCGCCG CCGTCGCGCA GCGGCTCGGC
TTTGCGCCGG ATGATCAGAT CAAGCTGCAG GTCGCGCCCG GCCGCCGCGC GCAATTGCCG
ATCGGCGAAG CGATTTCGGT CGGCCGCCTG CTCGGCGACT TCGTCGAACT GCAGCAGGTC
GCGACCCGCA AGCAGATCGC AGTCATGGCC GAGCACACGC GCTGTCCGCA GACCCGGCCG
AAGCTGCAAG CGCTCGCCGG CGGCGATGGC GCTGCCGACG AGGCCTATCG CGCCGGCGTT
CTGGCGAAGC GCAAGTCGGT CTATGATCTG ATGCAGGAGC ATCCCGCCTG CGAGTTGCCG
CTGCACGCTT ATCTGGAAAT GCTGTCGCCG CTGGCGCCGC GCTACTACTC GATCTCGTCG
TCGCCGTTGC GCGATCCGTC GCGCGCCGCG ATCACCGTCG CCGTGGTCGA TGGCCCGGCA
TTGTCCGGCC GTGGTCATTA TCGCGGCGTC TGCTCGACCT GGCTCGCCGG CCGAAGCGTC
GGCGACACCA TCCACGCCAC GGTGCGCGCG ACCAAAGCAG GTTTCCGCCT GCCCGACGAC
GACCGCGTGC CGCTGATCAT GATCGGGCCG GGCACCGGGC TCGCGCCGTT CCGCGGCTTC
CTGCAGGAGC GCGCCGCGCG CCAGCAGAAC GGCGCGACGC TCGGTCCGGC GCTGCTGTTC
TTCGGCTGCC GACATCCGGC GCAGGACTAT CTCTATGCCG ACGAGCTGCA GGGCTTCGCT
GCCGAGGGCG TCGTCGAGCT GCATACCGCG TTCTCGCGCG GCGAGGGGCC CAAGACCTAT
GTGCAGCATC TGATTGCCGC GCAGAAGGAT CGGGTGTTCA CGTTGATCGA GCAGGGCGCG
ATCATCTATG TCTGTGGCGA CGGCGGCAAA ATGGAGCCCG ACGTCAGGGC GGCGCTGATG
GCGATCCATC GCGAGCGCAG CGGCGCCGAT GCTGCGGCGG CGTCGACATG GATCGACGAT
CTCGGCGCAT GCAATCGCTA TGTGCTCGAC GTCTGGGCGA GCGCGTAA
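
Note: the gene sequence is printed in blocks of 10 bases, 60 bases per line. A minimal Python sketch for stripping that layout and computing a GC percentage is given below. It is an illustration, not the method used by the database to derive the GC content field. The function names clean_sequence and gc_percent are hypothetical, and the sketch assumes the sequence contains only the letters A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks and return upper-case bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only; paste the full block to check
# the whole gene against the reported GC content.
first_line = "ATGCCGTCCA CCAACAAGCT CGATCCAATT CCGCATCCGC CGAAGAAGCC GGTGGTCGGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

The value printed for a single line will differ from the whole-gene figure, since GC content varies along the sequence.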
 
Protein sequence
MPSTNKLDPI PHPPKKPVVG NMLSLDTTAP VQHLVRLAKE LGPIFWLDMM GAPLVIVSGY 
DLVDEISDEK RFDKAVRGAL RRARAVGGDG LFTADTKEPN WSKAHNILLT PFGGRAMQSY
HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
SLVRSLETIM MTRGLPLENL WMKKRRETLA DDVVFMNAMV DEIIAERRKA SESAADKKDM
LGAMLAGVDR ATGEPLDDVN IRYQINTFLI AGHETTSGLL SCAIYALLKH PDVLQKAYDE
VDRVLGSDTA VRPSYQQVNQ LSYITQILKE TLRMWPPAPA YGVAPIKDEV IGGKYHLKRG
TFVTVLVLAL HRDPAIWGPN PDAFDPENFS REAESKRPAN AWKPFGNGQR ACIGRGFAMH
EAALALGMIL QRFQLIDHQR YRMVLKETLT IKPEGFKIKV RPRSDKDRGD FVAAGASQVS
TPALAQAAPR ARPDHNTPLL VLYGSNLGTA EELATRVADL AELNGFSTRL GALDQYVGHL
PEEGGVLIFT ASYNGAPPDN ATQFVQWLSG DLPKDAFAKL RYAVFGCGNR DWTATYQAIP
RLVDERLAAH GGRNIFLRGE GDARDDLEGQ FESWFAKLGP LAVKEFGIDA KFARAVDDAP
LYRIEPVAPA AGNAVAAAGG AVPMKVLANR ELQDCAASGR STRHIEIALP EGISYRVGDH
LSVMPRNDPA LVAAVAQRLG FAPDDQIKLQ VAPGRRAQLP IGEAISVGRL LGDFVELQQV
ATRKQIAVMA EHTRCPQTRP KLQALAGGDG AADEAYRAGV LAKRKSVYDL MQEHPACELP
LHAYLEMLSP LAPRYYSISS SPLRDPSRAA ITVAVVDGPA LSGRGHYRGV CSTWLAGRSV
GDTIHATVRA TKAGFRLPDD DRVPLIMIGP GTGLAPFRGF LQERAARQQN GATLGPALLF
FGCRHPAQDY LYADELQGFA AEGVVELHTA FSRGEGPKTY VQHLIAAQKD RVFTLIEQGA
IIYVCGDGGK MEPDVRAALM AIHRERSGAD AAAASTWIDD LGACNRYVLD VWASA
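
Note: the protein sequence corresponds to the gene sequence translated with translation table 11. The Python sketch below illustrates the codon lookup on the first and last lines of the gene sequence. It is a simplified example, not the pipeline that produced this record. The names CODON_TABLE and translate are hypothetical. The sketch assumes the input is in frame and contains only A, C, G and T, and it stops at the first stop codon. Table 11 shares its codon-to-amino-acid assignments with the standard code and differs in the set of permitted start codons, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# Start of the first gene sequence line and the complete last line.
print(translate("ATGCCGTCCA CCAACAAGCT CGATCCAATT"))
print(translate("CTCGGCGCAT GCAATCGCTA TGTGCTCGAC GTCTGGGCGA GCGCGTAA"))
```

The two results can be compared with the beginning (MPSTNKLDPI) and the end (LGACNRYVLD VWASA) of the protein sequence above.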