Gene RPD_3440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3440 
Symbol 
ID4023954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3824062 
End bp3825243 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content65% 
IMG OID637963644 
Productputative cytochrome P450 
Protein accessionYP_570564 
Protein GI91977905 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0666165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG CTCCGCATTT CGAGATCGAC GTCGCTTCGT TCTGGGCCGA TCCTTATCCC 
GCGCTTGCGA GGATGCGCGC CGAGGCGCCG ATCGCCTTCG TGCCGCAACT CGGCTCGACC
ATCTTCACCC GGCGCGACGA CATCTTCGTC ACCGAGAAGC GCATCGACGT GTTCTCGTCG
CACCAGCCGG CCGGCCTGAT GAACCGGTTG ATGGGCCACA ACATGATGCG CAAGGATGGC
GACGCGCATA TCGCCGAGCG CAGCGCCTTG TTTCCAGCGG TGTCGCCGCG CACCGTGAAG
GACGTATGGC GCGCACAGTT TCAGGCCCAT GCCGATCGGA TCCTCGACGA ACTCGCGCCG
CAGGGTCACG CCGATCTGGT CAAGGCTTTC GCGCTGCCGC TGTCGGGTGA GTGCCTGAAG
CACATCACCG GCCTCACCAA TATCAGCTAT CACGAGATGG ATTCGTGGTC GCAGGCGATG
ATCGACGGCA TCGCCAACTA CACCGGCGAC AAGGCGGTCG AGGATCGTTG CCATGCGGCG
ACTGCAGGCA TCGATGCCGC GATCGACGAC ATGGCCCCGG TGGTGAGCAA ACATTCCAAC
CACTCGATGC TGAGCGTGCT GCTCGCCTCG GGCATGGCGA TGGACAGTAT CCGCGCCAAC
ATCAAGCTCG CGATTTCGGG CGGGCAGAAC GAGCCGCGCG ACGCGATCGC GGGCTGCATC
TGGGCGCTGC TGACGCATCC CGCAGAATAC GCCAAGGTCG TCGCCGGGGA TGCGAGCTGG
CTCGCCGTGT TCGAGGAATA CGCCCGCTGG ATCGCACCGA TCGGAATGTC GCCGCGCCGC
GTCGCGCAGC CGTTCCATTA TCGCGGCGTC GATTTCGAGC CGGAGGATCG GGTGTTCTTC
ATGTTCGGCT CGGCCAATCG CGACGAGGCC TGCTTCACTG ATCCGGACCT GTTCGACGTC
AGCCGCGATC ATGCCAAGAG CATCGCCTTC GGCGCTGGTC CGCATTACTG CGCGGGCGCC
TTCGCCTCGC GCGCGATGGT CGCCGACGTC GCGCTGCCGA GTGTGTTCGC ACGGTTGAAA
GCGCTGCGGC TCGACGAAGG CGAGCCGGTG CGGATCGGCG GCTGGGCGTT TCGCGGGCTG
CTCAATCTGC CGGTCGCATG GAGCAGCGCC GCGCCGAATT GA
 
Protein sequence
MSNAPHFEID VASFWADPYP ALARMRAEAP IAFVPQLGST IFTRRDDIFV TEKRIDVFSS 
HQPAGLMNRL MGHNMMRKDG DAHIAERSAL FPAVSPRTVK DVWRAQFQAH ADRILDELAP
QGHADLVKAF ALPLSGECLK HITGLTNISY HEMDSWSQAM IDGIANYTGD KAVEDRCHAA
TAGIDAAIDD MAPVVSKHSN HSMLSVLLAS GMAMDSIRAN IKLAISGGQN EPRDAIAGCI
WALLTHPAEY AKVVAGDASW LAVFEEYARW IAPIGMSPRR VAQPFHYRGV DFEPEDRVFF
MFGSANRDEA CFTDPDLFDV SRDHAKSIAF GAGPHYCAGA FASRAMVADV ALPSVFARLK
ALRLDEGEPV RIGGWAFRGL LNLPVAWSSA APN