Gene RPD_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0041 
Symbol 
ID4020495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp51851 
End bp53074 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content63% 
IMG OID637960217 
Productcytochrome P450 
Protein accessionYP_567182 
Protein GI91974523 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAATC TCGACTTCGC CGATCCCAAG ATCAACGCCG ATCCGTTCCC TGTTTTCGCG 
CAGTTGCGGG AGAGTGATCC GGTGCATTGG TCGCCGGTCC TGAAGGCGTG GGTGATTACG
CGCTATGACG ACGTCCGCCG TGTGGCGGTG TCGAACGCCG ACATGTCGGC GGAGCGACTG
GCGCCGTTCT TCGCGACGGT GCCCGCCGAG AGCCAAAGCG GCTTTGCCAA TCTGATGACC
TATCTCGGAA AATGGATGGT TTTCCGGGAT CCGCCCGAGC ACACCCGGCT GCGGCGGCTG
TTCACCAAGG CTTTCACCTC GCGGTCTGTG ATGGCGCTGG AGCCCAATGT CGGGGAGATC
GTCGCGCTGC TGTTCGACGA GATGGAGCAA AAGGCGCGCA GCACCGGTGT GGTCGACTGG
ATTGCGGATT TTGCCTATCC TCTGCCGGCG ACGGTGATGA TGGATCTGCT CGGCGTGCCG
CGAGACGATC TGCATCGCGT CAAGGACTGG TCGAACGACA TCGCTTTGTT CATCGGAACG
TCGCGCGCGA CGGCGGACAA ATATCTCCGC GCAGAGGCCG GGGCGAAGGC CATGGCAGAG
TATTTTCGCG GCATCATCGC CAGCCGGACG GTCGATCCGC AGGATGATAT CATCAGCCAG
CTCGTGACCG AGCCCGACAA GCGCGAGGCG CTGACCGATG ACGAGGTGAT CGCAACCTGC
ATCCTCCTGC TGTTCGCAGG GCACGAAACG ACCACCAACC TTCTCGGCAA CGGGTTCTAC
TACACGATGA ACGCGCCGGA GCAGTGGGCG CGCGTCAAGG ACGATCCATC GTTGGCTGAA
ACGGCGGTCG AGGAATGGCT GCGGTATGAC GGGCCGAGCG GCGCACTGGT GCGCGTCGTC
ACCGCCGATG TGGAGTTCGG CGGCCGGACG ATGCTGCAGG GGCAGCGCGT GTTCGCCTTC
ATCAATTCGG CCAACCGCGA TCCCGAGCAG TTCGGGGATG CGGATCGCCT CGATCTCGGC
CGATCGCCGA ATCCGCATTT GACATTCGGT CATGGCATTC ACTTTTGCCT CGGCGCTCAG
CTCGCCAGGC TCGAAGGACA GATCGCCTTG CGCGCCCTGA TCGAGCGGTT TCCCGGGATT
TCCCTTGCAA CGGATTCCGC GCCGGGGTGG AGGGATTCGA TCATCCTCCG AGGCATGCAA
TCGCTCCCGA TCCGCCTGCG CTGA
 
Protein sequence
MPNLDFADPK INADPFPVFA QLRESDPVHW SPVLKAWVIT RYDDVRRVAV SNADMSAERL 
APFFATVPAE SQSGFANLMT YLGKWMVFRD PPEHTRLRRL FTKAFTSRSV MALEPNVGEI
VALLFDEMEQ KARSTGVVDW IADFAYPLPA TVMMDLLGVP RDDLHRVKDW SNDIALFIGT
SRATADKYLR AEAGAKAMAE YFRGIIASRT VDPQDDIISQ LVTEPDKREA LTDDEVIATC
ILLLFAGHET TTNLLGNGFY YTMNAPEQWA RVKDDPSLAE TAVEEWLRYD GPSGALVRVV
TADVEFGGRT MLQGQRVFAF INSANRDPEQ FGDADRLDLG RSPNPHLTFG HGIHFCLGAQ
LARLEGQIAL RALIERFPGI SLATDSAPGW RDSIILRGMQ SLPIRLR