Gene RPB_1938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1938 
Symbol 
ID3908017 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2206614 
End bp2207795 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content66% 
IMG OID637883832 
Productputative cytochrome P450 
Protein accessionYP_485557 
Protein GI86749061 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0501163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG CTCCCCACTT CGAGATCAAT GTCGCCGCGT TCTGGGCCGA TCCCTATCCG 
GCCTTGGCGA AGATGCGCGC GCAAGCGCCG ATCGCCTTCG TGCCGCAGCT CGGCTCGACC
ATCTTCACCC GGCGCGACGA CATCTTCGTC AACGAGAAGC GCATCGACGT GTTCTCGTCG
CATCAGCCAG CCGGTTTGAT GAACCGGCTG ATGGGCCACA ACATGATGCG CAAGGACGGC
GACGCGCACC TCGCCGAACG CACCGCGATG TTTCCCGCGG TGTCGCCGCG CACCGTGAAG
GAGGTCTGGC GCAAGCAGTT TCAGGCCCAC GCCGATCGCA TCCTCGAGGA TCTCGCGCCG
CGGGGCGCCG CCGATCTGGT CAAGGCGTTC GCGCTGCCGC TGTCGGGCGA ATGTCTCAAA
GACGTCACCG GCCTCACCAA TATCAGCTAT CACGAGATGG ATGCGTGGTC GCAGGCGATG
ATCGACGGCA TCGCCAACTA CACCGGCGAC CGAGCGGTGG AGGATCGTTG TCACGCCGCG
ACCGCGGGAA TCGACGCCGC GATCGACGAC ATGGCCCCGG TGGTGCGCAA GCATCCCGAC
CACTCGATGC TGAGCGTGCT GATCGCTGCC GGCATGGCGA TGGATTCGAT CCGCGCCAAT
ATCAAGCTCG CGATCTCCGG CGGACAGAAC GAGCCGCGCG ACGCGATCTC GGGCTGCGTC
TGGGCGCTGC TGACCCATCC GTCGGAATAC GCCCGGGTGG TCGCGGGCGA AGCCAGCTGG
CTCGACGTGT TCGAGGAATA CGCCCGCTGG ATCGCGCCGA TCGGGATGTC GCCGCGCCGC
GTGGCGCAGC CGTTCCATTA TCGCGGCGTC GATTTCGAGC CGGAGGACCG GGTGTTCTTC
ATGTTCGGCT CGGCCAATCG CGACGAGGCC TGCTTCAGCG ATCCGGATGT GTTCGACGTC
GGCCGCGATC ACGCCAAGAG CATCGCCTTC GGTGCCGGGC CGCATTATTG CGCCGGCGCC
TTCGCCTCCC GCGCGATGGT CGCCGACGTC GCGCTGCCGG GCGTGTTCGC GCGGCTGACG
GATCTGCGGC TCGATCCGCG CGAGCCGGTG CGGATCGGCG GCTGGGCGTT TCGCGGATTG
CTCAATCTGC CGGTCGTCTG GAACAGCGCA GCGCCGAATT GA
 
Protein sequence
MSNAPHFEIN VAAFWADPYP ALAKMRAQAP IAFVPQLGST IFTRRDDIFV NEKRIDVFSS 
HQPAGLMNRL MGHNMMRKDG DAHLAERTAM FPAVSPRTVK EVWRKQFQAH ADRILEDLAP
RGAADLVKAF ALPLSGECLK DVTGLTNISY HEMDAWSQAM IDGIANYTGD RAVEDRCHAA
TAGIDAAIDD MAPVVRKHPD HSMLSVLIAA GMAMDSIRAN IKLAISGGQN EPRDAISGCV
WALLTHPSEY ARVVAGEASW LDVFEEYARW IAPIGMSPRR VAQPFHYRGV DFEPEDRVFF
MFGSANRDEA CFSDPDVFDV GRDHAKSIAF GAGPHYCAGA FASRAMVADV ALPGVFARLT
DLRLDPREPV RIGGWAFRGL LNLPVVWNSA APN