Gene RPB_3613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3613 
Symbol 
ID3911415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4148782 
End bp4150014 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content66% 
IMG OID637885515 
Productcytochrome P450 
Protein accessionYP_487219 
Protein GI86750723 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.235331 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.886685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGCA ATAGCTCCGC GGAGTCGATC TCCGCGCCCC CGAACGACAG CACTATTCCG 
CATCTCGCGA TCGATCCGTT CTCGCTCGAC TTCTTCGACG ATCCCTACCC GGATCAGCAA
ACCCTGCGCG ACGCCGGGCC CGTCGTCTAT CTCGACAAAT GGAACGTCTA CGGCGTGGCG
CGCTACGCCG AGGTCCATGC GGTGCTCAAT GATCCGACGA CGTTCTGCTC CAGCCGCGGG
GTTGGGCTCA GCGACTTCAA GAAGGAAAAG CCGTGGCGGC CGCCGAGCCT GATTCTCGAG
GCCGATCCGC CGGCCCATAC GCGCCCGCGC GCGGTGCTCA GCAAGGTGCT GTCGCCGGCC
ACCATGAAGA CCATCCGCGA CGGCTTCGCG GCGGCGGCCG ACGCCAAAGT CGACGAACTG
CTGCAACGTG GCTGCATCGA TGCGATCGCC GATCTCGCGG AGGCCTATCC GCTATCGGTT
TTTCCCGATG CGATGGGGCT GAAGCAGGAA GGTCGCGAGC ATCTGCTGCC CTATGCCGGC
CTGGTGTTCA ACGCATTCGG GCCGCCCAAT GAATTGCGCC AGACTGCGAT CGAGCGCTCG
GCGCCGCATC AGGCCTATGT CAACGAGCAG TGCCAGCGGC CGAACCTCGC TCCGGGTGGC
TTCGGCGCCT GCATCCATGC CTTCACCGAC ACCGGCGAAA TCACCCCGGA CGAAGCGCCG
CTGCTGGTGC GCTCGCTGCT GTCCGCGGGG CTGGACACGA CCGTCAACGG CATCGGCGCC
GCAGTGTATT GCCTGGCCCG CTTCCCCGGC GAATTGCAGC GGCTGCGCAG CGATCCGACG
CTGGCGCGCA ATGCATTCGA AGAAGCGGTG CGGTTCGAGA GCCCGGTGCA GACGTTTTTC
CGGACGACGA CGCGCGAGGT CGAGCTCGGC GGCGCGGTGA TCGGCGAAGG CGAAAAGGTG
CTGATGTTCC TGGGGTCCGC CAACCGCGAT CCGCGACGCT GGAGCGATCC CGACCTCTAC
GACATCACCC GCAAGACCTC TGGCCATGTC GGCTTCGGCT CCGGCGTCCA TATGTGCGTC
GGCCAGTTGG TGGCGCGGCT GGAGGGCGAA GTGATGCTGT CCGCGCTCGC CCGCAAGGTC
GCCGCCATCG ACATCGACGG CCCGGTCAAG CGCCGCTTCA ACAACACGCT GCGCGGGCTG
GAAAGCCTGC CGGTCAAGCT GACTCCTGCC TGA
 
Protein sequence
MISNSSAESI SAPPNDSTIP HLAIDPFSLD FFDDPYPDQQ TLRDAGPVVY LDKWNVYGVA 
RYAEVHAVLN DPTTFCSSRG VGLSDFKKEK PWRPPSLILE ADPPAHTRPR AVLSKVLSPA
TMKTIRDGFA AAADAKVDEL LQRGCIDAIA DLAEAYPLSV FPDAMGLKQE GREHLLPYAG
LVFNAFGPPN ELRQTAIERS APHQAYVNEQ CQRPNLAPGG FGACIHAFTD TGEITPDEAP
LLVRSLLSAG LDTTVNGIGA AVYCLARFPG ELQRLRSDPT LARNAFEEAV RFESPVQTFF
RTTTREVELG GAVIGEGEKV LMFLGSANRD PRRWSDPDLY DITRKTSGHV GFGSGVHMCV
GQLVARLEGE VMLSALARKV AAIDIDGPVK RRFNNTLRGL ESLPVKLTPA