Gene RPD_3694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3694 
Symbol 
ID4024210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4122320 
End bp4123585 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content63% 
IMG OID637963898 
Productcytochrome P450 
Protein accessionYP_570816 
Protein GI91978157 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGCA CCATCGAGAT CGACAAAGCC GCCCGTCAGC GCGCCGCGCG CGAGGAAGCC 
TATTCGACGC CGCTGGCGCA ATTCCATCCC GGCGCGCCGC GGCACTTTCG CGACGACACG
CTATGGCCGT GGTTCGAGCG GCTGCGCGCC GAGGAGCCGG TGCACTACTG CACCAATGCG
CCGATCGAGC CTTATTGGAG CGTGACAAAG TACAACGACA TCATGCATGT CGACACCCAC
CACGGCATCT TCTCGTCGGA CTCGACGCTT GGCGGCATCG CGATCCGCGA CGCGCCGGTC
GGCTACGACT GGCCGAGCTT CATCGCGATG GACGAGCCGC GGCATTCGGC GCAGCGCAAG
ACGGTGTCGC CGATGTTCAC GCCGCAACAT CTCGACGAGC TCGCGGTGCT GATCCGCGGC
CGGACCGAGA AGGTTCTCGA CGCCCTGCCC CGCAATGAGA CCTTCAATTT CGTGGAGCGG
GTCTCGATCG AGCTGACCAC GCAGATGCTG GCGACGCTGT TCGACTTTCC GTTCGAGGAG
CGGCGCAAGC TGACGCGTTG GTCGGACGTC GCCACCGCGC TGCCGAAGAG CTTGATTGTC
GCATCCGAAG AGGAGCGCCG CACCGTGCTG AACGAATGCG CGGCCACGTT CATCAAGCTA
TGGAATGAGC GGGTCAATTC CGAGCCGCGC AACGATCTGC TATCAATGAT GGCGCATCAC
GACGCGACGC GGCAGATGGA CCGCGACAAT CTGATCGGCA ACATCCTGCT GCTGATTGTC
GGCGGCAACG ACACCACCCG CAACACCATG TCGGGCTCGG TGCTGGCGTT GAACGAAAAT
CCCGACCAGT TCGCGAAATT GCGGGCGAAT CCGGCACTGA TCGACACCAT GGTACCGGAG
GTGATCCGCT GGCAGACGCC GCTGGCGCAT ATGCGCCGGA CCGCGCTGGA GGACACCGAA
CTCGGCGGCA AGACCATCAA GAAGGGCGAC CGGGTCGTGA TGTGGTACGT CTCCGGCAAT
CGCGATGACG AGGTGATCGA GCGGCCGAAC GAATTCATCA TCGACCGCAA GCGGCCGAAG
ATCCATCTGT CGTTCGGCTT CGGCATCCAC CGCTGCGTCG GGATGCGGCT CGCGGAGTTG
CAGCTCAAGA TCGTCTGGGA AGAAATGCTC AAGCGGTTCG ACCGCATTGA AGTTGTCGGG
GAGCCGAAGC GGGTGTATTC GAGCTTCGTC AAGGGCTACG AGTCCTTGCC GGTTCGCATA
TCCTGA
 
Protein sequence
MHGTIEIDKA ARQRAAREEA YSTPLAQFHP GAPRHFRDDT LWPWFERLRA EEPVHYCTNA 
PIEPYWSVTK YNDIMHVDTH HGIFSSDSTL GGIAIRDAPV GYDWPSFIAM DEPRHSAQRK
TVSPMFTPQH LDELAVLIRG RTEKVLDALP RNETFNFVER VSIELTTQML ATLFDFPFEE
RRKLTRWSDV ATALPKSLIV ASEEERRTVL NECAATFIKL WNERVNSEPR NDLLSMMAHH
DATRQMDRDN LIGNILLLIV GGNDTTRNTM SGSVLALNEN PDQFAKLRAN PALIDTMVPE
VIRWQTPLAH MRRTALEDTE LGGKTIKKGD RVVMWYVSGN RDDEVIERPN EFIIDRKRPK
IHLSFGFGIH RCVGMRLAEL QLKIVWEEML KRFDRIEVVG EPKRVYSSFV KGYESLPVRI
S