Gene RPB_3934 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3934 
Symbol 
ID3911741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4488655 
End bp4489920 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content64% 
IMG OID637885838 
Productcytochrome P450 
Protein accessionYP_487538 
Protein GI86751042 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.184448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGGCA CCATCGAGAT CGACAACGCC GCCCGCCAGC GCGCCGCGCG CGAGGAAGCC 
TATGCGACGC CGCTGTCGCA ATTCCACCCC GGCGCGCCGC GGCACTTCCG CGACGACACG
CTGTGGCCGT GGTTCGAGCG GCTGCGCGCC GAGGAGCCGG TGCACTACTG CACCAACGCG
CCGATCGCGC CGTATTGGAG CGTGACCAAG TACAACGACA TCATGCATGT CGACACCAGC
CATCAGATCT TCTCGTCGGA TTCGACGCTC GGCGGCATTT CGATCCGCGA CGCGCCGCAG
GGCTACGACT GGCCGAGCTT CATCGCGATG GACGAGCCGC GGCACTCGGC GCAGCGCAAG
ACGGTGTCGC CGATGTTCAC GCCGGACCAT CTCGACGAAC TCGCGGTGCT GATCCGCGGC
CGGACGCAGA AAGTGCTCGA TGGCCTGCCG CGCAACGAGA CCTTCAACTT CGTCGAGCGG
GTCTCGATCG AGCTGACGAC GCAGATGCTG GCCACCTTGT TCGACTTCCC GTTCGCGCAG
CGCCGCAAGT TGACGCGCTG GTCCGACGTC GCCACCGCGC TGCCCAAGAG CATGATCGTG
GCGTCGGAGG AGGAACGCCG CAGCGAGCTG AACGAATGCG CCGCGACCTT CGCGGCGATG
TGGAACGAGC GCGTCAATTC CGAGCCACGG AATGACCTGC TGTCGATGAT GGCGCATCAC
GACGCCACAC GGCAGATGGA CCGCGACAAT CTGATCGGCA ACATCCTGCT GTTGATCGTC
GGCGGCAACG ACACCACCCG CAACACCATG ACCGGCTCGG TGCTGGCGCT GAACCAGAAC
CCGGACCAGT TCGCCAAGCT GCGCGCCAAC CCGGCGCTGA TCGACACCAT GGTACCCGAG
GTGATCCGCT GGCAGACGCC GCTGGCGCAT ATGCGCCGCA CCGCCTTGCA GGACACCGAA
CTCGGCGGCA AGACCATCAA GAAGGGCGAC CGGGTGGTGA TGTGGTACGT CTCCGGCAAC
CGCGACGACG AGGTGATCGA GCGCCCGAAC GAGTTCATCA TCGACCGCAA GCGGGCGAAG
ATCCATTTGT CGTTCGGCTT CGGTATCCAC CGCTGCGTCG GGATGCGGCT GGCCGAATTG
CAACTGAAGA TCGTCTGGGA AGAAATGCTC AAACGGTTCG AGCGCATTGA AGTTGTCGGG
GAGCCGAAGC GGGTGTATTC GAGCTTCGTC AAGGGCTACG AGTCCTTGCC GGTTCGCATC
TCATGA
 
Protein sequence
MHGTIEIDNA ARQRAAREEA YATPLSQFHP GAPRHFRDDT LWPWFERLRA EEPVHYCTNA 
PIAPYWSVTK YNDIMHVDTS HQIFSSDSTL GGISIRDAPQ GYDWPSFIAM DEPRHSAQRK
TVSPMFTPDH LDELAVLIRG RTQKVLDGLP RNETFNFVER VSIELTTQML ATLFDFPFAQ
RRKLTRWSDV ATALPKSMIV ASEEERRSEL NECAATFAAM WNERVNSEPR NDLLSMMAHH
DATRQMDRDN LIGNILLLIV GGNDTTRNTM TGSVLALNQN PDQFAKLRAN PALIDTMVPE
VIRWQTPLAH MRRTALQDTE LGGKTIKKGD RVVMWYVSGN RDDEVIERPN EFIIDRKRAK
IHLSFGFGIH RCVGMRLAEL QLKIVWEEML KRFERIEVVG EPKRVYSSFV KGYESLPVRI
S