Gene Rpal_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1803 
Symbol 
ID6409460 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1937151 
End bp1938416 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content65% 
IMG OID642711690 
Productcytochrome P450 
Protein accessionYP_001990805 
Protein GI192290200 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.677425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGGCA CCATCGAGAC CGGAAAAGCG GCGCGGCTGC GGGCGGCGCG CGAGGAAGCC 
TATGCGACCC CGCTGAAGGA CTTCCACCCC GGCGCGCCGC GGCACTTCCG CGACGACACG
CTGTGGCCGT GGTTCGAACG GCTGCGCGCC GAAGAGCCGG TGCATTACTG CACCAACGCG
CCGATCGAGC CGTATTGGAG CGTCACCAAG TACAACGACA TCATGCATGT CGACACCAAC
CACCAGATCT TCTCGTCGGA CTCCACGCTC GGCGGCATCT CGATCCGCGA CGCGCCGGTC
GGCTACGACT GGCCGAGCTT CATCGCGATG GACGAGCCGC GGCACTCGGC GCAGCGCAAG
ACGGTGTCGC CGATGTTCAC GCCGCAGCAT CTGGACGAGC TCGCGGTGCT GATCCGCGGC
CGGACCCAGA AGGTGCTGGA CGGCCTGCCG CGGGGCGAGA CCTTCAACTT CGTCGACCGC
GTCTCGATCG AGCTGACCAC GCAGATGCTG GCGACGCTGT TTGACTTTCC GTTCGACGAG
CGCCGCAAGC TGACGCGCTG GTCGGACGTC GCCACCGCGC TGCCGAAGAG CGGCGTGGTC
GACTCCGAGC AGCAGCGCCG CGACGAGCTG AACGAATGCG CCGCGTATTT CGCCCGGATG
TGGAACGAGC GGGTGAACTC GGAGCCGCGC AACGACCTGC TGTCGATGAT GGCGCATCAC
GACGCCACCC GGACGATGGA CCGCGACAAT CTGATCGGCA ACATCCTGCT GTTGATCGTC
GGCGGCAACG ACACCACCCG CAACACCATG TCGGGTTCTG TGCTGGCGCT GAACGAGAAC
CCGCACGAAT TCGAAAAGCT GCGGGCCAAT CCGAAGCTGA TCGACACCAT GGTGCCCGAG
GTGATCCGCT GGCAGACGCC GCTGGCGCAC ATGCGCCGCA CCGCCCTCCA GGACACCGAG
CTCGGCGGCA AGACCATCCG CAAGGGCGAC CGCGTGGTGA TGTGGTACGT CTCCGGCAAC
CGCGATGACG AGGTGATCGA GCGGCCGGAG GAATTCATCA TCGACCGCGC CCGGGCCCGG
ATCCACCTAT CGTTCGGTTT CGGTATCCAT CGGTGCGTCG GGATGCGCTT GGCCGAATTG
CAACTGAGGA TCGTATGGGA GGAGATGCTC AAGCGTTTCG AGCGTATTGA AGTTGTCGGG
GAGCCGAAGC GGGTGTATTC GAGCTTCGTC AAGGGCTACG AGTCCTTGCC GGTCCGGGTC
TCATGA
 
Protein sequence
MHGTIETGKA ARLRAAREEA YATPLKDFHP GAPRHFRDDT LWPWFERLRA EEPVHYCTNA 
PIEPYWSVTK YNDIMHVDTN HQIFSSDSTL GGISIRDAPV GYDWPSFIAM DEPRHSAQRK
TVSPMFTPQH LDELAVLIRG RTQKVLDGLP RGETFNFVDR VSIELTTQML ATLFDFPFDE
RRKLTRWSDV ATALPKSGVV DSEQQRRDEL NECAAYFARM WNERVNSEPR NDLLSMMAHH
DATRTMDRDN LIGNILLLIV GGNDTTRNTM SGSVLALNEN PHEFEKLRAN PKLIDTMVPE
VIRWQTPLAH MRRTALQDTE LGGKTIRKGD RVVMWYVSGN RDDEVIERPE EFIIDRARAR
IHLSFGFGIH RCVGMRLAEL QLRIVWEEML KRFERIEVVG EPKRVYSSFV KGYESLPVRV
S