Gene RPB_4636 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4636 
Symbol 
ID3912453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5240150 
End bp5241529 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content68% 
IMG OID637886540 
Productcytochrome P450 
Protein accessionYP_488230 
Protein GI86751734 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCC AGGTCGCGGA TTCGTCGCTC GTTGCCACGC TGGCGCCGCC GCCGCGCAGT 
GCGCTCGCCC ATATTCCCGG GGACGAAGGC TGGCCGATCA TCGGTCGCAC CCTGCACGTG
CTCGCGGACC CGAAGGGGCA GGTCGAGATG ATGGGCCGGC TGTACGGCCC GGTGTATCGC
AGCCGGGTGC TGGGCGAGAC CAGCATCACC CTGCTCGGGC CGGAGGCCAA CGAGCTGGTG
CTGTTCGACA ACACCAAACT GTTCTCCTCG ACCCATGGCT GGGGATCGAT TCTCGGACTG
CTGTTTCCAC GCGGACTGAT GATGCTGGAT TTCGACGAGC ACCGGCTGCA CCGCAAGGCG
CTGTCGGTGG CGTTCAAGGC CGGGCCGATG CAGTCCTACC TCGCCGAACT CAACTCGGGC
ATCGCACGTC AGGTGGCGCA GTGGCGGGCG CAGCCCGGCG AGATGCTGTG CTATCCGGCG
ATGAAGCAAC TCACCCTCGA TCTCGCCGCG ACCTCGTTCC TCGGAACCGC GATCGGCGCC
GAGACCGCCG AGGTCAACGC CGCCTTCGTC GACATGGTCG CGGCCTCGGT CGCGCCGATC
CGCAAGCCCT GGCCCGGCAC CGCGATGGCA CGCGGCGTCC GCGGCCGCCA GCGCATCGTC
GCCTACTTCT CCGAACAGAT CCCGATCCGC CGCGCCGAGG GCGGCGACGA CCTGTTTTCA
CATCTGTGCC GTGCCACCCA CGACGACGGC GCATTGCTGT CGACGCAGGA CATCGTCGAC
CATATGAGCT TCCTGATGAT GGCGGCGCAC GACACGCTGA CGTCGTCGCT GACGTCGTTC
GTCGCGGCGC TCGCCGCGGC TCCGGAATGG CAGCGACGGC TGTGCGAAGA AATCGGCGGC
CTCGGGCTGA AGCCGGGCGA GCCGATCGCG TTCGAGCAGC TCGACGCGCT GCCGCTGACC
GAGATGGCAT TCAAGGAGGC GATGCGGCTG CGGCCGCCGG TGCCGTCGCT GCCACGCCGC
GCCACCCGCG AATTCAGCTT CAAGGGCTAC ACCATTCCGG CCGGCACAAT GGTGGCGATC
AATCCGCTGT ACACGCACCA CATGCCGGAG ATCTGGCCCG CCCCGGACAG ATTCGATCCG
CTGCGCTTCA CCGACGAGGC GCAGCGTGGC CGCCACCGCT TCGCCTGGGT CGCCTATGGC
GGCGGCGCGC ATATGTGCCT CGGGTTGAAC TTCGCCTACA TGCAGGCGAA GTGCTTCGCG
GTGCACCTGC TGCAGAACCT CAGCCTCGAC CTGCCGCCGA ACTATCAATC GTCGTGGCAG
ATGTGGCCGA TCCCGAAGCC GAAAGACGGC CTGCGGGTGC GGATCGCGCC CCTGCAGTAG
 
Protein sequence
MSIQVADSSL VATLAPPPRS ALAHIPGDEG WPIIGRTLHV LADPKGQVEM MGRLYGPVYR 
SRVLGETSIT LLGPEANELV LFDNTKLFSS THGWGSILGL LFPRGLMMLD FDEHRLHRKA
LSVAFKAGPM QSYLAELNSG IARQVAQWRA QPGEMLCYPA MKQLTLDLAA TSFLGTAIGA
ETAEVNAAFV DMVAASVAPI RKPWPGTAMA RGVRGRQRIV AYFSEQIPIR RAEGGDDLFS
HLCRATHDDG ALLSTQDIVD HMSFLMMAAH DTLTSSLTSF VAALAAAPEW QRRLCEEIGG
LGLKPGEPIA FEQLDALPLT EMAFKEAMRL RPPVPSLPRR ATREFSFKGY TIPAGTMVAI
NPLYTHHMPE IWPAPDRFDP LRFTDEAQRG RHRFAWVAYG GGAHMCLGLN FAYMQAKCFA
VHLLQNLSLD LPPNYQSSWQ MWPIPKPKDG LRVRIAPLQ