Gene Rpal_1201 details


Gene Information

Locus tag: Rpal_1201
Symbol:
ID: 6408857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 1271054
End bp: 1272241
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 66%
IMG OID: 642711099
Product: cytochrome P450
Protein accession: YP_001990216
Protein GI: 192289611
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
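
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
# Values copied from the record above.
start_bp = 1271054
end_bp = 1272241
gene_length_bp = 1188
protein_length_aa = 395


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1272241 - 1271054 + 1 gives 1188 bp, and 1188 / 3 - 1 gives 395 aa, which agrees with the listed values.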


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0703508
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAGCT TCGATCCCTA TTCGCCCGTT GTCGATGCCG ACCCGTTCCC GCTCTACAAG 
ACGCTGCGCG ACGAATATCC GGTGTTCTGG AGCGAGCCGG CCCAGATGTG GATTCTGTCG
CGCTATCTCG ACGTCGCCGG CGCCGGCAGC AACTGGCAGG TGTTCTCGTC GGCCAAGGGC
AACCTGATGA CCGAACTGCC GAACCGGGCC GGCGCCACGC TCGGCACCAC CGATCCACCG
CGCCACGACC GGCTGCGCGG GCTGGTGCAG CACGCCTTCA TGAAGCGCAA TCTCGAAGCG
CTGGCCGAAC CGATGCGGGA GATCGCCCGC GATGCCGCGG AGGCGCTGCG CGGCCGTGAC
CAATTCGATT TCATCAGCGA CTTTTCGTCC AAGTTCACCG TGCGGGTGCT GTTTGCAGCT
CTTGGCCTGC CGATGGGCGA TGAGCAGACC GTGCGGGACA AGGCGGTGCT GATGGTGCAG
AGCGATCCGG TGACCCGCGC CAAGGGACCG GAGCATCTCG CCGCCTACGC GTGGATGCAG
GAGTACGCGT CGGGCGTGAT CGCGCAGCGC CGCGCCGAGC CGAAGAACGA TCTGATCTCG
CATTTCAGCA TGGCGGAGAT CGATGGTGAC CGGCTCGACG AGCGCGAGGT GCTGCTCACC
ACCACCACGC TGATCATGGC CGGCATCGAG TCGCTCGGTG GCTTCATGAG CATGCTGGCG
CTGAACCTGG CTGACTTCGC CGATGCGCGC CGTGCGGTGG TGGCCGACCC TGCGCTGCTG
CCGGACGCGG TCGAGGAGTC GCTGCGCTAC AACACCTCGG CTCAGCGCTT CAAACGCTGC
CTGCAGAGCG ACCTGACGCT GCACGGCGTC ACCATGAAGG CCGGGGACTT CGTGTGCTTG
GCGTATGGCT CGGCCAATCG CGACGAACGG CAGTTTCCGA ATCCGGACGT GTACGACGTC
AAGCGCAAGC CGAAGGGCCA CCTCGGCTTC GGCGGCGGCG TCCATGCCTG CCTCGGCTCG
GCGATCGCCC GGATGGCGAT CAGGATCGCA TTCGATGAGT TCCACAAGGT GGTGCCGGAC
TATACGCGCA CCGAGCAACA GCTGAACTGG ATGCCGTCGT CGACCTTCCG CAGCCCGCTG
CGGCTCGATT TCGCGGTCGA GCAGGCCGCG GCGCGATCCG CGGCGTAG
 
Protein sequence
MFSFDPYSPV VDADPFPLYK TLRDEYPVFW SEPAQMWILS RYLDVAGAGS NWQVFSSAKG 
NLMTELPNRA GATLGTTDPP RHDRLRGLVQ HAFMKRNLEA LAEPMREIAR DAAEALRGRD
QFDFISDFSS KFTVRVLFAA LGLPMGDEQT VRDKAVLMVQ SDPVTRAKGP EHLAAYAWMQ
EYASGVIAQR RAEPKNDLIS HFSMAEIDGD RLDEREVLLT TTTLIMAGIE SLGGFMSMLA
LNLADFADAR RAVVADPALL PDAVEESLRY NTSAQRFKRC LQSDLTLHGV TMKAGDFVCL
AYGSANRDER QFPNPDVYDV KRKPKGHLGF GGGVHACLGS AIARMAIRIA FDEFHKVVPD
YTRTEQQLNW MPSSTFRSPL RLDFAVEQAA ARSAA
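
The sequences above are printed in blocks of ten characters, so they need whitespace removed before any computation. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard set of codon assignments, which translation table 11 shares for elongation codons; alternative start codons are not handled here.

```python
# Build the standard codon table (same codon assignments as table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = clean_sequence(
    "ATGTTCAGCT TCGATCCCTA TTCGCCCGTT GTCGATGCCG ACCCGTTCCC GCTCTACAAG"
)
print(translate(first_line))  # MFSFDPYSPVVDADPFPLYK
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through the same functions would allow the reported gene length, protein length, and GC content to be checked.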