Gene P9303_03561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_03561 
SymbolcypX 
ID4778383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp360385 
End bp361683 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content55% 
IMG OID640085859 
Productcytochrome P450 enzyme 
Protein accessionYP_001016373 
Protein GI124022066 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2124] Cytochrome P450 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.726457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAGTT CTGATGATGC CAAGCTTCGC CCTCTTCCCA ATACAGCTGC CCTGAGTGGG 
GTACTTGAAG CGTTTGCATT TTTTCGCGAT CCAGCCTTCG CACAGAAGCG CTTTAAGCGT
CACGGGAATG TTTTTGAAAC ATCACTGCTC GGACAGCCAA TGGTGTTCAT TCAGGGCGGG
CAAGCGATTA GTGATCTCCT AGCTCAACCC AACGCAGTAG AAGGTTGGTG GCCGGAGAGT
GTTCGGAAGC TGTTGGGCAG CCATTCGCTG GCTAACCGCA ACGGAGCCTC CCATCGAGCA
CGCCGAAGGG TGATAGGCCA ACTGTTCTCG GCCTCAGCAC TACAGCGCTA CAGCGCTGGC
ATCATCAGCA TGGTTAAGGA TCTTGCCGAC GAACTGCAAG CTGCAACAAC AGCTCTTCCT
CTCGCAGAGA GAATGCGTCG TTTCGCTTTC TCGGTAATCG CCACGACGGT TCTCGGATTG
GAGGGGACTG ATCGCGATGA GTTGTTCGTT GATTTCGAGA TCTGGACCAG AGCCCTCTTC
TCAATACCGA TAGCTCTTCC AGGTAGCTCC TTTGCTAAAG CGCTTAAGGC ACGGGAACGG
TTGTTGAAGA GACTTCAGAA GGTGCTTTTA AAAGCCTCCA ATGGCAACGG TGGTCTTGAT
CTGCTTGCGG GAGGTCTCGA TGAAGCAGGT ATTCCTTTAA CGGACGAAGA CGTAGTTGAG
CAGCTTCTGC TTCTTCTGTT CGCGGGCTAC GAGACCACGG CTTCTTCGCT CAGCTGTCTG
ATGCGGGAGC TCCTTCTCAA CCCGCAAGTG GAAACGTGGC TGCGAGAGGA AATCGATGGA
CTTGACTGGC CTCCAGCACC CGAGCAGGCC ACCACTGCCT ACGACCAGGT CAACGCACCA
AAACTTGACG CTGTCGTTTC TGAGATCATG CGACTCACGC CAGCGGTGGG AGGTTTTTTT
CGCCGCACTA AGTGCGCCTT GGTAATCGAC GGTGTGGAGG TGCCAAAAAA CCGTGTGGTA
CAGGTTGCTC TGGCAGCTTC TAACCGTCAT GGTGCTGGCG ATCTGGAAGC CTTTCGTCCC
CAGCGCCACC TAGATGATGG CTGTTCAGCA ACCCTGCTGC CTTTTGGGGG GGGAGAGCGA
GTATGCCTAG GCAAACCACT AGCGGAACTT GAAATACGTT TGATGGTGGT CGGCTTGTTT
CACCAGTTGC GACTTCACTT GATCCCTGAT CAAGACCTCA CTCTGCAAAT GCTGCCTAGC
CCTACACCCC GAGATGGGCT GCTAACGAAG GTGCTGTAA
 
Protein sequence
MASSDDAKLR PLPNTAALSG VLEAFAFFRD PAFAQKRFKR HGNVFETSLL GQPMVFIQGG 
QAISDLLAQP NAVEGWWPES VRKLLGSHSL ANRNGASHRA RRRVIGQLFS ASALQRYSAG
IISMVKDLAD ELQAATTALP LAERMRRFAF SVIATTVLGL EGTDRDELFV DFEIWTRALF
SIPIALPGSS FAKALKARER LLKRLQKVLL KASNGNGGLD LLAGGLDEAG IPLTDEDVVE
QLLLLLFAGY ETTASSLSCL MRELLLNPQV ETWLREEIDG LDWPPAPEQA TTAYDQVNAP
KLDAVVSEIM RLTPAVGGFF RRTKCALVID GVEVPKNRVV QVALAASNRH GAGDLEAFRP
QRHLDDGCSA TLLPFGGGER VCLGKPLAEL EIRLMVVGLF HQLRLHLIPD QDLTLQMLPS
PTPRDGLLTK VL