Gene P9303_23511 details

Gene Information

Locus tag: P9303_23511
Symbol:
ID: 4778318
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2070654
End bp: 2071934
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 55%
IMG OID: 640087872
Product: putative lycopene epsilon cyclase
Protein accession: YP_001018351
Protein GI: 124024044
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID: [TIGR01790] lycopene cyclase family protein
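
The start and end coordinates, gene length, and protein length in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2070654, 2071934)
print(length, protein_length_aa(length))  # prints "1281 426"
```

Under those assumptions the computed values match the listed Gene Length (1281 bp) and Protein Length (426 aa).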


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.96532
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACAGAGG CAGTGGTGGA TGTCTTAGTG CTTGGCGCTG GTCCGGCGGC CCTGGCCATT 
GCTGCAGCAA TGGGCAAGGA GGGTTTACAG GTCTCAGCAC TGACGGTTGG CAATCCTCGC
GAGCCATGGC CATATACCTA TGGCATTTGG GGTGATGAGG TGGATGCCTT TGATATGGGG
CACCTGCTCG AGCACCGCTG GTCGAACACA GTGAGTTTTT TTGGTCCAGG CGCCTCTGAC
CCCAATGCTG ACGCGAATCG GCCCAGTCTT CATCATCGGG ATTACGGCCT GTTCGACAAG
ATCAAGCTCC AGGAGCATTG GTTGCAGCAA TGCGAAGCTG CTGGTTTGAC TTGGCATCAA
GGTCTCGCAA CTGATTTGGC TGTTGATGCC ACCGTTAGTA CTGTGACCAC CGCTGAAGGT
CTTGAGCTGC AAGCTCGTTT GGTGGTTGAT GCAACTGGCT ATAAGCCTGT ATTTCTGCGT
CATATAGATC ATGGGCCGGT GGCGGTCCAG ACCTGCTTTG GCGTGGTGGG ACGTTTCAAT
AAGCCGCCTG TAGAGCCTGG GCAGTTTGTC TTGATGGACT ACCGCTGTGA TCATCTGAGC
CCTGCGGAGA AGGCTGAGCC ACCAACGTTT CTTTATGCAA TGGACTTTGG TGGGGGATGC
TTCTTCTTAG AGGAAACTTC GCTAGGGCTC GCACCTCCGG TGTCATTAGA AACGCTGCGT
TCACGCTTGG AGCGGCGATT GGCTCATCAA GGCTTAACGA TCACAGAACT GCAGCACGAG
GAGCTTGGTT ATTTCCTGCC GATGAATTTG CCTCTACCTG ACTTGCAGCA ACCGCTGCTT
GGCTTCGGAG GCTCGGCGGC GATGGTGCAC CCTGCTTCGG GCTATTTGGT AGGCAGCATG
CTGCGCCGCG CACCTTATGT TGCAAAGGCT GTGGCTGAAG CCATGGCCGA TCCAGTGGCG
GGGCCGGCGG TGCTCGCGGC TGCAGGGTGG GAGACTCTTT GGCCCAAGGA GTTGCGTCGT
AAGCATGCCC TTTATCAATT TGGGCTTGAG AAGCTGATGC GTTTTAAAGA GCCCCAACTG
CGTGATTTCT TTATCAGTTT CTTTGCTTTG CCGAGCGATG AATGGTACGG CTTTTTGACC
AACACTCTTA GCCTGCGCGA ATTGGTCGCT GCCATGGTGA ATATGTTTGT TAGCGCCCCT
TGGAGTGTTC GCTGGGGCCT AATGGGTATG CAGGGGCGGG AGCTGAAATT GCTTTCACGC
TTTCTTTTCC CGCCTCGCTA A
 
Protein sequence
MTEAVVDVLV LGAGPAALAI AAAMGKEGLQ VSALTVGNPR EPWPYTYGIW GDEVDAFDMG 
HLLEHRWSNT VSFFGPGASD PNADANRPSL HHRDYGLFDK IKLQEHWLQQ CEAAGLTWHQ
GLATDLAVDA TVSTVTTAEG LELQARLVVD ATGYKPVFLR HIDHGPVAVQ TCFGVVGRFN
KPPVEPGQFV LMDYRCDHLS PAEKAEPPTF LYAMDFGGGC FFLEETSLGL APPVSLETLR
SRLERRLAHQ GLTITELQHE ELGYFLPMNL PLPDLQQPLL GFGGSAAMVH PASGYLVGSM
LRRAPYVAKA VAEAMADPVA GPAVLAAAGW ETLWPKELRR KHALYQFGLE KLMRFKEPQL
RDFFISFFAL PSDEWYGFLT NTLSLRELVA AMVNMFVSAP WSVRWGLMGM QGRELKLLSR
FLFPPR
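
The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any computation. As a minimal sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database), the following removes the grouping and computes a GC percentage of the kind reported in the GC content field:

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a sequence printed in grouped form."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence as printed on this page.
first_line = "GTGACAGAGG CAGTGGTGGA TGTCTTAGTG CTTGGCGCTG GTCCGGCGGC CCTGGCCATT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string whose length can be compared with the listed Gene Length, and `gc_percent` on that string can be compared with the listed GC content.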