Gene P9303_09161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_09161 
Symbol 
ID4777515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp832157 
End bp833383 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content53% 
IMG OID640086425 
Productputative lycopene beta cyclase 
Protein accessionYP_001016932 
Protein GI124022625 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.345698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCAGCC GTACGGATGT TTTAGTGATG GGAGCCGGTC CGGCGGCCCT CTGTATCGTT 
GCTGAGTTAG TCGAGCAAGG CTTGGCAGTG ACTGCCCTCG CCTCGCATAC ACCTGAGCAG
CCATGGCCCA ATACTTATGG AATCTGGGCT GAGGAGCTTG AGTCTCTTGG GATGGCTTCT
TTACTGGGCC AGCGTTGGAC CAACACAGTT AGCTATTTCG GTGATGGTGA TAACGAGGAG
GGTCTTGCTC CAATTCAACA TCATTTCGAC TACGGCCTTT TCGATCCAGC CGCTTTACAA
GATTCATTGC TGAGCCGTTG TGGTGAATTG AGTTGGAATG TTGAAACCGC TGTGAGCATT
AAGGTGTTGG GGATAGATAC GGAAGTTCTC TGTCACTCTG GCAACGCTTA TCGAGCAAGG
GTTGTCATTG ATGCCAGTGG TCATCGCAGT CGATTTATCA GTCGCCCTGA TCATGGGCCC
GTAGCGGAAC AGGCGGCCTA TGGCGTCGTG GGACGCTTCA GTTCATCACC AGTGGAGTCT
GGTCAGTTCG TATTAATGGA TTTTCGTCCC GACCATCTCA GTGATGAACA ACGAGAGAAA
CCCCCCAGCT TTCTCTATGC CATGGATTTT GGTGAGGGAA TCTTTTTTGT CGAGGAGACT
TCGCTTGCCT GCGCCCCTCC ATTGTCATGG AGTGAATTAA GGGAAAGGCT GCATGCACGC
TTGTCTCACC GCGGGGTGGA GATTAAAGAA GTGATCCATG AGGAATATTG CCTGTTCCCG
ATGAACTTGC CTTTGCCTGA TCGTCGTCAG CCCCTGCTCG CGTTTGGTGG TGCAGCAAGC
ATGGTCCATC CTGCTTCGGG CTACATGGTT GGAGCTCTTT TACGTCGAGC TCCGGCATTG
GCAAAGCATT TGGCAATGGC TATGGCTGTT GAGCCCCCTC TGGACTCTTC CGCTCTTGCT
CGGGAAGGTT GGCAGGTGCT TTGGTCGCCG GAGCTTGTCC AGCGTCATCG CCTTTATCAG
TTTGGTTTGC GGCGACTGAT GAGCTTTAAC GAAGCTCGCT TGCGCAGCTT CTTTGCCACT
TTCTTTCAGT TGCCTCGTGA GGATTGGACA GGTTTTCTGG CTAACACGCT GCCACTACCT
CGTCTTCTGA TGGTGATGCT GCGGTTGTTC TTGCTTTCCT CCTGGGAGAT TCGGTTAGGG
ATGCTTTTTG GGGCCTCTTC GACCTAG
 
Protein sequence
MTSRTDVLVM GAGPAALCIV AELVEQGLAV TALASHTPEQ PWPNTYGIWA EELESLGMAS 
LLGQRWTNTV SYFGDGDNEE GLAPIQHHFD YGLFDPAALQ DSLLSRCGEL SWNVETAVSI
KVLGIDTEVL CHSGNAYRAR VVIDASGHRS RFISRPDHGP VAEQAAYGVV GRFSSSPVES
GQFVLMDFRP DHLSDEQREK PPSFLYAMDF GEGIFFVEET SLACAPPLSW SELRERLHAR
LSHRGVEIKE VIHEEYCLFP MNLPLPDRRQ PLLAFGGAAS MVHPASGYMV GALLRRAPAL
AKHLAMAMAV EPPLDSSALA REGWQVLWSP ELVQRHRLYQ FGLRRLMSFN EARLRSFFAT
FFQLPREDWT GFLANTLPLP RLLMVMLRLF LLSSWEIRLG MLFGASST