Gene P9211_07411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07411 
Symbol 
ID5730746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp644663 
End bp645946 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content40% 
IMG OID641285104 
Productputative lycopene epsilon cyclase 
Protein accessionYP_001550626 
Protein GI159903282 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0418129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.37911 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACTA ATGCAAATAT AGATGCATTA GTTCTGGGTT CTGGACCAGC TGCATTGGCA 
ATTGCATCTG CGCTTGCCAA TGAGAGACTC TCTGTTCATG TCCTTTCTCC TCTAGACCGC
AGGCACACAT GGCCATATAC ATATGGGATA TGGGGAGAGG AAGTAGATGA CCTGGGAATA
GGAGACCTTC TAAAACATCG TTGGACTAAT ACTGTCAGTT TCTTTGGTAG CGGTTCAAAA
GAAGAGAATT CTCCTAAGAA CAAGGAAACT AGGCACAATC ACGATTATGG GTTGTTTGAC
AAAAACAAAT TACAAGCCTA TTGGTTAAAG CAATGTGATG AAGCTCTTGT TGAATGGCAC
CTTGGAACAG CTACTAATCT CAAAGTAAAT CAATCTATTA GTACTGTTAC GACATCTGAG
GGGGAAGAAG TGACCGCTCG ACTCATAATA GATGCCACTG GATACAAACC TGTCTTCCTA
AAAGTTCCTA ATAATGGAGA GGTTGCTGTT CAAACCTGTT TCGGGATAGT AGGCAAATTT
ACATCTCCAC CAATAGAAAA AGAGCAATTC GTTTTAATGG ATTACAGAAA TAATCATCTG
ACTGAGGCCG AAAAAGATGA GCCTCCTACA TTTCTATATG CCATGGATAT GGGGAATGGA
ACTTTTTTTT TAGAAGAAAC CTCGCTAGGC CTAGCTCCAC CAGTCTCACT AGATACTCTC
AAAACAAGGC TGGAAAAACG TCTGCAACAT AAAGGGATAC AGATTACTGA AATTGAGCAT
GAAGAGCACG GTTTGTTTCT TCCAATGAAT ATTCCCATTC CTTACTTAGA TCAACCAATC
CTTGGTTTTG GTGGCGCGGC TGGCATGGTT CACCCAGCTT CTGGATATTT AGTTGGAACT
TTGCTAAGAC GAGCACCTTC TGTAGCAAAG GCAATTGCAA AAGCAATGGA AAACGAGCAA
GAAAGCCCTG CAGTGATAGC ACAAAAAGGA TGGGAAGCAT TATGGCCAAA AGATTTAAGA
AGGAAGCAAG CCTTATATCA ATTTGGACTT GAAAAGTTAA TGCGGTTTAA AGAGTCTCAA
TTAAGAGACT TCTTTACTGG CTTTTTTAGT CTCTCAGAAA GCCAGTGGTA TGGTTTTCTG
ACAAACTCTC TTACTCTAGG TGAACTTATA AACGCAATGT GGAAAATGTT TACCAAAGCT
CCTTTAAATG TAAAATGGGG ATTGTTTGAA ATGAAAGGAA GAGAAATGAA GTTATTACTG
AAGTTCCTAA GCCCTGAAGT TTAA
 
Protein sequence
MATNANIDAL VLGSGPAALA IASALANERL SVHVLSPLDR RHTWPYTYGI WGEEVDDLGI 
GDLLKHRWTN TVSFFGSGSK EENSPKNKET RHNHDYGLFD KNKLQAYWLK QCDEALVEWH
LGTATNLKVN QSISTVTTSE GEEVTARLII DATGYKPVFL KVPNNGEVAV QTCFGIVGKF
TSPPIEKEQF VLMDYRNNHL TEAEKDEPPT FLYAMDMGNG TFFLEETSLG LAPPVSLDTL
KTRLEKRLQH KGIQITEIEH EEHGLFLPMN IPIPYLDQPI LGFGGAAGMV HPASGYLVGT
LLRRAPSVAK AIAKAMENEQ ESPAVIAQKG WEALWPKDLR RKQALYQFGL EKLMRFKESQ
LRDFFTGFFS LSESQWYGFL TNSLTLGELI NAMWKMFTKA PLNVKWGLFE MKGREMKLLL
KFLSPEV