Gene NATL1_06951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_06951 
Symbol 
ID4779627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp638527 
End bp639813 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content38% 
IMG OID640083971 
Productputative lycopene epsilon cyclase 
Protein accessionYP_001014520 
Protein GI124025404 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACTA GAGCATTAAA AGATGCCCTA GTACTTGGCT CAGGCCCAGG CGCTTTATCA 
ATAGCGGCAG CATTAGCGAT TGAGAACTTA GATGTTGAAC TCTTATCTGA ACAATCGCCA
GAGGAACCTT GGCCCTTCAC TTATGGGATT TGGGGCGAAG AAGTTGACGA ACTAGGTCTA
AGTCATTTAC TTGAACATAG ATGGGTAAAT ACCATTAGTT ATTTTGGGGA GGGAGACAAG
GATCCAAATT CTAAAAAGAA TGAAATCACT AAACACAACA GAGATTATGG GCTTTTTGAT
AAAAACAAAT TACAAGCTTA TTGGTTAGAA CAATGCAACA ATGCTGAAAT AGAATGGCAT
AAAGGATCAG CAGTCAATTT AGAAACGAAT CAATTAACCA GCACAGTTAA AACGTCTAAT
GGAAAGGAAC TTAATGCTCG AGTAGTCATA GACGCAACTG GCTACAAACC TGTTTTTATT
AAGTCTCCTA ACCAAGGACC AGTAGCCGTT CAAACTTGTT ACGGAATCGT AGGGGAGTTC
AACGCACCCC CTGTAGAGAA AGGCCAATTT GTTTTAATGG ATTATCGTTG CGACCACTTG
AATCCAGAGG AGAGAAAAGA AGCTCCAACA TTTTTATACG CTATGGATAT GGGAAATGGG
AAGTTTTTCT TAGAAGAAAC ATCCTTGGGT CTATTTCCTC CAGTATCTCT TGATGAGTTA
AAAAGAAGAC TGGAAAAAAG ATTAGCGACT CGGGGGTTAG AAATAAAAAG TCTTGATCAT
GAAGAGCATG GTTCATATCT GCCAATGAAC ATGCCAATCC CTGACCTAAC ACAGCCAGTC
CTTGGATTTG GCGGTTCTGC TGGGATGGTA CATCCTGCAT CTGGATACAT GGTTGGCAGC
CTATTAAGAA GAGCTCCTAA AGTTGCCAAA GCCCTTTCAT TAGCAATGAA AGACCCAAAA
GCATCCTCAG CTTCATTAGC AAAAAAAGGA TGGCAAACCT TATGGCCATC AGAACTTAGA
AGAAAACAAG CTATTTATAA ATTTGGATTA GAAAAATTGA TGCGCTTTGA AGAGAATTTG
CTAAGAGGAT TTTTTATCGA GTTTTTTAGT TTACCTAATA AACAATGGTA TGGATTCCTT
ACAAATACTC TTAGCCTTAA AGAACTAATA TCCGCAATGT GGAAGATGTT TAGGAAATCA
CCCTGGACTA TCAAACAAGG CTTAATGAAT ATGCATGGTA GAGAATTAAA TTTATTATTT
AAAGCATTAT TAGTTAATAA TAAATGA
 
Protein sequence
MSTRALKDAL VLGSGPGALS IAAALAIENL DVELLSEQSP EEPWPFTYGI WGEEVDELGL 
SHLLEHRWVN TISYFGEGDK DPNSKKNEIT KHNRDYGLFD KNKLQAYWLE QCNNAEIEWH
KGSAVNLETN QLTSTVKTSN GKELNARVVI DATGYKPVFI KSPNQGPVAV QTCYGIVGEF
NAPPVEKGQF VLMDYRCDHL NPEERKEAPT FLYAMDMGNG KFFLEETSLG LFPPVSLDEL
KRRLEKRLAT RGLEIKSLDH EEHGSYLPMN MPIPDLTQPV LGFGGSAGMV HPASGYMVGS
LLRRAPKVAK ALSLAMKDPK ASSASLAKKG WQTLWPSELR RKQAIYKFGL EKLMRFEENL
LRGFFIEFFS LPNKQWYGFL TNTLSLKELI SAMWKMFRKS PWTIKQGLMN MHGRELNLLF
KALLVNNK