Gene P9301_06601 details

Gene Information

Locus tag: P9301_06601
Symbol:
ID: 4911084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 585814
End bp: 587097
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 35%
IMG OID: 640160241
Product: putative lycopene epsilon cyclase
Protein accession: YP_001090884
Protein GI: 126695998
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID: [TIGR01790] lycopene cyclase family protein
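
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 585814
END_BP = 587097
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 1284
print(coding_residues(GENE_LENGTH_BP))  # 427
```

Under those assumptions, 587097 - 585814 + 1 gives 1284 bp, and 1284 / 3 = 428 codons, one of which is the stop codon, leaving 427 residues.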


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAG AAAATATGCC AGATGTTCTT GTTTTGGGTG CAGGGCCTGC AGGTATGGCT 
ATTGCCTCAG CTTTAGGTAA GGAAAAATTA GATGTTGAAG TGCTTTCTCC AAATGGACCA
GATGAGCCTT GGCCAAATAC ATATGGCATT TGGGGGAAAG AAGTTGATCA ACTCGGGCTT
CAGGATTTAC TTGAATATAG ATGGAAGAAT ACTGTAAGTT TTTTTGGGCA TGGCGCTTTA
GAAGAGCAGG ACGACGAAAA TAAAGCCACG GAACATTCAC TAGATTATGG ATTATTTGAT
AAGAAGAAAC TCCACAATTA TTGGTTTAAT GAATGCAATA AGTCTTTTAT TAAATGGCAT
CAAGGCTTTG CCAACAAAAT ACATTTTGAA AAATACAAAA GTACAGTAAC TACAAAAGAT
GGCAAAATTT ACTCTGCAAG ATTAGTAGTA GATGCAACAG GGTATGATCC TGTTTTTCTA
AAATTAAAAT CATGTGGTCC CTTAGCTGTC CAAACTTGTT ATGGGATAGT AGGTAATTTT
AGTAAACCTC CACTTAAGAA AGGGCAGTTT GTATTAATGG ACTATAGAAA TGACCATCTT
AACGATGAGC AAAAAAAAGA ACCGCCAACT TTTCTTTATG CCATGGATAT GGGGGATGGG
AAATATTTTC TAGAAGAGAC ATCTCTTGGT TTAGTAAATC CTCTAACAAT GGAAAATTTA
AAAGAGAGAC TAGAGAAGAG GCTTTCTTAT CGAAATATAT CAATCACAAG CATGCAACAC
GAAGAGCTTG GCTTATTTCT TCCTATGAAT ATGCCAATCC CAGATTTCAA ACAACAAATA
CTTGGATACG GTGGTGCTGC TTCAATGGTT CATCCTGCAT CTGGATATTT AATTGGTAAT
GTTTTAAGAA GAGCTCCACT TGTCGCTAAG GCAGTTTCAG AAGCAATTAA AAACAAAAAT
CTAAGTACCT ATCATATTGC TAGAAAAGGT TGGGAAACTT TATGGTCAAA AGAATTAATT
AGGAAGAAAT CACTTTACCA ATTTGGATTA GAAAAACTCA TGAGGTTTGA CGAGAAACTG
TTGAGAGAAT TTTTTGGAAG TTTTTTCCAA CTACCTAAAA ATCAATGGTA TGGTTTTCTA
ACTGATACTC TTTCTTTAAA AGAGATTGTG TATGCGATGT GCGTAATGTT TATAAAGGCT
CCATGGAGTG TAAAGAAGGG TCTTATGATT ATGCATGGAA GAGAATTTAA AATGTTACTT
AGGATAATAT TTCCAAACAT ATAG
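
The GC content field in the record can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for sequence text laid out in space-separated 10-base blocks; `gc_fraction` is a hypothetical helper, and the example is run only on the first row of the sequence, so its result is not expected to match the whole-gene figure of 35%.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First row of the gene sequence above (60 bases).
first_row = "ATGTCAAAAG AAAATATGCC AGATGTTCTT GTTTTGGGTG CAGGGCCTGC AGGTATGGCT"
print(f"{gc_fraction(first_row):.0%}")  # 45%
```

Passing the full gene sequence to the same function would give the value to compare against the GC content field.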
 
Protein sequence
MSKENMPDVL VLGAGPAGMA IASALGKEKL DVEVLSPNGP DEPWPNTYGI WGKEVDQLGL 
QDLLEYRWKN TVSFFGHGAL EEQDDENKAT EHSLDYGLFD KKKLHNYWFN ECNKSFIKWH
QGFANKIHFE KYKSTVTTKD GKIYSARLVV DATGYDPVFL KLKSCGPLAV QTCYGIVGNF
SKPPLKKGQF VLMDYRNDHL NDEQKKEPPT FLYAMDMGDG KYFLEETSLG LVNPLTMENL
KERLEKRLSY RNISITSMQH EELGLFLPMN MPIPDFKQQI LGYGGAASMV HPASGYLIGN
VLRRAPLVAK AVSEAIKNKN LSTYHIARKG WETLWSKELI RKKSLYQFGL EKLMRFDEKL
LREFFGSFFQ LPKNQWYGFL TDTLSLKEIV YAMCVMFIKA PWSVKKGLMI MHGREFKMLL
RIIFPNI
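
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. As a minimal sketch, the code below builds a codon table from the standard NCBI amino-acid string (table 11 shares these assignments with the standard code and differs in its permitted start codons) and translates the first and last stretches of the gene. `translate` is an illustrative helper, not a library function, and it stops at the first stop codon.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 24 bases of the gene sequence above.
print(translate("ATGTCAAAAG AAAATATGCC AGATGTTCTT"))  # MSKENMPDVL
print(translate("AGGATAATAT TTCCAAACAT ATAG"))        # RIIFPNI
```

The two outputs match the first ten and last seven residues of the protein sequence listed above.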