Gene A9601_06891 details

Gene Information

Locus tag: A9601_06891
Symbol:
ID: 4717392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 611395
End bp: 612663
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 35%
IMG OID: 640078402
Product: putative lycopene epsilon cyclase
Protein accession: YP_001009082
Protein GI: 123968224
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID: [TIGR01790] lycopene cyclase family protein
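
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 611395
END_BP = 612663


def span_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1269 422
```

Under these assumptions the computed values agree with the listed gene length (1269 bp) and protein length (422 aa).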


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.895812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAGATG TTCTTGTTTT GGGTGCAGGG CCTGCCGGTA TGGCTATTGC CTCAGCTTTA 
GGGAAGGAAA AATTAGATGT TGAAGTGCTT TCTCCAAATG GACCAGATGA ACCTTGGCCA
AATACATATG GTATTTGGGG GAAAGAAGTT GATCAACTCG GGCTTCAGGA TTTACTTGAA
TATAGATGGA AGAATACTGT AAGTTTTTTT GGGCATGGCG CTTTAGAAGA ACAGGACGAC
GAGAATAAAG CCACGGAACA TTCACTAGAT TATGGACTAT TTGATAAGAA GAAACTCCAT
AATTATTGGT TTAACGAATG CAACAAGTCT TTTATTAAAT GGCATCAAGG TTTTGCAAAC
AAAATACATT TTGAAAAATA CAAAAGTACA GTAACTACAA AAGATGGAAA AACTTACTCT
GCAAGATTAG TAGTAGATGC AACAGGCTAT GATCCTGTTT TTCTTAAATT AAAATCCTGT
GGTCCCTTAG CAGTCCAAAC TTGTTATGGG ATAGTAGGAA ATTTTAGTAA ACCTCCACTT
AAGAAAGGGC AGTTTGTATT AATGGACTAT AGAAATGATC ATCTTAACGA TGAGCAAAAA
AAAGAACCGC CAACTTTTCT TTATGCAATG GATATGGGGG ATGGGAAATA TTTTCTTGAA
GAGACATCTC TTGGTTTGGT AAATCCTCTA ACAATGGAAA ATTTAAAAGA GAGACTAGAG
AAGAGGCTTT CTTATCGAAA TATATCAATC ACAAGCATGC AGCACGAAGA GCTTGGCTTA
TTTCTTCCTA TGAATATGCC AATCCCAGAT TTCAAACAAC AAATACTTGG ATATGGTGGT
GCTGCTTCAA TGGTACATCC TGCATCTGGA TATTTAATTG GTAATGTTTT AAGAAGAGCT
CCACTTGTCG CAAAGGCAGT CTCAGAAGCA ATTAAAAACA AAAATCTAAG TACCTATCAT
ATTGCTAGAA AAGGTTGGGA AACTTTATGG TCAAAAGAAT TAATTAGGAA GAAATCACTT
TACCAATTTG GATTAGAAAA ACTCATGAGG TTTGATGAGA AACTATTGAG AGAATTTTTT
GGCAGTTTTT TCCAACTACC TAAAAATCAA TGGTATGGTT TTCTAACTGA TACTCTTTCT
TTAAAAGAGA TTGTATATGC TATGTGCGTA ATGTTTATAA AGGCTCCATG GAGTGTAAAG
AAAGGTCTTA TGATTATGCA TGGAAGAGAA TTTAAAATGT TACTTAGGAT AATATTTCCA
AACATATAG
 
Protein sequence
MPDVLVLGAG PAGMAIASAL GKEKLDVEVL SPNGPDEPWP NTYGIWGKEV DQLGLQDLLE 
YRWKNTVSFF GHGALEEQDD ENKATEHSLD YGLFDKKKLH NYWFNECNKS FIKWHQGFAN
KIHFEKYKST VTTKDGKTYS ARLVVDATGY DPVFLKLKSC GPLAVQTCYG IVGNFSKPPL
KKGQFVLMDY RNDHLNDEQK KEPPTFLYAM DMGDGKYFLE ETSLGLVNPL TMENLKERLE
KRLSYRNISI TSMQHEELGL FLPMNMPIPD FKQQILGYGG AASMVHPASG YLIGNVLRRA
PLVAKAVSEA IKNKNLSTYH IARKGWETLW SKELIRKKSL YQFGLEKLMR FDEKLLREFF
GSFFQLPKNQ WYGFLTDTLS LKEIVYAMCV MFIKAPWSVK KGLMIMHGRE FKMLLRIIFP
NI
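
The sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such text could be normalized before further analysis is shown below. The helper names are hypothetical, and the stop codon set is the standard TAA/TAG/TGA set, which is assumed to apply to translation table 11.

```python
# Stop codons assumed for translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCAGATG TTCTTGTTTT GGGTGCAGGG CCTGCCGGTA TGGCTATTGC CTCAGCTTTA"
last_line = "AACATATAG"

print(len(clean_sequence(first_line)))            # 60
print(ends_with_stop(clean_sequence(last_line)))  # True
```

To compare against the listed GC content, pass the full cleaned gene sequence to `gc_fraction`.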