Gene Information |
Locus tag | P9303_23511 |
Symbol | |
ID | 4778318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2070654 |
End bp | 2071934 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087872 |
Product | putative lycopene epsilon cyclase |
Protein accession | YP_001018351 |
Protein GI | 124024044 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
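
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 2070654
end_bp = 2071934

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1281
print(gene_length_bp % 3)  # 0, the length is a whole number of codons
print(protein_length_aa)   # 426
```

Both results agree with the Gene Length (1281 bp) and Protein Length (426 aa) fields.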
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.96532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGAGG CAGTGGTGGA TGTCTTAGTG CTTGGCGCTG GTCCGGCGGC CCTGGCCATT GCTGCAGCAA TGGGCAAGGA GGGTTTACAG GTCTCAGCAC TGACGGTTGG CAATCCTCGC GAGCCATGGC CATATACCTA TGGCATTTGG GGTGATGAGG TGGATGCCTT TGATATGGGG CACCTGCTCG AGCACCGCTG GTCGAACACA GTGAGTTTTT TTGGTCCAGG CGCCTCTGAC CCCAATGCTG ACGCGAATCG GCCCAGTCTT CATCATCGGG ATTACGGCCT GTTCGACAAG ATCAAGCTCC AGGAGCATTG GTTGCAGCAA TGCGAAGCTG CTGGTTTGAC TTGGCATCAA GGTCTCGCAA CTGATTTGGC TGTTGATGCC ACCGTTAGTA CTGTGACCAC CGCTGAAGGT CTTGAGCTGC AAGCTCGTTT GGTGGTTGAT GCAACTGGCT ATAAGCCTGT ATTTCTGCGT CATATAGATC ATGGGCCGGT GGCGGTCCAG ACCTGCTTTG GCGTGGTGGG ACGTTTCAAT AAGCCGCCTG TAGAGCCTGG GCAGTTTGTC TTGATGGACT ACCGCTGTGA TCATCTGAGC CCTGCGGAGA AGGCTGAGCC ACCAACGTTT CTTTATGCAA TGGACTTTGG TGGGGGATGC TTCTTCTTAG AGGAAACTTC GCTAGGGCTC GCACCTCCGG TGTCATTAGA AACGCTGCGT TCACGCTTGG AGCGGCGATT GGCTCATCAA GGCTTAACGA TCACAGAACT GCAGCACGAG GAGCTTGGTT ATTTCCTGCC GATGAATTTG CCTCTACCTG ACTTGCAGCA ACCGCTGCTT GGCTTCGGAG GCTCGGCGGC GATGGTGCAC CCTGCTTCGG GCTATTTGGT AGGCAGCATG CTGCGCCGCG CACCTTATGT TGCAAAGGCT GTGGCTGAAG CCATGGCCGA TCCAGTGGCG GGGCCGGCGG TGCTCGCGGC TGCAGGGTGG GAGACTCTTT GGCCCAAGGA GTTGCGTCGT AAGCATGCCC TTTATCAATT TGGGCTTGAG AAGCTGATGC GTTTTAAAGA GCCCCAACTG CGTGATTTCT TTATCAGTTT CTTTGCTTTG CCGAGCGATG AATGGTACGG CTTTTTGACC AACACTCTTA GCCTGCGCGA ATTGGTCGCT GCCATGGTGA ATATGTTTGT TAGCGCCCCT TGGAGTGTTC GCTGGGGCCT AATGGGTATG CAGGGGCGGG AGCTGAAATT GCTTTCACGC TTTCTTTTCC CGCCTCGCTA A
Protein sequence | MTEAVVDVLV LGAGPAALAI AAAMGKEGLQ VSALTVGNPR EPWPYTYGIW GDEVDAFDMG HLLEHRWSNT VSFFGPGASD PNADANRPSL HHRDYGLFDK IKLQEHWLQQ CEAAGLTWHQ GLATDLAVDA TVSTVTTAEG LELQARLVVD ATGYKPVFLR HIDHGPVAVQ TCFGVVGRFN KPPVEPGQFV LMDYRCDHLS PAEKAEPPTF LYAMDFGGGC FFLEETSLGL APPVSLETLR SRLERRLAHQ GLTITELQHE ELGYFLPMNL PLPDLQQPLL GFGGSAAMVH PASGYLVGSM LRRAPYVAKA VAEAMADPVA GPAVLAAAGW ETLWPKELRR KHALYQFGLE KLMRFKEPQL RDFFISFFAL PSDEWYGFLT NTLSLRELVA AMVNMFVSAP WSVRWGLMGM QGRELKLLSR FLFPPR
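
The sequences above are displayed in space-separated blocks of 10 characters, so they need light cleanup before use. The following is a minimal sketch, not a full parser. It joins the blocks, checks the start and stop codons, and defines a GC-fraction helper. Only the first three and last two blocks of the gene sequence are used, copied from the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the start codon set is a commonly used subset for bacterial translation table 11, not the complete list.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks and last two blocks of the gene sequence in the record.
head = clean_sequence("GTGACAGAGG CAGTGGTGGA TGTCTTAGTG")
tail = clean_sequence("CGCCTCGCTA A")

# Assumption: a commonly used subset of table 11 start codons.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(head[:3], head[:3] in COMMON_START_CODONS)  # GTG True
print(tail[-3:], tail[-3:] in STOP_CODONS)        # TAA True
```

The GTG start codon explains why the protein sequence begins with M even though the gene does not begin with ATG: an initiating GTG is translated as methionine. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the GC content field.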