Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_06991 |
Symbol | |
ID | 4719515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 632583 |
End bp | 633866 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640080377 |
Product | putative lycopene epsilon cyclase |
Protein accession | YP_001011015 |
Protein GI | 123965934 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.28746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCCA AAGGTTTACC AGATGTACTA GTTTTGGGCG CAGGACCGGC AGGGATGGCG ATTGCATCGG CTTTAGGTAA AGAGAAGTTA GAGGTAGAGG TACTCTCACC TAAGGGCCCT GACGAACCTT GGCCAAATAC TTATGGTATA TGGGGAAAGG AAGTAGATCA ACTTGGACTT CAGGATTTAC TTGAGTATAG ATGGAAAAAT ACCGTTAGTT TTTTTGGACA TGGTTCTATT GAAGAACATC ACTATGAAAA CAAAGCAACC GAGCACTCTT TAGATTATGG ATTATTTGAT AAGAAGAAAT TACATAGTTA CTGGTTAAAT GAATGTAATA AGTCCCTTAT TAAGTGGCAT GAAGGTTTTG CAGAGAAGAT AAATTTTGAA AAATATAAAA CTACAGTTAC TACCAAAAAT GGGGAAACTT ACTCAGCAAG ATTAGTTGTA GATGCAACTG GATATGATCC CGTATTTCTT AAATTAAAAT CATGTGGACC ATTAGCAGTT CAGACTTGTT ATGGGATAGT AGGAACTTTT AGTAAGCCTC CACTAAAAAA AGGCCAATTT GTCCTAATGG ATTATCGAAA TGATCATTTA AACGAAGAAC AAAAAAAAGA ACCTCCTACT TTTTTGTATG CAATGGATAT GGGGAATGGT AAGTATTTTC TTGAAGAGAC TTCTCTTGGA TTAGTAAATC CTTTGACAAT GGAAAATCTA AAGGAAAGGT TAGAGAAAAG GCTTTCTTAC AGAAATATAT CAATTACCAG CATGCAACAT GAAGAACTCG GTTTGTTTTT ACCAATGAAT ATGCCTATCC CAAATTTCAA ACAACAAATA TTAGGCTATG GTGGCGCGGC CTCAATGGTT CATCCAGCAT CTGGATATTT AATTGGTAAC GTCCTCAGAA GAGCTCCATT AGTCGCAAAA GCCATATCAA CGGCAATGAA TGATAAGAAA CTAAGTACCT ATAATATTGC TCGTAAAGGA TGGGAGAGTT TATGGCCAAC AGAATTAATA AGAAAAAAAT CAATTTATCA ATTTGGTTTA GAAAAACTAA TGAGATTCGA TGAGAAATTA TTAAGAGAAT TTTTTGGCAG TTTTTTTCAA TTACCAAAAA CTCAATGGTA TGGATTTCTA ACTGATACAC TCTCCTTAAG AGAAATAGTA TACGCAATGT GCATAATGTT TATAAGGGCT CCTTGGAGCG TCAAAAAGGG ATTAATGATT ATGCATGGTA AAGAATTAAA AATGTTACTC AGGATAGTTT TGCCTAATAT ATAA
|
Protein sequence | MSSKGLPDVL VLGAGPAGMA IASALGKEKL EVEVLSPKGP DEPWPNTYGI WGKEVDQLGL QDLLEYRWKN TVSFFGHGSI EEHHYENKAT EHSLDYGLFD KKKLHSYWLN ECNKSLIKWH EGFAEKINFE KYKTTVTTKN GETYSARLVV DATGYDPVFL KLKSCGPLAV QTCYGIVGTF SKPPLKKGQF VLMDYRNDHL NEEQKKEPPT FLYAMDMGNG KYFLEETSLG LVNPLTMENL KERLEKRLSY RNISITSMQH EELGLFLPMN MPIPNFKQQI LGYGGAASMV HPASGYLIGN VLRRAPLVAK AISTAMNDKK LSTYNIARKG WESLWPTELI RKKSIYQFGL EKLMRFDEKL LREFFGSFFQ LPKTQWYGFL TDTLSLREIV YAMCIMFIRA PWSVKKGLMI MHGKELKMLL RIVLPNI
|
| |