Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_0073 |
Symbol | |
ID | 3605480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 626181 |
End bp | 627467 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637686928 |
Product | lycopene cyclase (CrtL-type) |
Protein accession | YP_291268 |
Protein GI | 72381913 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACTA GAGCATTAAA AGATGCCCTA GTACTTGGCT CAGGCCCAGG CGCTTTATCT ATAGCGGCAG CATTAGCGAT TGAGAACTTA GATGTTGAAA TCTTATCTGA ACAATCTCCA GAGGAACCTT GGCCCTTCAC CTATGGGATT TGGGGCGAAG AAGTTGACGA ACTTGGTCTA AGTCATTTAC TTGAACATAG ATGGGTAAAT ACCATTAGTT ATTTTGGCGA GGGAGACAAG GATCCAAATT CTAAAAAGAA TGAAATCACT AAACACAATA GAGATTATGG GCTTTTTGAT AAAAACAAAT TACAAGCTTA TTGGTTAGAA CAATGCAACA ATGCTGAAAT AGAATGGCAT AAAGGATCAG CAGTCAATTT AGAAACGAAT CAATTAATTA GCACAGTTAA AACGTCTAAT GGAAAGGAGC TTAATGCTCG GGTAGTCATA GACGCAACTG GCTACAAACC TGTTTTCATT AAGTCTCCTA ACCAAGGGCC AGTAGCCGTT CAAACTTGTT ACGGAATCGT AGGGGAGTTC AGCGCACCCC CTGTAGAGAA AGGCCAATTT GTCTTAATGG ATTATCGTTG CGACCACTTG AATCCAGAGG AGAGAAAAGA AGCTCCAACA TTTTTATACG CTATGGATAT GGGAAATGGG AAGTTTTTCT TAGAAGAAAC ATCCTTGGGC CTATTTCCTC CAGTATCTCT TGATGAGTTA AAAAGAAGAC TGGAAAAAAG ATTAGAGACT CGGGGTTTAG AAATAAAAAG TCTTGATCAT GAAGAGCATG GTTCATATCT GCCGATGAAC ATGCCAATCC CTTACCTAAC ACAGCCGGTC CTTGGATTTG GCGGTTCTGC TGGAATGGTA CATCCTGCAT CTGGATACAT GGTTGGCAGC CTATTAAGAA GAGCTCCTAA AGTTGCCAAA GCCCTTTCAT TAGCAATGAA AGACCCAAAA GCATCCTCAG CTTCATTAGC AAAAAAAGGA TGGCAAACCT TATGGCCATC AGAACTTAGA AGAAAACAAG CTATTTATAA ATTTGGATTA GAAAAATTGA TGCGCTTTGA AGAGAATTTG CTAAGAGGAT TTTTTATAGA ATTTTTTAGT TTACCTAATA AACAATGGTA TGGATTCCTT ACAAATACTC TTAGCCTTAA AGAACTAATA TCCGCAATGT GGAAGATGTT TAGGAAATCA CCCTGGACTA TCAAACAAGG ATTAATGAAT ATGCATGGTA GAGAATTATA TTTATTATTT AAAGCATTAT TAGTTAATAA TAAATGA
|
Protein sequence | MSTRALKDAL VLGSGPGALS IAAALAIENL DVEILSEQSP EEPWPFTYGI WGEEVDELGL SHLLEHRWVN TISYFGEGDK DPNSKKNEIT KHNRDYGLFD KNKLQAYWLE QCNNAEIEWH KGSAVNLETN QLISTVKTSN GKELNARVVI DATGYKPVFI KSPNQGPVAV QTCYGIVGEF SAPPVEKGQF VLMDYRCDHL NPEERKEAPT FLYAMDMGNG KFFLEETSLG LFPPVSLDEL KRRLEKRLET RGLEIKSLDH EEHGSYLPMN MPIPYLTQPV LGFGGSAGMV HPASGYMVGS LLRRAPKVAK ALSLAMKDPK ASSASLAKKG WQTLWPSELR RKQAIYKFGL EKLMRFEENL LRGFFIEFFS LPNKQWYGFL TNTLSLKELI SAMWKMFRKS PWTIKQGLMN MHGRELYLLF KALLVNNK
|
| |