Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_06951 |
Symbol | |
ID | 4779627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 638527 |
End bp | 639813 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083971 |
Product | putative lycopene epsilon cyclase |
Protein accession | YP_001014520 |
Protein GI | 124025404 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.261974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACTA GAGCATTAAA AGATGCCCTA GTACTTGGCT CAGGCCCAGG CGCTTTATCA ATAGCGGCAG CATTAGCGAT TGAGAACTTA GATGTTGAAC TCTTATCTGA ACAATCGCCA GAGGAACCTT GGCCCTTCAC TTATGGGATT TGGGGCGAAG AAGTTGACGA ACTAGGTCTA AGTCATTTAC TTGAACATAG ATGGGTAAAT ACCATTAGTT ATTTTGGGGA GGGAGACAAG GATCCAAATT CTAAAAAGAA TGAAATCACT AAACACAACA GAGATTATGG GCTTTTTGAT AAAAACAAAT TACAAGCTTA TTGGTTAGAA CAATGCAACA ATGCTGAAAT AGAATGGCAT AAAGGATCAG CAGTCAATTT AGAAACGAAT CAATTAACCA GCACAGTTAA AACGTCTAAT GGAAAGGAAC TTAATGCTCG AGTAGTCATA GACGCAACTG GCTACAAACC TGTTTTTATT AAGTCTCCTA ACCAAGGACC AGTAGCCGTT CAAACTTGTT ACGGAATCGT AGGGGAGTTC AACGCACCCC CTGTAGAGAA AGGCCAATTT GTTTTAATGG ATTATCGTTG CGACCACTTG AATCCAGAGG AGAGAAAAGA AGCTCCAACA TTTTTATACG CTATGGATAT GGGAAATGGG AAGTTTTTCT TAGAAGAAAC ATCCTTGGGT CTATTTCCTC CAGTATCTCT TGATGAGTTA AAAAGAAGAC TGGAAAAAAG ATTAGCGACT CGGGGGTTAG AAATAAAAAG TCTTGATCAT GAAGAGCATG GTTCATATCT GCCAATGAAC ATGCCAATCC CTGACCTAAC ACAGCCAGTC CTTGGATTTG GCGGTTCTGC TGGGATGGTA CATCCTGCAT CTGGATACAT GGTTGGCAGC CTATTAAGAA GAGCTCCTAA AGTTGCCAAA GCCCTTTCAT TAGCAATGAA AGACCCAAAA GCATCCTCAG CTTCATTAGC AAAAAAAGGA TGGCAAACCT TATGGCCATC AGAACTTAGA AGAAAACAAG CTATTTATAA ATTTGGATTA GAAAAATTGA TGCGCTTTGA AGAGAATTTG CTAAGAGGAT TTTTTATCGA GTTTTTTAGT TTACCTAATA AACAATGGTA TGGATTCCTT ACAAATACTC TTAGCCTTAA AGAACTAATA TCCGCAATGT GGAAGATGTT TAGGAAATCA CCCTGGACTA TCAAACAAGG CTTAATGAAT ATGCATGGTA GAGAATTAAA TTTATTATTT AAAGCATTAT TAGTTAATAA TAAATGA
|
Protein sequence | MSTRALKDAL VLGSGPGALS IAAALAIENL DVELLSEQSP EEPWPFTYGI WGEEVDELGL SHLLEHRWVN TISYFGEGDK DPNSKKNEIT KHNRDYGLFD KNKLQAYWLE QCNNAEIEWH KGSAVNLETN QLTSTVKTSN GKELNARVVI DATGYKPVFI KSPNQGPVAV QTCYGIVGEF NAPPVEKGQF VLMDYRCDHL NPEERKEAPT FLYAMDMGNG KFFLEETSLG LFPPVSLDEL KRRLEKRLAT RGLEIKSLDH EEHGSYLPMN MPIPDLTQPV LGFGGSAGMV HPASGYMVGS LLRRAPKVAK ALSLAMKDPK ASSASLAKKG WQTLWPSELR RKQAIYKFGL EKLMRFEENL LRGFFIEFFS LPNKQWYGFL TNTLSLKELI SAMWKMFRKS PWTIKQGLMN MHGRELNLLF KALLVNNK
|
| |