Gene Information |
Locus tag | P9301_06601 |
Symbol | |
ID | 4911084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 585814 |
End bp | 587097 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640160241 |
Product | putative lycopene epsilon cyclase |
Protein accession | YP_001090884 |
Protein GI | 126695998 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
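The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function name `expected_lengths` is hypothetical and not part of any database tooling.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1   # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3             # includes the stop codon
    protein_length = codons - 1           # stop codon encodes no residue
    return gene_length, protein_length


# Values taken from the record: Start bp 585814, End bp 587097
gene_len, protein_len = expected_lengths(585814, 587097)
print(gene_len, protein_len)  # 1284 427
```

The result agrees with the Gene Length (1284 bp) and Protein Length (427 aa) fields. The strand field does not affect this calculation, since the length is the same on either strand.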
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAAG AAAATATGCC AGATGTTCTT GTTTTGGGTG CAGGGCCTGC AGGTATGGCT ATTGCCTCAG CTTTAGGTAA GGAAAAATTA GATGTTGAAG TGCTTTCTCC AAATGGACCA GATGAGCCTT GGCCAAATAC ATATGGCATT TGGGGGAAAG AAGTTGATCA ACTCGGGCTT CAGGATTTAC TTGAATATAG ATGGAAGAAT ACTGTAAGTT TTTTTGGGCA TGGCGCTTTA GAAGAGCAGG ACGACGAAAA TAAAGCCACG GAACATTCAC TAGATTATGG ATTATTTGAT AAGAAGAAAC TCCACAATTA TTGGTTTAAT GAATGCAATA AGTCTTTTAT TAAATGGCAT CAAGGCTTTG CCAACAAAAT ACATTTTGAA AAATACAAAA GTACAGTAAC TACAAAAGAT GGCAAAATTT ACTCTGCAAG ATTAGTAGTA GATGCAACAG GGTATGATCC TGTTTTTCTA AAATTAAAAT CATGTGGTCC CTTAGCTGTC CAAACTTGTT ATGGGATAGT AGGTAATTTT AGTAAACCTC CACTTAAGAA AGGGCAGTTT GTATTAATGG ACTATAGAAA TGACCATCTT AACGATGAGC AAAAAAAAGA ACCGCCAACT TTTCTTTATG CCATGGATAT GGGGGATGGG AAATATTTTC TAGAAGAGAC ATCTCTTGGT TTAGTAAATC CTCTAACAAT GGAAAATTTA AAAGAGAGAC TAGAGAAGAG GCTTTCTTAT CGAAATATAT CAATCACAAG CATGCAACAC GAAGAGCTTG GCTTATTTCT TCCTATGAAT ATGCCAATCC CAGATTTCAA ACAACAAATA CTTGGATACG GTGGTGCTGC TTCAATGGTT CATCCTGCAT CTGGATATTT AATTGGTAAT GTTTTAAGAA GAGCTCCACT TGTCGCTAAG GCAGTTTCAG AAGCAATTAA AAACAAAAAT CTAAGTACCT ATCATATTGC TAGAAAAGGT TGGGAAACTT TATGGTCAAA AGAATTAATT AGGAAGAAAT CACTTTACCA ATTTGGATTA GAAAAACTCA TGAGGTTTGA CGAGAAACTG TTGAGAGAAT TTTTTGGAAG TTTTTTCCAA CTACCTAAAA ATCAATGGTA TGGTTTTCTA ACTGATACTC TTTCTTTAAA AGAGATTGTG TATGCGATGT GCGTAATGTT TATAAAGGCT CCATGGAGTG TAAAGAAGGG TCTTATGATT ATGCATGGAA GAGAATTTAA AATGTTACTT AGGATAATAT TTCCAAACAT ATAG
|
Protein sequence | MSKENMPDVL VLGAGPAGMA IASALGKEKL DVEVLSPNGP DEPWPNTYGI WGKEVDQLGL QDLLEYRWKN TVSFFGHGAL EEQDDENKAT EHSLDYGLFD KKKLHNYWFN ECNKSFIKWH QGFANKIHFE KYKSTVTTKD GKIYSARLVV DATGYDPVFL KLKSCGPLAV QTCYGIVGNF SKPPLKKGQF VLMDYRNDHL NDEQKKEPPT FLYAMDMGDG KYFLEETSLG LVNPLTMENL KERLEKRLSY RNISITSMQH EELGLFLPMN MPIPDFKQQI LGYGGAASMV HPASGYLIGN VLRRAPLVAK AVSEAIKNKN LSTYHIARKG WETLWSKELI RKKSLYQFGL EKLMRFDEKL LREFFGSFFQ LPKNQWYGFL TDTLSLKEIV YAMCVMFIKA PWSVKKGLMI MHGREFKMLL RIIFPNI
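The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record
fragment = "ATGTCAAAAG AAAATATGCC AGATGTTCTT"
print(translate(fragment))  # MSKENMPDVL
```

The output matches the first block of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.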