Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07411 |
Symbol | |
ID | 5730746 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 644663 |
End bp | 645946 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285104 |
Product | putative lycopene epsilon cyclase |
Protein accession | YP_001550626 |
Protein GI | 159903282 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0418129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.37911 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACTA ATGCAAATAT AGATGCATTA GTTCTGGGTT CTGGACCAGC TGCATTGGCA ATTGCATCTG CGCTTGCCAA TGAGAGACTC TCTGTTCATG TCCTTTCTCC TCTAGACCGC AGGCACACAT GGCCATATAC ATATGGGATA TGGGGAGAGG AAGTAGATGA CCTGGGAATA GGAGACCTTC TAAAACATCG TTGGACTAAT ACTGTCAGTT TCTTTGGTAG CGGTTCAAAA GAAGAGAATT CTCCTAAGAA CAAGGAAACT AGGCACAATC ACGATTATGG GTTGTTTGAC AAAAACAAAT TACAAGCCTA TTGGTTAAAG CAATGTGATG AAGCTCTTGT TGAATGGCAC CTTGGAACAG CTACTAATCT CAAAGTAAAT CAATCTATTA GTACTGTTAC GACATCTGAG GGGGAAGAAG TGACCGCTCG ACTCATAATA GATGCCACTG GATACAAACC TGTCTTCCTA AAAGTTCCTA ATAATGGAGA GGTTGCTGTT CAAACCTGTT TCGGGATAGT AGGCAAATTT ACATCTCCAC CAATAGAAAA AGAGCAATTC GTTTTAATGG ATTACAGAAA TAATCATCTG ACTGAGGCCG AAAAAGATGA GCCTCCTACA TTTCTATATG CCATGGATAT GGGGAATGGA ACTTTTTTTT TAGAAGAAAC CTCGCTAGGC CTAGCTCCAC CAGTCTCACT AGATACTCTC AAAACAAGGC TGGAAAAACG TCTGCAACAT AAAGGGATAC AGATTACTGA AATTGAGCAT GAAGAGCACG GTTTGTTTCT TCCAATGAAT ATTCCCATTC CTTACTTAGA TCAACCAATC CTTGGTTTTG GTGGCGCGGC TGGCATGGTT CACCCAGCTT CTGGATATTT AGTTGGAACT TTGCTAAGAC GAGCACCTTC TGTAGCAAAG GCAATTGCAA AAGCAATGGA AAACGAGCAA GAAAGCCCTG CAGTGATAGC ACAAAAAGGA TGGGAAGCAT TATGGCCAAA AGATTTAAGA AGGAAGCAAG CCTTATATCA ATTTGGACTT GAAAAGTTAA TGCGGTTTAA AGAGTCTCAA TTAAGAGACT TCTTTACTGG CTTTTTTAGT CTCTCAGAAA GCCAGTGGTA TGGTTTTCTG ACAAACTCTC TTACTCTAGG TGAACTTATA AACGCAATGT GGAAAATGTT TACCAAAGCT CCTTTAAATG TAAAATGGGG ATTGTTTGAA ATGAAAGGAA GAGAAATGAA GTTATTACTG AAGTTCCTAA GCCCTGAAGT TTAA
|
Protein sequence | MATNANIDAL VLGSGPAALA IASALANERL SVHVLSPLDR RHTWPYTYGI WGEEVDDLGI GDLLKHRWTN TVSFFGSGSK EENSPKNKET RHNHDYGLFD KNKLQAYWLK QCDEALVEWH LGTATNLKVN QSISTVTTSE GEEVTARLII DATGYKPVFL KVPNNGEVAV QTCFGIVGKF TSPPIEKEQF VLMDYRNNHL TEAEKDEPPT FLYAMDMGNG TFFLEETSLG LAPPVSLDTL KTRLEKRLQH KGIQITEIEH EEHGLFLPMN IPIPYLDQPI LGFGGAAGMV HPASGYLVGT LLRRAPSVAK AIAKAMENEQ ESPAVIAQKG WEALWPKDLR RKQALYQFGL EKLMRFKESQ LRDFFTGFFS LSESQWYGFL TNSLTLGELI NAMWKMFTKA PLNVKWGLFE MKGREMKLLL KFLSPEV
|
| |