Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_15221 |
Symbol | |
ID | 4780699 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1237265 |
End bp | 1238509 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640084804 |
Product | putative lycopene beta cyclase |
Protein accession | YP_001015344 |
Protein GI | 124026228 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAATTGA ATAATGTCGC TGATGTATTA GTCATGGGAG CAGGACCTGC TGCTCTTTGC ATCGCTGCTG AGTTAGTTCA ACACGGACTT GATGTTCAGG CGATTGCTTC TAAGTCTCCT TTAGAACCTT GGCCAAACAC TTATGGGATT TGGGCATCTG AACTTGAGTC TTTAAATATG CAAGAACTAT TGAAATATAG ATGGGAAGAT ACTGTTAGCT TTTTTGGAGA TGGATTAGGT GGAAAAGGTA ATATTTGTAC AAATCATTAT CTTGATTACG GCCTCTTTAA TTCAATTAAT TTTCAGGAGG CTCTTCTTGA GAGATGTAAT GGACTTCCTT GGCAACTAGA AACTGTTGAT AATATTGATT TTAGAGAAAG AGAGACCGTT GTTATTTGTA CTTCCGGAAA AAAATATTTT GCTAGGCTCG TTATTGATGC AAGTGGTTAT AAAACCCCTT TTATCAGGAG GCCTAAGCAT GATCAAATCG CTAAGCAAGC GGCATATGGG GTGGTTGGGA AATTTAGTTC TGCTCCTGTA GAGAAAAATC GTTTTGTATT GATGGATTTT AGATCAGACC ATTTAAATGC CAACGAATTA GAGGAGCCAC CTTCTTTCCT TTATGCTATG GACCTTGGAG ATGGTAGTTA TTTTGTAGAA GAAACATCTT TGGCTTGTTC ACCTCCAATT TCATTTGAAT CATTAAAAGC AAGATTAAAT TTACGACTAT CTAATAAAGG TATTCAAATA GACGAAATTT TCCATGAAGA ACATTGTCTT TTTCCAATGA ACTTGCCATT GCCTTATAGA GATCAACCCC TTTTGGCCTT TGGAGGCTCG GCCAGTATGG TTCATCCTGC TTCGGGATAT CTTGTTGGAT CCCTTTTAAG GAGAGCACCT TCATTAGCAA GTGAAATAGC AAAAGTAATT AAAAAAGAAC CTCTTATGAC TACATCTCAG ATAGCCATAA GAGGATGGAA AACCCTATGG ACAAATGAAT TAGTTCAAAG ACATCGTCTT TATCAGTTTG GACTTCAAAG ACTAATGAGC TTTGACGAAA CTTTATTAAG ATCTTTTTTT GATACTTTTT TTAAATTACC TAAAAAAGAT TGGTTCGGAT ATTTAACTAA TACGCTTCCT TTGCCAAGAC TTTTTATTGT GATGCTCAAA CTATTTTACA TCGCCCCATC CAAGGTCAGG TTGGGAATGA CTGGTTTACT TATAAACAAG CGTGAAAAAA CTTAG
|
Protein sequence | MKLNNVADVL VMGAGPAALC IAAELVQHGL DVQAIASKSP LEPWPNTYGI WASELESLNM QELLKYRWED TVSFFGDGLG GKGNICTNHY LDYGLFNSIN FQEALLERCN GLPWQLETVD NIDFRERETV VICTSGKKYF ARLVIDASGY KTPFIRRPKH DQIAKQAAYG VVGKFSSAPV EKNRFVLMDF RSDHLNANEL EEPPSFLYAM DLGDGSYFVE ETSLACSPPI SFESLKARLN LRLSNKGIQI DEIFHEEHCL FPMNLPLPYR DQPLLAFGGS ASMVHPASGY LVGSLLRRAP SLASEIAKVI KKEPLMTTSQ IAIRGWKTLW TNELVQRHRL YQFGLQRLMS FDETLLRSFF DTFFKLPKKD WFGYLTNTLP LPRLFIVMLK LFYIAPSKVR LGMTGLLINK REKT
|
| |