Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_06891 |
Symbol | |
ID | 4717392 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 611395 |
End bp | 612663 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640078402 |
Product | putative lycopene epsilon cyclase |
Protein accession | YP_001009082 |
Protein GI | 123968224 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | [TIGR01790] lycopene cyclase family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.895812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGATG TTCTTGTTTT GGGTGCAGGG CCTGCCGGTA TGGCTATTGC CTCAGCTTTA GGGAAGGAAA AATTAGATGT TGAAGTGCTT TCTCCAAATG GACCAGATGA ACCTTGGCCA AATACATATG GTATTTGGGG GAAAGAAGTT GATCAACTCG GGCTTCAGGA TTTACTTGAA TATAGATGGA AGAATACTGT AAGTTTTTTT GGGCATGGCG CTTTAGAAGA ACAGGACGAC GAGAATAAAG CCACGGAACA TTCACTAGAT TATGGACTAT TTGATAAGAA GAAACTCCAT AATTATTGGT TTAACGAATG CAACAAGTCT TTTATTAAAT GGCATCAAGG TTTTGCAAAC AAAATACATT TTGAAAAATA CAAAAGTACA GTAACTACAA AAGATGGAAA AACTTACTCT GCAAGATTAG TAGTAGATGC AACAGGCTAT GATCCTGTTT TTCTTAAATT AAAATCCTGT GGTCCCTTAG CAGTCCAAAC TTGTTATGGG ATAGTAGGAA ATTTTAGTAA ACCTCCACTT AAGAAAGGGC AGTTTGTATT AATGGACTAT AGAAATGATC ATCTTAACGA TGAGCAAAAA AAAGAACCGC CAACTTTTCT TTATGCAATG GATATGGGGG ATGGGAAATA TTTTCTTGAA GAGACATCTC TTGGTTTGGT AAATCCTCTA ACAATGGAAA ATTTAAAAGA GAGACTAGAG AAGAGGCTTT CTTATCGAAA TATATCAATC ACAAGCATGC AGCACGAAGA GCTTGGCTTA TTTCTTCCTA TGAATATGCC AATCCCAGAT TTCAAACAAC AAATACTTGG ATATGGTGGT GCTGCTTCAA TGGTACATCC TGCATCTGGA TATTTAATTG GTAATGTTTT AAGAAGAGCT CCACTTGTCG CAAAGGCAGT CTCAGAAGCA ATTAAAAACA AAAATCTAAG TACCTATCAT ATTGCTAGAA AAGGTTGGGA AACTTTATGG TCAAAAGAAT TAATTAGGAA GAAATCACTT TACCAATTTG GATTAGAAAA ACTCATGAGG TTTGATGAGA AACTATTGAG AGAATTTTTT GGCAGTTTTT TCCAACTACC TAAAAATCAA TGGTATGGTT TTCTAACTGA TACTCTTTCT TTAAAAGAGA TTGTATATGC TATGTGCGTA ATGTTTATAA AGGCTCCATG GAGTGTAAAG AAAGGTCTTA TGATTATGCA TGGAAGAGAA TTTAAAATGT TACTTAGGAT AATATTTCCA AACATATAG
|
Protein sequence | MPDVLVLGAG PAGMAIASAL GKEKLDVEVL SPNGPDEPWP NTYGIWGKEV DQLGLQDLLE YRWKNTVSFF GHGALEEQDD ENKATEHSLD YGLFDKKKLH NYWFNECNKS FIKWHQGFAN KIHFEKYKST VTTKDGKTYS ARLVVDATGY DPVFLKLKSC GPLAVQTCYG IVGNFSKPPL KKGQFVLMDY RNDHLNDEQK KEPPTFLYAM DMGDGKYFLE ETSLGLVNPL TMENLKERLE KRLSYRNISI TSMQHEELGL FLPMNMPIPD FKQQILGYGG AASMVHPASG YLIGNVLRRA PLVAKAVSEA IKNKNLSTYH IARKGWETLW SKELIRKKSL YQFGLEKLMR FDEKLLREFF GSFFQLPKNQ WYGFLTDTLS LKEIVYAMCV MFIKAPWSVK KGLMIMHGRE FKMLLRIIFP NI
|
| |