Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_16851 |
Symbol | |
ID | 5730034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1510140 |
End bp | 1511651 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641286066 |
Product | phytoene dehydrogenase and related protein |
Protein accession | YP_001551570 |
Protein GI | 159904226 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02733] C-3',4' desaturase CrtD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAATG AGCAAGTAAT TGTTGTTGGA GGCGGAATCG CAGGGTTAAC TGCATCTGCT TTATTGGCTC ATGAAGGATT TAATGTAATT CTTTTAGAAT CCCATTATCA GACAGGAGGG TGTGCAGGGA CGTTTAAAAG AGGAAATTAC GTTTTTGATG TAGGTGCAAC TCAAATTGCA GGATTAGAGA AAGGAGGTAT TCATGAACGG CTATTTAGAT ATCTTGATTG GCCTCAACCT TTGGGGTCAA TTCTCGATCC AGGTTGTGAA GTAGACCTTA GAGACGGCTC TCGGCCAATA AAAATCTGGC ATGATCCTGT GCGATGGGAG ATAGAACGAA ATCAACAATT TCCAGGTAGT AAGCGTTTTT GGTCTCTTTG CAGTTTGATT CACCAAAGCA ATTGGTCTTT TGCAGAAAAA GACACAGTAT TACCAGTTTC AGACTTTTGG GATTTTAAGC AGTTCATCAG TTCAGTTGAG CCTTTAACCT TTTTGTTTGG CTTATTAAGT AGGTCATCAG TTTCTGATCT TCTTTGGCTC TGTTCTTGCC CTCAAGATCA ACGGCTCCAG AAATTTCTTG ATTTACAACT TCAACTTTAT TCTCAAGAAA AAGCAGATCA AACAGCTGCA CTTTATGGCG CCACTGTGCT TCAGATGGCA CAGGCACCAC GTGGCTTATG GCATTTGCAG GGTTCTATGA AGAGTTTGAG TGATGCATTG GAATCTTGCT TGCTTCGTGA TGGAGGTAAA TTACTTCTGA GGCATAAAGT TACTGCGATA GAGACAATTA GTACGACAGA TTCTTGGATT GTAGATGTTG TCGATGGTAA TGGTTCTACA TCCCAACTGG TTGCTTCTGA CTTAGTATTC ACTCTTCCTC CTCAATGTCT TTTGGGGTTA ACATCAGGCA ATTCATCTTT ACCTACTAAC TATCGGGACC GCATAGCTAA ATTACCTCAA CCTAATGGCG CGATAGTTTT TTATGGAGCG ATTTCGCGTC AGTACCTTCA AGAAATTCAT TCAAACCATT ATCAATTCTT TGTAGAAGAT CTTGGCTCAT TATTCCTTTC GATTAGTTGT GAAGGAGATG GTCGTGCACC TCTAGGTGAG GCAACTATCA TTGCAAGTGC TTTTACTTCT GTAGATTTCT GGGCATCTTT GGATGATCAG ATTTACCGAA AAGAAAAACA AGTGTTTTTG CAGAAAATTT TAAAAGCCAT CAAATCAGTT TTTAATATTC ATCCTGATGA TTGGTTACAT AAAGAATTGG CAACTCCAAG AAGTTTTGCA AAATGGACAG GCCGGCCACA AGGAATTGTA GGTGGATTGG GACAGAGGCC AACTACTTTT GGCCCGTTTG GATTATCTAG TAGATCTCCT GTGAAGGGTT TATGGCTCTG TGGAGATTCA ATTTATCCGG GGGAAGGCAC TGCTGGAGTT TCGCAATCTG CATTAATGGC CTGCCGACAG CTAATGGCTA CCAAGGGGCA CCCTTTCAGA TTACCCAAAT AG
|
Protein sequence | MTNEQVIVVG GGIAGLTASA LLAHEGFNVI LLESHYQTGG CAGTFKRGNY VFDVGATQIA GLEKGGIHER LFRYLDWPQP LGSILDPGCE VDLRDGSRPI KIWHDPVRWE IERNQQFPGS KRFWSLCSLI HQSNWSFAEK DTVLPVSDFW DFKQFISSVE PLTFLFGLLS RSSVSDLLWL CSCPQDQRLQ KFLDLQLQLY SQEKADQTAA LYGATVLQMA QAPRGLWHLQ GSMKSLSDAL ESCLLRDGGK LLLRHKVTAI ETISTTDSWI VDVVDGNGST SQLVASDLVF TLPPQCLLGL TSGNSSLPTN YRDRIAKLPQ PNGAIVFYGA ISRQYLQEIH SNHYQFFVED LGSLFLSISC EGDGRAPLGE ATIIASAFTS VDFWASLDDQ IYRKEKQVFL QKILKAIKSV FNIHPDDWLH KELATPRSFA KWTGRPQGIV GGLGQRPTTF GPFGLSSRSP VKGLWLCGDS IYPGEGTAGV SQSALMACRQ LMATKGHPFR LPK
|
| |