Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_11831 |
Symbol | |
ID | 4776408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1038744 |
End bp | 1040375 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640086692 |
Product | zeta-carotene desaturase-like protein |
Protein accession | YP_001017197 |
Protein GI | 124022890 |
COG category | [S] Function unknown |
COG ID | [COG3349] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.253964 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAG GTTTTCCGGG TAAAAAGCAA TCTCATGCGG TTGTCATCGG GGCCGGCTGG GCTGGCTGGG GTGCGGCCAA GGCACTCTGC GAAGCGGGTG TTCGCGTGAC CCTGATGGAT GGGATGGCGG ATCCAACCGG CAGCCAGCCC CTCACCACGC CAAGCGGCAA ACCGTTCGAG GCAGGCACCC GGGGGTTCTG GAAGGATTAT CCCAACATCA ATGCCCTGAC GGCAGAGCTC GGGCTCGGCT CGATCTTCAC GGAGTTCACC ACTAGTGCGT TCTGGTCACC CGAAGGCCTG GAGGCGACGG CTCCTGTCTT CGGTGACGCT CCGCTGTGGC CGAGTCCCCT GGGTCAGGTT GCTGCAACGA TCAATAACTT CAAGCGCCTG CCTGTTGCTG ATCGGCTCAG CATCGCCGGC CTGCTCTACG CCATGCTCGA TCTGAACCGC AGCGATGCGG TATACAGGAG CTACGACGCG ATCGATGCGC TGACGCTGTT TAGACAGCTC AGGATCAGTG ATCGCATGAT CGATGATTTC CTGCGGCCCA CGTTGCTGGT GGGGCTGTTT AAGCCGCCGG AGGAGCTGTC GGCCGCTGTC ACGATGGAGC TCCTCTACTA CTACGCCCTG GCGCATCAGG ATTCCTTCGA TGTGCGTTGG ATCAGGTCGA AAAGCATCGC CGAACAGCTG ATTGCGCCGC TCAGTGAGCG GCTGCAGGAG CAGCATCAGC TCGAGGTGCT GGGCGGCACC CTGGCCACCC GGCTGAACAT CTCGCCGGAG ACTCAGGCCA TCTGCTCGGT GGGAACCCGT TCCGTAACAA CTGGGAGCAC CGGTTTAATC GAGGATGTCG ATGCCGTGGT GCTTGCCGTG AGTGCCAAAG GAATGGGTGC CTTGATGGCG CAATCCCCGC AGTGCGGCGC GCTGGCGCCG GAGCTTGTGC GTGCCGCCAC GCTCGGATCG ATCGATGTGG TGTCGATCCG TCTGTGGCTG GATCGCACCG TGCCGGTCGC CGATCCCGCC AATGTGTTCT CACGTTTCAG CGCACTGAGA GGCGCCGGCG CCACCTTCTT CATGTTGGAT CAACTGCAGC GGGAGTCGGA GCAGGCTCTC TGGGGTGATC AGCCAGCGCA GGGTTCGGTG ATCGCCAGCG ATTTCTACAA CGCCTCGGCC ATCGTCGAGC TGAGCGATCA GGAGATCGTC GACTGCCTGA TGCAGGATCT GCTGCCCATA GCGCAGCCTG CTTTCAGGGG GGCCAGGGTC GTGGATCAGG AGGTGCGGCG TTATCCGGGT TCGGTGTCTC TCTTCTCGCC GGGAAGTTTC AGCAAGCGGC CACCAATGGA GACGTCACTG GCTTCGGTGG TCTGCGCCGG CGACTGGGTG CGGATGGGCG AGAAAGAACA TGGTGCTAAA GGCCTTTGTC AGGAACGCGC CTACGTGTGT GGTCTGGAAG CGGGCAACTC ACTGCTCAGG CGCGGGATCG TGAGGGGCGC CGACCTGCCC ACGGCCCTGC AGCACTCTGT GATCCCCATC CGCGCTGATG AACCGCAGGT GCTGCTCGGG CGTGCCCTTA ACAAGCTGGT GATGGATCCC CTCGAGGCCT TCGGGATCCA GTGGCCTTGG TTGGCTAGCT AG
|
Protein sequence | MAEGFPGKKQ SHAVVIGAGW AGWGAAKALC EAGVRVTLMD GMADPTGSQP LTTPSGKPFE AGTRGFWKDY PNINALTAEL GLGSIFTEFT TSAFWSPEGL EATAPVFGDA PLWPSPLGQV AATINNFKRL PVADRLSIAG LLYAMLDLNR SDAVYRSYDA IDALTLFRQL RISDRMIDDF LRPTLLVGLF KPPEELSAAV TMELLYYYAL AHQDSFDVRW IRSKSIAEQL IAPLSERLQE QHQLEVLGGT LATRLNISPE TQAICSVGTR SVTTGSTGLI EDVDAVVLAV SAKGMGALMA QSPQCGALAP ELVRAATLGS IDVVSIRLWL DRTVPVADPA NVFSRFSALR GAGATFFMLD QLQRESEQAL WGDQPAQGSV IASDFYNASA IVELSDQEIV DCLMQDLLPI AQPAFRGARV VDQEVRRYPG SVSLFSPGSF SKRPPMETSL ASVVCAGDWV RMGEKEHGAK GLCQERAYVC GLEAGNSLLR RGIVRGADLP TALQHSVIPI RADEPQVLLG RALNKLVMDP LEAFGIQWPW LAS
|
| |