Gene Information |
Locus tag | P9303_24321 |
Symbol | crtR |
ID | 4776744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2137288 |
End bp | 2138328 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640087951 |
Product | beta carotene hydroxylase |
Protein accession | YP_001018428 |
Protein GI | 124024121 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3239] Fatty acid desaturase |
TIGRFAM ID | |
|
|
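The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the final codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 2137288, End bp 2138328.
length = gene_length_bp(2137288, 2138328)
print(length, protein_length_aa(length))  # 1041 346
```

Under these assumptions the result matches the listed Gene Length (1041 bp) and Protein Length (346 aa).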
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGATGACTC AGGTACTCTC CAACCCGGAT CCTGTTGAAG CTGAAGCCAG CGCTAAGTCG GCCTCTCAAC AGCTAACGGT GCCGAAGGAG TACCTCGATC CGCCTTCCCC TTGGAATCCC ACGGTGGGAT TGTTCCTGGG CGGTTATGGC TTAGCGGCTC TAACGATTTG GCAGTGGTGT TTTGGTGATT GGCCTTTACA AGTTCTGGTG GCATTGGCGT TCTTCGCCTT GCATATGGAA GGCACTGTGA TTCATGACGC TTGTCATAAG GCGGCTCATC CGAATCCTTG GATTAATCAG GCCATGGGTC ATGGTTCAGC AATTTTGCTG GGCTTTAGTT TTCCTGTGTT CACAAGAGTG CACCTTCAGC ATCACTCACA TGTGAATGAC CCCAACAATG ATCCGGATCA CATTGTCAGC ACTTTTGGCC CGCTTTGGTT GATTGCGCCG CGGTTTTTCT ATCATGAATA TTTCTTTTTT CAACGACGTT TATGGCGTCG CTTTGAGCTG ATGCAGTGGG GACTTGAGCG CGGTTTTTTT GTTTCCATTG TGATCGCTGG TCTTCATTTT GACTTCATGA ATTTTGTTTA TAATTGTTGG TTTGGTCCTG CCTTAATGGT TGGGGTCACC CTGGGCCTTT TCTTTGACTA TTTGCCCCAT CGTCCCTTTA CTGCACGGGA CCGTTGGCAT AACGCCAGAG TGTATCCAAG CAGGATGATG AATTGGTTGA TTATGGGTCA GAATTATCAT CTTGTTCATC ATTTATGGCC TTCTATTCCC TGGTTTGAGT ATAAGCCGGC CTACGAAGCA ACAAAACCTC TATTGGATGC CAGGGGATCA CCTCAGCGAC TCGGATTTTT TGAAAGTCGA GCCGATGGGT TTAATTTTCT CTATGACATC GTTTTGGGCG TCCGTAGCCA CACAGCACGT CGTAGCAAGA TGCGTCCACT AGCCAAATTG ATACCCAGCC GCCGCTGGCG GCGACGTTGG ATTGGTTTGT TGCATCGCAC TGCTGTCTTG CCAGAACCCA AGAAACGTTG A
|
Protein sequence | MMTQVLSNPD PVEAEASAKS ASQQLTVPKE YLDPPSPWNP TVGLFLGGYG LAALTIWQWC FGDWPLQVLV ALAFFALHME GTVIHDACHK AAHPNPWINQ AMGHGSAILL GFSFPVFTRV HLQHHSHVND PNNDPDHIVS TFGPLWLIAP RFFYHEYFFF QRRLWRRFEL MQWGLERGFF VSIVIAGLHF DFMNFVYNCW FGPALMVGVT LGLFFDYLPH RPFTARDRWH NARVYPSRMM NWLIMGQNYH LVHHLWPSIP WFEYKPAYEA TKPLLDARGS PQRLGFFESR ADGFNFLYDI VLGVRSHTAR RSKMRPLAKL IPSRRWRRRW IGLLHRTAVL PEPKKR
|
| |
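The relationship between the gene sequence and the protein sequence can be sketched with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the permitted start codons and is read as methionine at the first position. Although the gene lies on the minus strand, the sequence as listed appears to be in coding orientation already, since its first codons translate directly to the start of the listed protein. The helper names below (`clean`, `gc_percent`, `translate_cds`) are hypothetical, and the GC helper simply counts G and C over all bases, which may differ slightly from how the database rounds its GC content field.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        # A valid start codon in the first position is read as methionine.
        aa = "M" if i == 0 and codon in START_CODONS else CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGATGACTC AGGTACTCTC CAACCCGGAT"))  # MMTQVLSNPD
```

Passing the full Gene sequence field to `translate_cds` should reproduce the Protein sequence field, provided the assumptions above hold.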