Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_12221 |
Symbol | crtH |
ID | 4911343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1037032 |
End bp | 1038576 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640160809 |
Product | putative carotenoid isomerase |
Protein accession | YP_001091446 |
Protein GI | 126696560 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02730] carotene isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTAA ATAAGGAAAA CTTCGATGCA ATTATTATTG GCTCAGGAAT AGGAGGGTTA GTAACTGCTT CACAATTGGC AGCGAAAGGA GCTCAAGTAT TAGTTCTTGA AAAATATATT ATTCCAGGAG GGAGTGGAGG CTCTTTTAAA AGAAAAGGCT ATACCTTTGA TGTAGGAGCT TCAATGATTT TTGGATTTGG AGAAAAAGGT TATACCAATT TATTAACTCG TGCATTAAAG GATGTAAACG AAAAATGCGA AACTATTCCG GATCCTGTTC AACTGGAATA TCACTTACCA CATAACTTTA ATATTTCAGT AGATAAAAAT TATGAGCAAT TTATAAGCAA ATTATCAGCT AGTTTCCCCA ATGAAAAAAA AGGTATCAAG AAATTCTATG ATACTTGTGC CAGTGTATTT AAATGTTTAG ATTCAATGCC TCTTTTATCA ATAGAGGATC CAAGTTATCT TTTTAAAGTC TTCTTTAAAT CCCCATTATC CTGTTTAGGG TTAGCTAGAT GGTTACCTGC AAATGCAGGA GATGTTGCGA GAAAGTTTAT AAAAGATCCT GAACTTTTAA AATTTATCGA TATCGAATGT TTTTGTTGGT CTGTAATGCC AGCTCTAAAA ACCCCTATGA TTAATGCAGG GATGGTATTT ACAGATAGGC ATGCTGGGGG TATAAATTAT CCAAAAGGGG GAGTTGGAAC GATAGCAGAG AAGTTTGTTT CTGGTATTGA AAAATTAGGA GGAAAAGTAA GATATAAAGC CAACGTGACT GAAATCCTCT TAAAGGATGA GAAGGCGGTA GGAGTTAAGC TCTCAAATGG AGAAGAGATA TATTCAAATA TTATTGTATC CAACTCTACT AGATGGGATA CATTTGGATT GAAAGATAAT ACTAAAGGAT TAATTTCAAG TGAAAACGTG CCAAAAAGTG AATATAAGTG GTCAGAAACT TATAAACCCT CACCTTCTTT TGTTTCGATT CACCTTGGAG TAGAAAAAAA TCTAATACCC GACAATTTTA ATTGTCATCA TATAATAGTT AAAAATTGGG ATGAATTAGA AAGCGAAAAG GGAGTTATTT TTGTTTCTAT ACCTACTTTG CTTGACTCGT CTTTGGCTCC TGAAGGTAAA CATATCGTGC ATGCATTTAC TCCTTCATCG ATTAGTGAAT GGGAAGGCCT AACAAGGAAA GAATATTTGC AAAAGAAAGA TAAATATTTT TCTTTCCTTG TTGAAAAAAT ATCAACTATT CTTCCTAATC TTGAACAAAA TATTGACCAC AAAGAAATTG GTACTCCCAA AACTCATAAA AAGTTTCTTG GAAGATTTGA AGGTAGTTAT GGGCCAATTC CCAGTAAAAA GTTGCTTGGA CTTTTGCCAA TGCCTTTCAA CACTACAAAA ATTCAAAACC TTTATTGTGT AGGGGATTCA TGTTTCCCTG GCCAAGGCCT AAATGCCGTT GCTTTTAGTG GATACGCATG CGCTCATAAA ATAGGTGCAA AGTTAAACTT AAACAGTTTT AAATTGCCAG ATTAA
|
Protein sequence | MELNKENFDA IIIGSGIGGL VTASQLAAKG AQVLVLEKYI IPGGSGGSFK RKGYTFDVGA SMIFGFGEKG YTNLLTRALK DVNEKCETIP DPVQLEYHLP HNFNISVDKN YEQFISKLSA SFPNEKKGIK KFYDTCASVF KCLDSMPLLS IEDPSYLFKV FFKSPLSCLG LARWLPANAG DVARKFIKDP ELLKFIDIEC FCWSVMPALK TPMINAGMVF TDRHAGGINY PKGGVGTIAE KFVSGIEKLG GKVRYKANVT EILLKDEKAV GVKLSNGEEI YSNIIVSNST RWDTFGLKDN TKGLISSENV PKSEYKWSET YKPSPSFVSI HLGVEKNLIP DNFNCHHIIV KNWDELESEK GVIFVSIPTL LDSSLAPEGK HIVHAFTPSS ISEWEGLTRK EYLQKKDKYF SFLVEKISTI LPNLEQNIDH KEIGTPKTHK KFLGRFEGSY GPIPSKKLLG LLPMPFNTTK IQNLYCVGDS CFPGQGLNAV AFSGYACAHK IGAKLNLNSF KLPD
|
| |