Gene Information |
Locus tag | A9601_12211 |
Symbol | crtH |
ID | 4717935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1038185 |
End bp | 1039729 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640078937 |
Product | putative carotenoid isomerase |
Protein accession | YP_001009612 |
Protein GI | 123968754 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02730] carotene isomerase |
|
|
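The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon; the record itself does not state either convention, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1038185, 1039729)
print(length)                  # 1545, matching "Gene Length"
print(protein_length(length))  # 514, matching "Protein Length"
```

Under these assumptions the coordinates reproduce the listed Gene Length of 1545 bp and Protein Length of 514 aa. The strand field does not affect the length calculation.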
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTAA ATAAGGAAAA CTTCGATGCA ATTATTATTG GCTCAGGAAT TGGAGGATTA GTAACTGCTT CACAATTGGC GGCTAAGGGA GCTCAAGTAT TAGTTCTTGA AAAATATATT ATTCCAGGCG GGAGTGGAGG CTCATTTAAG AGAAAAGGCT ATACCTTTGA CGTCGGGGCT TCAATGATTT TTGGATTTGG AGAGAAAGGT TATACCAATT TATTAACTCG TGCTTTGAAA GATGTGAATG AAAAATGCGA AACTATTCCC GATCCTGTTC AACTGGAATA TCACTTGCCA AATAACTTAA ATATTTCTGT AGATAAAAAT TATGAGCAAT TTATAAGCAA ATTATCAGCT ATTTTTCCCA AGGAAAAAAA AGGTATCAAG AAATTCTATG ATACTTGTGC AAGTGTATTT AAATGTTTAG ATTCAATGCC TCTTTTATCA ATAGAGGATC CAAATTATCT TTTTAAAGTT TTCTTTAAAT CTCCATTATC CTGTTTAGGG TTAGCTAGAT GGCTACCTGC AAATGCAGGA GATGTTGCGA GAAAGTTTAT AAAAGATCCA GAACTTTTAA AATTTATTGA TATCGAATGT TTTTGTTGGT CTGTAATGCC AGCTCTAAAA ACCCCTATGA TTAATGCGGG AATGGTATTT ACAGATAGGC ATGCGGGGGG GATAAATTAT CCAAAAGGGG GAGTTGGAAC GATAGCAGAG AAGTTTGTTT CTGGGATTGA AAAATTAGGA GGAAAAGTTA GATATAAAGC CAATGTGACT GAAATCCTCT TAAAGGATGA TAAAGCGGTA GGAGTTAAGC TCTCAAATGG GGAAGAAATA TATTCAGATA TTATTGTATC CAACTCCACT AGATGGGATA CATTTGGATT AAAAGATAAT ACTAAAGGAT TAATTTCTAG TAAAAACGTG CCAAAAAGTG AATATAAGTG GTCAGAAACT TATAAACCCT CACCTTCTTT TGTTTCGATT CACCTTGGAG TAGAAAAAAA TCTAATACCA GATAATTTTA ATTGTCATCA TATAATCGTT GAAAATTGGG ATGAATTAGA AAGCGAAAAG GGAGTTATTT TTGTTTCTAT ACCTACTTTG CTTGACTCGT CTTTGGCTCC AGAAGGTAAA CATATTTTAC ATGCATTTAC TCCTTCATTG ATGAGTGAAT GGGAAGGCCT ATCAAGGAAA GAATATATGC AAAAGAAAGA AAAATATTTT TCTTTTCTTG TTGAAAAAAT ATCAACTATT CTTCCTAATC TTGAACAAAA TATTGATCAC AAAGAAATTG GTACTCCCAA AACTCATAAA AAGTTTCTTG GAAGATATGA AGGTAGTTAT GGGCCAATTC CCAGTAAAAA GTTGCTTGGA CTTTTGCCAA TGCCTTTCAA CACTACAAAA ATTCAAAACC TATATTGTGT AGGGGATTCT TGCTTCCCTG GCCAAGGCCT AAATGCAGTT GCTTTTAGTG GATACGCATG CGCTCACAAA ATAGGTGCAA AGTTAAACAT AAACAGTTTT AAATTGCCCG ATTAA
|
Protein sequence | MELNKENFDA IIIGSGIGGL VTASQLAAKG AQVLVLEKYI IPGGSGGSFK RKGYTFDVGA SMIFGFGEKG YTNLLTRALK DVNEKCETIP DPVQLEYHLP NNLNISVDKN YEQFISKLSA IFPKEKKGIK KFYDTCASVF KCLDSMPLLS IEDPNYLFKV FFKSPLSCLG LARWLPANAG DVARKFIKDP ELLKFIDIEC FCWSVMPALK TPMINAGMVF TDRHAGGINY PKGGVGTIAE KFVSGIEKLG GKVRYKANVT EILLKDDKAV GVKLSNGEEI YSDIIVSNST RWDTFGLKDN TKGLISSKNV PKSEYKWSET YKPSPSFVSI HLGVEKNLIP DNFNCHHIIV ENWDELESEK GVIFVSIPTL LDSSLAPEGK HILHAFTPSL MSEWEGLSRK EYMQKKEKYF SFLVEKISTI LPNLEQNIDH KEIGTPKTHK KFLGRYEGSY GPIPSKKLLG LLPMPFNTTK IQNLYCVGDS CFPGQGLNAV AFSGYACAHK IGAKLNINSF KLPD
|
| |
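The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below is a hedged illustration, not the database's own method: it strips the whitespace, computes GC content, and translates a coding sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA) and uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons). All function names are hypothetical.

```python
def build_codon_table() -> dict:
    """Codon -> amino acid map in the conventional TCAG ordering."""
    bases = "TCAG"
    amino_acids = (
        "FFLLSSSSYY**CC*W"
        "LLLLPPPPHHQQRRRR"
        "IIIMTTTTNNKKSSRR"
        "VVVVAAAADDEEGGGG"
    )
    table = {}
    for i, first in enumerate(bases):
        for j, second in enumerate(bases):
            for k, third in enumerate(bases):
                table[first + second + third] = amino_acids[16 * i + 4 * j + k]
    return table


CODON_TABLE = build_codon_table()


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
prefix = "ATGGAATTAA ATAAGGAAAA CTTCGATGCA"
print(translate(prefix))  # MELNKENFDA
```

Passing the full Gene sequence string to `translate` and `gc_content` in the same way allows a comparison against the listed Protein sequence and GC content fields.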