Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_05841 |
Symbol | crtH |
ID | 5731087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 543977 |
End bp | 545527 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641284944 |
Product | putative carotenoid isomerase |
Protein accession | YP_001550469 |
Protein GI | 159903125 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02730] carotene isomerase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.515676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACCT ATAACCAAAA CAATCACTGG GATGTAATAG TAATTGGCTC TGGGATAGGA GGACTCGTTA CTGCAACCCA ACTAGCCGCA AAAGGTGTAA GAGTATTAGT TTTAGAGAGT TACACAATTC CTGGCGGCAG TAGTGGCGCG TTCTCTAGAA ATGGCTACAC ATTTGATGTA GGGGCTTCAA TGATTTTTGG TTTTGGGAAG GAGGGATATA CAAATCTTTT AACTCGCGCG CTAGCAGATG TATCGGAGAC ATGCGATACG ATCCCTGATC CAGCTCAACT TGCTTACCAT CTTCCTGAAG GGCTCAATGT AGTAGTTGAT AGAGATTATG AAAAATTTTT AACTGATTTA ACTAGCCTTT TCCCTCATGA AGCCAAGGGG ATAAGAAGGT TTTACAACGT CTGCTGGCAA GTATTCAATT GTCTTGATGC CATGCCACTA CTGTCTATAG AGGACCCTGC CTATTTAACA AAGGTCTTTT TTAAGGCCCC TCTGGCATGC TTAGGCTTGG CACGATGGCT ACCATTTAAT GTAGGTGATG TTGCCAAACG TTATATTAAA GATAAAAGGC TCCTTAATTT TATTGACATC GAATGCTTCT GCTGGTCAGT TATGCCAGCC GAGCGTACAC CCATGATTAA CGCAGGAATG GTGTTTTCTG ATCGACATGC TGGTGGAATT AACTATCCCA AAGGAGGCGT TGGAATTATT GCACAAAAGC TGGTAAAAGG CCTGGAAAAA AATGGTGGCA AGATTCTCTA CAAATCTCGT GTCACCAAGA TTCTGATAGA GAACAAAAAA GCTGTGGGAG TAGAAATCGC TTCAGGAGAG AAACTCTTTG CCAAGACAAT TGTCTCGAAT GCAACTCGAT GGGATACCTT TGGAGGAGAA GGCGTTAAAG AGCCTTTAAT AGACAAAGTT CATGAACCAA GTTCAGAGAA AAGATGGCGG AGTCGTTATA AGCCTTCACC TTCTTTCCTC TCGCTTCACC TTGGGGTCAG CAAATATTCC ATCCCTAATA ATTCCCATTG CCACCACTTG ATCTTAAATG AATGGAATAA GATGGAGTCC GAACAAGGCG TTGTTTTTGT TTCAATCCCA ACATTATTAG ACCCCTCTCT TGCACCTGAT GATCATCACA TCATTCACGC ATTCTCACCT TCTTCAATTG ACGAGTGGAA ACGACTAACT CCCTCGAACT ATAGAAAAAA GAAAGAAGAA GATTCTAATC ATCTAATTTC AAAGCTCGAA AACATCTTCC CTGAAATTTC CGGTAAAATT TCTCACAAGG AAGTTGGTAC GCCCAGAAGT CATCGAAGGT TTCTAGGTAG ACATAATGGA AGCTATGGGC CAATACCCTC AATGCGACTA CCTGGACTAT TGCCAATGCC TTTCAACACA ACTGGGATAA AAGGTCTCTA TTGCGTTGGC GACTCATGCT TTCCAGGGCA AGGACTAAAT GCTGTTGCCT TCAGTGGGTT TGCTTGCGCT CATAAGATAG GTGCAAGGCT AGGAATTAAC CCTTGGTCTC TTCCAGATTA A
|
Protein sequence | MTTYNQNNHW DVIVIGSGIG GLVTATQLAA KGVRVLVLES YTIPGGSSGA FSRNGYTFDV GASMIFGFGK EGYTNLLTRA LADVSETCDT IPDPAQLAYH LPEGLNVVVD RDYEKFLTDL TSLFPHEAKG IRRFYNVCWQ VFNCLDAMPL LSIEDPAYLT KVFFKAPLAC LGLARWLPFN VGDVAKRYIK DKRLLNFIDI ECFCWSVMPA ERTPMINAGM VFSDRHAGGI NYPKGGVGII AQKLVKGLEK NGGKILYKSR VTKILIENKK AVGVEIASGE KLFAKTIVSN ATRWDTFGGE GVKEPLIDKV HEPSSEKRWR SRYKPSPSFL SLHLGVSKYS IPNNSHCHHL ILNEWNKMES EQGVVFVSIP TLLDPSLAPD DHHIIHAFSP SSIDEWKRLT PSNYRKKKEE DSNHLISKLE NIFPEISGKI SHKEVGTPRS HRRFLGRHNG SYGPIPSMRL PGLLPMPFNT TGIKGLYCVG DSCFPGQGLN AVAFSGFACA HKIGARLGIN PWSLPD
|
| |