Gene P9211_05841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_05841 
SymbolcrtH 
ID5731087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp543977 
End bp545527 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content43% 
IMG OID641284944 
Productputative carotenoid isomerase 
Protein accessionYP_001550469 
Protein GI159903125 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02730] carotene isomerase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.515676 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCT ATAACCAAAA CAATCACTGG GATGTAATAG TAATTGGCTC TGGGATAGGA 
GGACTCGTTA CTGCAACCCA ACTAGCCGCA AAAGGTGTAA GAGTATTAGT TTTAGAGAGT
TACACAATTC CTGGCGGCAG TAGTGGCGCG TTCTCTAGAA ATGGCTACAC ATTTGATGTA
GGGGCTTCAA TGATTTTTGG TTTTGGGAAG GAGGGATATA CAAATCTTTT AACTCGCGCG
CTAGCAGATG TATCGGAGAC ATGCGATACG ATCCCTGATC CAGCTCAACT TGCTTACCAT
CTTCCTGAAG GGCTCAATGT AGTAGTTGAT AGAGATTATG AAAAATTTTT AACTGATTTA
ACTAGCCTTT TCCCTCATGA AGCCAAGGGG ATAAGAAGGT TTTACAACGT CTGCTGGCAA
GTATTCAATT GTCTTGATGC CATGCCACTA CTGTCTATAG AGGACCCTGC CTATTTAACA
AAGGTCTTTT TTAAGGCCCC TCTGGCATGC TTAGGCTTGG CACGATGGCT ACCATTTAAT
GTAGGTGATG TTGCCAAACG TTATATTAAA GATAAAAGGC TCCTTAATTT TATTGACATC
GAATGCTTCT GCTGGTCAGT TATGCCAGCC GAGCGTACAC CCATGATTAA CGCAGGAATG
GTGTTTTCTG ATCGACATGC TGGTGGAATT AACTATCCCA AAGGAGGCGT TGGAATTATT
GCACAAAAGC TGGTAAAAGG CCTGGAAAAA AATGGTGGCA AGATTCTCTA CAAATCTCGT
GTCACCAAGA TTCTGATAGA GAACAAAAAA GCTGTGGGAG TAGAAATCGC TTCAGGAGAG
AAACTCTTTG CCAAGACAAT TGTCTCGAAT GCAACTCGAT GGGATACCTT TGGAGGAGAA
GGCGTTAAAG AGCCTTTAAT AGACAAAGTT CATGAACCAA GTTCAGAGAA AAGATGGCGG
AGTCGTTATA AGCCTTCACC TTCTTTCCTC TCGCTTCACC TTGGGGTCAG CAAATATTCC
ATCCCTAATA ATTCCCATTG CCACCACTTG ATCTTAAATG AATGGAATAA GATGGAGTCC
GAACAAGGCG TTGTTTTTGT TTCAATCCCA ACATTATTAG ACCCCTCTCT TGCACCTGAT
GATCATCACA TCATTCACGC ATTCTCACCT TCTTCAATTG ACGAGTGGAA ACGACTAACT
CCCTCGAACT ATAGAAAAAA GAAAGAAGAA GATTCTAATC ATCTAATTTC AAAGCTCGAA
AACATCTTCC CTGAAATTTC CGGTAAAATT TCTCACAAGG AAGTTGGTAC GCCCAGAAGT
CATCGAAGGT TTCTAGGTAG ACATAATGGA AGCTATGGGC CAATACCCTC AATGCGACTA
CCTGGACTAT TGCCAATGCC TTTCAACACA ACTGGGATAA AAGGTCTCTA TTGCGTTGGC
GACTCATGCT TTCCAGGGCA AGGACTAAAT GCTGTTGCCT TCAGTGGGTT TGCTTGCGCT
CATAAGATAG GTGCAAGGCT AGGAATTAAC CCTTGGTCTC TTCCAGATTA A
 
Protein sequence
MTTYNQNNHW DVIVIGSGIG GLVTATQLAA KGVRVLVLES YTIPGGSSGA FSRNGYTFDV 
GASMIFGFGK EGYTNLLTRA LADVSETCDT IPDPAQLAYH LPEGLNVVVD RDYEKFLTDL
TSLFPHEAKG IRRFYNVCWQ VFNCLDAMPL LSIEDPAYLT KVFFKAPLAC LGLARWLPFN
VGDVAKRYIK DKRLLNFIDI ECFCWSVMPA ERTPMINAGM VFSDRHAGGI NYPKGGVGII
AQKLVKGLEK NGGKILYKSR VTKILIENKK AVGVEIASGE KLFAKTIVSN ATRWDTFGGE
GVKEPLIDKV HEPSSEKRWR SRYKPSPSFL SLHLGVSKYS IPNNSHCHHL ILNEWNKMES
EQGVVFVSIP TLLDPSLAPD DHHIIHAFSP SSIDEWKRLT PSNYRKKKEE DSNHLISKLE
NIFPEISGKI SHKEVGTPRS HRRFLGRHNG SYGPIPSMRL PGLLPMPFNT TGIKGLYCVG
DSCFPGQGLN AVAFSGFACA HKIGARLGIN PWSLPD