Gene P9301_12221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_12221 
SymbolcrtH 
ID4911343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1037032 
End bp1038576 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content34% 
IMG OID640160809 
Productputative carotenoid isomerase 
Protein accessionYP_001091446 
Protein GI126696560 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02730] carotene isomerase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAA ATAAGGAAAA CTTCGATGCA ATTATTATTG GCTCAGGAAT AGGAGGGTTA 
GTAACTGCTT CACAATTGGC AGCGAAAGGA GCTCAAGTAT TAGTTCTTGA AAAATATATT
ATTCCAGGAG GGAGTGGAGG CTCTTTTAAA AGAAAAGGCT ATACCTTTGA TGTAGGAGCT
TCAATGATTT TTGGATTTGG AGAAAAAGGT TATACCAATT TATTAACTCG TGCATTAAAG
GATGTAAACG AAAAATGCGA AACTATTCCG GATCCTGTTC AACTGGAATA TCACTTACCA
CATAACTTTA ATATTTCAGT AGATAAAAAT TATGAGCAAT TTATAAGCAA ATTATCAGCT
AGTTTCCCCA ATGAAAAAAA AGGTATCAAG AAATTCTATG ATACTTGTGC CAGTGTATTT
AAATGTTTAG ATTCAATGCC TCTTTTATCA ATAGAGGATC CAAGTTATCT TTTTAAAGTC
TTCTTTAAAT CCCCATTATC CTGTTTAGGG TTAGCTAGAT GGTTACCTGC AAATGCAGGA
GATGTTGCGA GAAAGTTTAT AAAAGATCCT GAACTTTTAA AATTTATCGA TATCGAATGT
TTTTGTTGGT CTGTAATGCC AGCTCTAAAA ACCCCTATGA TTAATGCAGG GATGGTATTT
ACAGATAGGC ATGCTGGGGG TATAAATTAT CCAAAAGGGG GAGTTGGAAC GATAGCAGAG
AAGTTTGTTT CTGGTATTGA AAAATTAGGA GGAAAAGTAA GATATAAAGC CAACGTGACT
GAAATCCTCT TAAAGGATGA GAAGGCGGTA GGAGTTAAGC TCTCAAATGG AGAAGAGATA
TATTCAAATA TTATTGTATC CAACTCTACT AGATGGGATA CATTTGGATT GAAAGATAAT
ACTAAAGGAT TAATTTCAAG TGAAAACGTG CCAAAAAGTG AATATAAGTG GTCAGAAACT
TATAAACCCT CACCTTCTTT TGTTTCGATT CACCTTGGAG TAGAAAAAAA TCTAATACCC
GACAATTTTA ATTGTCATCA TATAATAGTT AAAAATTGGG ATGAATTAGA AAGCGAAAAG
GGAGTTATTT TTGTTTCTAT ACCTACTTTG CTTGACTCGT CTTTGGCTCC TGAAGGTAAA
CATATCGTGC ATGCATTTAC TCCTTCATCG ATTAGTGAAT GGGAAGGCCT AACAAGGAAA
GAATATTTGC AAAAGAAAGA TAAATATTTT TCTTTCCTTG TTGAAAAAAT ATCAACTATT
CTTCCTAATC TTGAACAAAA TATTGACCAC AAAGAAATTG GTACTCCCAA AACTCATAAA
AAGTTTCTTG GAAGATTTGA AGGTAGTTAT GGGCCAATTC CCAGTAAAAA GTTGCTTGGA
CTTTTGCCAA TGCCTTTCAA CACTACAAAA ATTCAAAACC TTTATTGTGT AGGGGATTCA
TGTTTCCCTG GCCAAGGCCT AAATGCCGTT GCTTTTAGTG GATACGCATG CGCTCATAAA
ATAGGTGCAA AGTTAAACTT AAACAGTTTT AAATTGCCAG ATTAA
 
Protein sequence
MELNKENFDA IIIGSGIGGL VTASQLAAKG AQVLVLEKYI IPGGSGGSFK RKGYTFDVGA 
SMIFGFGEKG YTNLLTRALK DVNEKCETIP DPVQLEYHLP HNFNISVDKN YEQFISKLSA
SFPNEKKGIK KFYDTCASVF KCLDSMPLLS IEDPSYLFKV FFKSPLSCLG LARWLPANAG
DVARKFIKDP ELLKFIDIEC FCWSVMPALK TPMINAGMVF TDRHAGGINY PKGGVGTIAE
KFVSGIEKLG GKVRYKANVT EILLKDEKAV GVKLSNGEEI YSNIIVSNST RWDTFGLKDN
TKGLISSENV PKSEYKWSET YKPSPSFVSI HLGVEKNLIP DNFNCHHIIV KNWDELESEK
GVIFVSIPTL LDSSLAPEGK HIVHAFTPSS ISEWEGLTRK EYLQKKDKYF SFLVEKISTI
LPNLEQNIDH KEIGTPKTHK KFLGRFEGSY GPIPSKKLLG LLPMPFNTTK IQNLYCVGDS
CFPGQGLNAV AFSGYACAHK IGAKLNLNSF KLPD