Gene A9601_12211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_12211 
SymbolcrtH 
ID4717935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1038185 
End bp1039729 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content34% 
IMG OID640078937 
Productputative carotenoid isomerase 
Protein accessionYP_001009612 
Protein GI123968754 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02730] carotene isomerase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATTAA ATAAGGAAAA CTTCGATGCA ATTATTATTG GCTCAGGAAT TGGAGGATTA 
GTAACTGCTT CACAATTGGC GGCTAAGGGA GCTCAAGTAT TAGTTCTTGA AAAATATATT
ATTCCAGGCG GGAGTGGAGG CTCATTTAAG AGAAAAGGCT ATACCTTTGA CGTCGGGGCT
TCAATGATTT TTGGATTTGG AGAGAAAGGT TATACCAATT TATTAACTCG TGCTTTGAAA
GATGTGAATG AAAAATGCGA AACTATTCCC GATCCTGTTC AACTGGAATA TCACTTGCCA
AATAACTTAA ATATTTCTGT AGATAAAAAT TATGAGCAAT TTATAAGCAA ATTATCAGCT
ATTTTTCCCA AGGAAAAAAA AGGTATCAAG AAATTCTATG ATACTTGTGC AAGTGTATTT
AAATGTTTAG ATTCAATGCC TCTTTTATCA ATAGAGGATC CAAATTATCT TTTTAAAGTT
TTCTTTAAAT CTCCATTATC CTGTTTAGGG TTAGCTAGAT GGCTACCTGC AAATGCAGGA
GATGTTGCGA GAAAGTTTAT AAAAGATCCA GAACTTTTAA AATTTATTGA TATCGAATGT
TTTTGTTGGT CTGTAATGCC AGCTCTAAAA ACCCCTATGA TTAATGCGGG AATGGTATTT
ACAGATAGGC ATGCGGGGGG GATAAATTAT CCAAAAGGGG GAGTTGGAAC GATAGCAGAG
AAGTTTGTTT CTGGGATTGA AAAATTAGGA GGAAAAGTTA GATATAAAGC CAATGTGACT
GAAATCCTCT TAAAGGATGA TAAAGCGGTA GGAGTTAAGC TCTCAAATGG GGAAGAAATA
TATTCAGATA TTATTGTATC CAACTCCACT AGATGGGATA CATTTGGATT AAAAGATAAT
ACTAAAGGAT TAATTTCTAG TAAAAACGTG CCAAAAAGTG AATATAAGTG GTCAGAAACT
TATAAACCCT CACCTTCTTT TGTTTCGATT CACCTTGGAG TAGAAAAAAA TCTAATACCA
GATAATTTTA ATTGTCATCA TATAATCGTT GAAAATTGGG ATGAATTAGA AAGCGAAAAG
GGAGTTATTT TTGTTTCTAT ACCTACTTTG CTTGACTCGT CTTTGGCTCC AGAAGGTAAA
CATATTTTAC ATGCATTTAC TCCTTCATTG ATGAGTGAAT GGGAAGGCCT ATCAAGGAAA
GAATATATGC AAAAGAAAGA AAAATATTTT TCTTTTCTTG TTGAAAAAAT ATCAACTATT
CTTCCTAATC TTGAACAAAA TATTGATCAC AAAGAAATTG GTACTCCCAA AACTCATAAA
AAGTTTCTTG GAAGATATGA AGGTAGTTAT GGGCCAATTC CCAGTAAAAA GTTGCTTGGA
CTTTTGCCAA TGCCTTTCAA CACTACAAAA ATTCAAAACC TATATTGTGT AGGGGATTCT
TGCTTCCCTG GCCAAGGCCT AAATGCAGTT GCTTTTAGTG GATACGCATG CGCTCACAAA
ATAGGTGCAA AGTTAAACAT AAACAGTTTT AAATTGCCCG ATTAA
 
Protein sequence
MELNKENFDA IIIGSGIGGL VTASQLAAKG AQVLVLEKYI IPGGSGGSFK RKGYTFDVGA 
SMIFGFGEKG YTNLLTRALK DVNEKCETIP DPVQLEYHLP NNLNISVDKN YEQFISKLSA
IFPKEKKGIK KFYDTCASVF KCLDSMPLLS IEDPNYLFKV FFKSPLSCLG LARWLPANAG
DVARKFIKDP ELLKFIDIEC FCWSVMPALK TPMINAGMVF TDRHAGGINY PKGGVGTIAE
KFVSGIEKLG GKVRYKANVT EILLKDDKAV GVKLSNGEEI YSDIIVSNST RWDTFGLKDN
TKGLISSKNV PKSEYKWSET YKPSPSFVSI HLGVEKNLIP DNFNCHHIIV ENWDELESEK
GVIFVSIPTL LDSSLAPEGK HILHAFTPSL MSEWEGLSRK EYMQKKEKYF SFLVEKISTI
LPNLEQNIDH KEIGTPKTHK KFLGRYEGSY GPIPSKKLLG LLPMPFNTTK IQNLYCVGDS
CFPGQGLNAV AFSGYACAHK IGAKLNINSF KLPD