Gene P9303_10011 details


Gene Information

Locus tag: P9303_10011
Symbol: crtH
ID: 4778823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 915200
End bp: 916762
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 55%
IMG OID: 640086509
Product: putative carotenoid isomerase
Protein accession: YP_001017015
Protein GI: 124022708
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02730] carotene isomerase
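
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 915200
end_bp = 916762
gene_length_bp = 1563
protein_length_aa = 520

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes the stop codon, which encodes no residue.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1563 bp is 521 codons, of which 520 encode amino acids.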


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.243247
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCTA TCCGCATCGA AAAAGACGCA ACGCAGCCTT GGGATGCTGT GGTTATCGGT 
TCGGGCATCG GCGGCCTGGT CACCGCCAGT CAGCTAGCTG TCAAAGGTGC CAAGGTTCTT
GTTCTGGAGA GCTACACAAT CCCAGGTGGT AGTAGCGGCT CATTCCAGCG AGAGGGCTAC
ACCTTTGATG TAGGTGCATC AATGATCTTC GGGTTCGGAG AAAAGGGATA CACCAACCTG
CTCACCCGAG CCTTGGCCGA TGTGGGTGAG CACTGTGACA CCCTGCCTGA CCCCGCCCAA
CTCGCCTACC ACCTTCCTGG AGGATTAGAG CTCGCAGTTG ATCGGAAATA CGAGCAGTTC
ATTGCTGATC TCACCGCACG CTTCCCCCAC GAAGCGAAGG GGATCCGTCA CTTTTACGAC
ATCTGTTGGC AAGTGTTCAA CTGCCTGGAT GCAATGCCAC TGCTATCGAT CGAGGATCCC
ACTTATCTGG CCAAAGTTTT TTTCAAGTCC CCCCTGGCCT GTCTCGGTCT TGCGCGCTGG
TTGCCCGTCA ACGTGGGTGA TGTAGCCAGG CGTCACATCA AAGACCCAGA ACTGTTGCGG
TTCATCGACA TTGAATGCTT CTGCTGGTCA GTGATGCCTG CCGACCTCAC ACCGATGATC
AATGCTGGCA TGGTGTTTTC AGATCGCCAT GCCGGTGGCA TCAACTACCC CAAAGGAGGC
GTTGGTGTGA TAGCCCAAAA GCTCGTCAAA GGCATGCAAC GTCATGGTGG TGAGATCCGC
TACAAGGCAC GTGTCACACG AGTGGTGCTA AAAGATAACC GTGCTGTTGG TGTACAGCTA
GCAAATGGAG AAATCATCCA CGCCCGCAGG GTGATATCCA ACGCCACTCG CTGGGACACA
TTCAGCGGTG AGGGGTCCAA ACAGGCCCTG GTGGATGCAG AACACACACC CGCAGCGGAG
CAGACCTGGC GACGCCGCTA TGTGCCCTCG CCATCCTTCC TCTCTCTTCA CCTTGGGGTC
CGTAACGAGG CCATTCCAGC CAACAGCCAC TGCCACCATC TGCTCCTAGA AAGCTGGGAT
GAAATGGAGA GTGAACAGGG TGTCGCATTC GTCTCGATGC CGACTCTCCT GGATCCATCC
CTGTCGCCAG AGGGGCATCA CATCGTCCAC GCCTTCACAC CATCCTCAAT GCAGGCATGG
CAAGACCTAA GCCCTGCGAC CTACAACAGC AAAAAGCAAG CTGATGCTGA TCGTCTAATC
AGGAAACTCG AAAAAATCCT GCCTGGCTTA AGTCAAGCCA TCGTTCACCG AGAGGTGGGC
ACTCCTCGTA GCCATCGACG TTTCCTAGGA CGATTTCAGG GTAGCTATGG GCCAATCCCC
TCAAGTCGTT TACCAGGCCT TCTCACCATG CCCTTTAACC GCACTGGACT CAAAGGGCTT
TATTGCGTCG GCGACTCTTG TTTCCCTGGC CAAGGCCTAA ATGCAGTTGC TTTTAGTGGC
TTTGCCTGCT CCCATCTCAT CGGTGCCGAT TTAGGGATCA ACCCCTGGGC CCTACCTAAT
TGA
 
Protein sequence
MEPIRIEKDA TQPWDAVVIG SGIGGLVTAS QLAVKGAKVL VLESYTIPGG SSGSFQREGY 
TFDVGASMIF GFGEKGYTNL LTRALADVGE HCDTLPDPAQ LAYHLPGGLE LAVDRKYEQF
IADLTARFPH EAKGIRHFYD ICWQVFNCLD AMPLLSIEDP TYLAKVFFKS PLACLGLARW
LPVNVGDVAR RHIKDPELLR FIDIECFCWS VMPADLTPMI NAGMVFSDRH AGGINYPKGG
VGVIAQKLVK GMQRHGGEIR YKARVTRVVL KDNRAVGVQL ANGEIIHARR VISNATRWDT
FSGEGSKQAL VDAEHTPAAE QTWRRRYVPS PSFLSLHLGV RNEAIPANSH CHHLLLESWD
EMESEQGVAF VSMPTLLDPS LSPEGHHIVH AFTPSSMQAW QDLSPATYNS KKQADADRLI
RKLEKILPGL SQAIVHREVG TPRSHRRFLG RFQGSYGPIP SSRLPGLLTM PFNRTGLKGL
YCVGDSCFPG QGLNAVAFSG FACSHLIGAD LGINPWALPN
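
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that spacing and translate the nucleotide sequence so it can be compared with the listed protein. It uses only the Python standard library; the helper names `join_blocks` and `translate` are hypothetical. It assumes the standard amino acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons), and it assumes the sequence contains only A, C, G and T.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    dna = join_blocks(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first two blocks of the protein sequence.
first_row = "ATGGAACCTA TCCGCATCGA AAAAGACGCA ACGCAGCCTT GGGATGCTGT GGTTATCGGT"
expected = join_blocks("MEPIRIEKDA TQPWDAVVIG")

print(translate(first_row))              # MEPIRIEKDATQPWDAVVIG
print(translate(first_row) == expected)  # True
```

Passing the full gene sequence to `translate` in the same way would be expected to reproduce the complete protein sequence, ending at the final TGA stop codon.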