Gene Information |
Locus tag | P9303_10011 |
Symbol | crtH |
ID | 4778823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 915200 |
End bp | 916762 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640086509 |
Product | putative carotenoid isomerase |
Protein accession | YP_001017015 |
Protein GI | 124022708 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02730] carotene isomerase |
|
|
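The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table.
start_bp, end_bp = 915200, 916762

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1563
print(protein_length(length_bp))  # 520
```

Under these assumptions the computed values agree with the reported Gene Length (1563 bp) and Protein Length (520 aa).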
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.243247 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACCTA TCCGCATCGA AAAAGACGCA ACGCAGCCTT GGGATGCTGT GGTTATCGGT TCGGGCATCG GCGGCCTGGT CACCGCCAGT CAGCTAGCTG TCAAAGGTGC CAAGGTTCTT GTTCTGGAGA GCTACACAAT CCCAGGTGGT AGTAGCGGCT CATTCCAGCG AGAGGGCTAC ACCTTTGATG TAGGTGCATC AATGATCTTC GGGTTCGGAG AAAAGGGATA CACCAACCTG CTCACCCGAG CCTTGGCCGA TGTGGGTGAG CACTGTGACA CCCTGCCTGA CCCCGCCCAA CTCGCCTACC ACCTTCCTGG AGGATTAGAG CTCGCAGTTG ATCGGAAATA CGAGCAGTTC ATTGCTGATC TCACCGCACG CTTCCCCCAC GAAGCGAAGG GGATCCGTCA CTTTTACGAC ATCTGTTGGC AAGTGTTCAA CTGCCTGGAT GCAATGCCAC TGCTATCGAT CGAGGATCCC ACTTATCTGG CCAAAGTTTT TTTCAAGTCC CCCCTGGCCT GTCTCGGTCT TGCGCGCTGG TTGCCCGTCA ACGTGGGTGA TGTAGCCAGG CGTCACATCA AAGACCCAGA ACTGTTGCGG TTCATCGACA TTGAATGCTT CTGCTGGTCA GTGATGCCTG CCGACCTCAC ACCGATGATC AATGCTGGCA TGGTGTTTTC AGATCGCCAT GCCGGTGGCA TCAACTACCC CAAAGGAGGC GTTGGTGTGA TAGCCCAAAA GCTCGTCAAA GGCATGCAAC GTCATGGTGG TGAGATCCGC TACAAGGCAC GTGTCACACG AGTGGTGCTA AAAGATAACC GTGCTGTTGG TGTACAGCTA GCAAATGGAG AAATCATCCA CGCCCGCAGG GTGATATCCA ACGCCACTCG CTGGGACACA TTCAGCGGTG AGGGGTCCAA ACAGGCCCTG GTGGATGCAG AACACACACC CGCAGCGGAG CAGACCTGGC GACGCCGCTA TGTGCCCTCG CCATCCTTCC TCTCTCTTCA CCTTGGGGTC CGTAACGAGG CCATTCCAGC CAACAGCCAC TGCCACCATC TGCTCCTAGA AAGCTGGGAT GAAATGGAGA GTGAACAGGG TGTCGCATTC GTCTCGATGC CGACTCTCCT GGATCCATCC CTGTCGCCAG AGGGGCATCA CATCGTCCAC GCCTTCACAC CATCCTCAAT GCAGGCATGG CAAGACCTAA GCCCTGCGAC CTACAACAGC AAAAAGCAAG CTGATGCTGA TCGTCTAATC AGGAAACTCG AAAAAATCCT GCCTGGCTTA AGTCAAGCCA TCGTTCACCG AGAGGTGGGC ACTCCTCGTA GCCATCGACG TTTCCTAGGA CGATTTCAGG GTAGCTATGG GCCAATCCCC TCAAGTCGTT TACCAGGCCT TCTCACCATG CCCTTTAACC GCACTGGACT CAAAGGGCTT TATTGCGTCG GCGACTCTTG TTTCCCTGGC CAAGGCCTAA ATGCAGTTGC TTTTAGTGGC TTTGCCTGCT CCCATCTCAT CGGTGCCGAT TTAGGGATCA ACCCCTGGGC CCTACCTAAT TGA
|
Protein sequence | MEPIRIEKDA TQPWDAVVIG SGIGGLVTAS QLAVKGAKVL VLESYTIPGG SSGSFQREGY TFDVGASMIF GFGEKGYTNL LTRALADVGE HCDTLPDPAQ LAYHLPGGLE LAVDRKYEQF IADLTARFPH EAKGIRHFYD ICWQVFNCLD AMPLLSIEDP TYLAKVFFKS PLACLGLARW LPVNVGDVAR RHIKDPELLR FIDIECFCWS VMPADLTPMI NAGMVFSDRH AGGINYPKGG VGVIAQKLVK GMQRHGGEIR YKARVTRVVL KDNRAVGVQL ANGEIIHARR VISNATRWDT FSGEGSKQAL VDAEHTPAAE QTWRRRYVPS PSFLSLHLGV RNEAIPANSH CHHLLLESWD EMESEQGVAF VSMPTLLDPS LSPEGHHIVH AFTPSSMQAW QDLSPATYNS KKQADADRLI RKLEKILPGL SQAIVHREVG TPRSHRRFLG RFQGSYGPIP SSRLPGLLTM PFNRTGLKGL YCVGDSCFPG QGLNAVAFSG FACSHLIGAD LGINPWALPN
|
| |
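The gene and protein sequences above can be spot-checked against each other by translating a few codons. This is a minimal sketch using only the Python standard library; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the listed gene sequence is already the coding strand, which is consistent with its leading ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGAACCTA TCCGCATCGA AAAAGACGCA"))  # MEPIRIEKDA

# Last three codons of the gene sequence, ending in the TGA stop codon.
print(translate("CCT AAT TGA"))  # PN*
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final two residues followed by the stop codon.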