Gene Information |
Locus tag | NATL1_14681 |
Symbol | crtH |
ID | 4780674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1175222 |
End bp | 1176784 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 640084749 |
Product | putative carotenoid isomerase |
Protein accession | YP_001015290 |
Protein GI | 124026174 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02730] carotene isomerase |
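The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS, excluding one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1175222, 1176784)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```

Under these assumptions the computed values agree with the Gene Length (1563 bp) and Protein Length (520 aa) fields.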
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.314761 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.880307 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAT CATCAAATAA AGCCAACGAT TCTTGCAAGT GGGATTCAAT AGTTATTGGT TCCGGTCTGG GTGGATTAGT AACTGCAAGT CAACTAGCAA GCAAAGGTGC AAAGGTGCTT GTTCTAGAAC AGTATAAGAT TCCGGGTGGA AGTGGTGGAT CATTTAAAAG AAAAGGATTT ACTTTTGATG TTGGAGCATC AATGATCTTT GGGTTTGGTG ATAAGGGATA CACAAATTTA CTAACGAGAG CTCTTAAAGA TGTTGGACAA AAATGTGAAA CGATTCCTGA CCCAACACAA TTGGCATATC ATCTACCCAA TCAATTAGAA ATTTCAGTTG ATAGAGATTA TCAAAAATTT ATTACCAAGC TAATAAATTT ATTTCCGCAT GAAGAGAAAG GAATAAAACA TTTTTACGAA ACATGTTGGG ATGTATTTAA TTGTCTAGAT TCAATGCCTT TACTTTCAAT AGAGGATCCT GCGTATCTAA TGAAAGTTTT TTTTAAATCA CCTTTATCAT GTCTTGGACT GGCTAGGTGG TTGCCAGTTA ATGCTGGTGA TGTTGCTAGG CGATATATTA AAGATCCTGA TCTTTTAAGG TTTATAGATA TTGAATGTTT CTGCTGGTCA GTTATGCCTG CCAACTTGAC TCCTATGATA AATGCAGGAA TGGTCTTTTC TGATAGACAT TATGGAGGTA TTAACTACCC CAAAGGAGGT GTAGGAATTA TTGCAGAAAA GCTAGTGAAG GGAATTAAAA ATCTGGGGGG GGAAGTTAGA TATAAATCCA GAGTAAAAAA AATAATTTTG GAAAACGGAA ATGCCATAGG TGTTTCACTT GAAAATGATG AAGAAATTTT CAGTAAAACA GTGGTCTCTA ATGCTACAAG ATGGGATACT TTTGGTGGTC AAGGGATTAA GAGTCCACTT ATACCAACAG AGAACAAACC AACGTCTGAG AAAAAATGGG AAAACAGATA TATACCTTCC CCTTCATTCT TATCTCTACA TTTGGGAGTG AATGAAGACT CTATACATGC TAACACTCAT TGCCATCATC TACTTCTCGA CAAATGGGAA GAAATGGAGA AAGAACAGGG AGTTACTTTC GTATCAATAC CAACTCTCTT AGATCCCACA TTGGCACCTT CAGGTAGCCA TATTGTTCAT GCTTTTACTC CTTCTTCAAT AGATAACTGG GATGGATTAA GCAATAAAGA ATATCTTACA AAGAAAAAAG AAGATGGTGA TAAACTTATT TCAAAGTTAG AGAGATTATT TCCTAACCTA AGTCAAAATA TTCTTCACAA AGAAATAGGA AGTCCAAGAA CTCATAAAAG ATTCCTTGCA AGAAACAAAG GAAGTTATGG ACCAATTCCA TCAATGAGAT TGCCGGGCCT TCTCCCCATG CCATTCAACA CTACTAAAAT TAATGGACTT TATTGTGTTG GGGACTCGTG CTTCCCCGGT CAAGGTTTGA ATGCCGTGGC TTTCAGTGGT TATGCCTGTG CTCACAAGAT AGGCACAAAG CTTGGTATGA ACAAATGGGA GCTACCAGAA TAA
|
Protein sequence | MTQSSNKAND SCKWDSIVIG SGLGGLVTAS QLASKGAKVL VLEQYKIPGG SGGSFKRKGF TFDVGASMIF GFGDKGYTNL LTRALKDVGQ KCETIPDPTQ LAYHLPNQLE ISVDRDYQKF ITKLINLFPH EEKGIKHFYE TCWDVFNCLD SMPLLSIEDP AYLMKVFFKS PLSCLGLARW LPVNAGDVAR RYIKDPDLLR FIDIECFCWS VMPANLTPMI NAGMVFSDRH YGGINYPKGG VGIIAEKLVK GIKNLGGEVR YKSRVKKIIL ENGNAIGVSL ENDEEIFSKT VVSNATRWDT FGGQGIKSPL IPTENKPTSE KKWENRYIPS PSFLSLHLGV NEDSIHANTH CHHLLLDKWE EMEKEQGVTF VSIPTLLDPT LAPSGSHIVH AFTPSSIDNW DGLSNKEYLT KKKEDGDKLI SKLERLFPNL SQNILHKEIG SPRTHKRFLA RNKGSYGPIP SMRLPGLLPM PFNTTKINGL YCVGDSCFPG QGLNAVAFSG YACAHKIGTK LGMNKWELPE
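The gene and protein sequences above are printed in blocks of 10 characters, so the spacing has to be removed before the two can be compared. The sketch below strips the whitespace and translates a CDS codon by codon. It is only an illustration: `clean_sequence` and `translate_cds` are hypothetical helper names, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Standard codon-to-amino-acid assignment in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def translate_cds(raw_dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = clean_sequence(raw_dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGACTCAAT CATCAAATAA AGCCAACGAT"
print(translate_cds(first_blocks))  # MTQSSNKAND
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate_cds` and comparing the result with `clean_sequence` applied to the protein sequence would extend the same check to the whole record.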
|
| |
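The GC content field can in principle be recomputed from the gene sequence once the block spacing is removed. The following is a minimal sketch of that calculation. `gc_percent` is a hypothetical helper name, and how the database rounds the reported percentage is not stated in the record, so the result is left unrounded here.

```python
def gc_percent(raw_dna: str) -> float:
    """Percentage of G and C bases in a sequence, ignoring whitespace."""
    dna = "".join(raw_dna.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two blocks of the gene sequence from the record above.
print(gc_percent("ATGACTCAAT CATCAAATAA"))  # 25.0
```

Applying the function to the complete gene sequence gives the value to compare against the GC content field.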