Gene Information |
Locus tag | PHATR_43904 |
Symbol | |
ID | 7204310 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 387855 |
End bp | 390089 |
Gene Length | 2235 bp |
Protein Length | 696 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | phytoene desaturase. zeta-carotene desaturase. bacterial-like protein |
Protein accession | XP_002186335 |
Protein GI | 219113503 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.476322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATCAC GTGTTTGTCA CTTTCGCTTC CGATGGGATC CTTCGGGAGA TCCATCCAAA ATTCGTAATT TGGTGATGCG GAGCCGCCAA TCCAGTCGGC GTCCAGGGCA CGCGCACGAA AAGGGATCGG CGAGAGTGAC CCAAACGGCC GGATTGTTTG CCATGCGCGA TGCACTGGGT ATCTTGGCGC TAGCATGGTG CAGCCGGAGT AGGAATTCGT GCTCGTCGTC TATTGCCGTC GGCTCCTTTG CGAGTCCCCT TTCTCCGCCT CCCGGGAGGA ACCGAGCCAC GCAGCGGCAA CTAGTGGTGG CCCACCAGAC CAATTCCCAA CAATCTGTCG TTGGGGGTAG CACAAGAAAA TCACGCCGTG TTGTCGTCGT GGGTGGGGGA GTCGGTGGGT TGGCTACGGC GGCTCGTGTT GCGGTAGCTT CGCCGTCTAC GGAAGTCATC ATTTTGGAAA AGAACGACTT TGTTGGAGGC CGTTGTGGGT CATTCAACGT CGATGTGCCC GGAAAAGGAA CATTTCGTCA CGAACGAGGA CCCAGCCTGC TACTTTTGCC TCATGTCTAC CGGGAAATTT TCGAATATTG TGGGAGTACA CCGGAAGAGA ATGGGTTGGA AATGAAACAA TGCACTCCGG CATACCAGGT AGTCTTTTCG GACGGAGACT GTATTCAACT GGGTTTTCCC CGCGCATCAG ACAAGGTTGA AAGATCCGAA GAAGAATTGA AGTCGCGGAG GAAAATGGAC ACTTTTGAAA GGAATGGAGC CGCAAAATGG GATGCCTACA TGAAGACCTG TGAAGCGTTC CTTGATTGTG GGCTACCAAA CTTTATCGAG CAGCGTCTTG ACATGAAAAG TTTCCCTGCT TTTATTCGAG AAGCACTCAG GGACTTTGGT AAAGGGTGGC CGTTAAAACC CCATTCGGAC GTTTTGGATT CTTTCTTCGA GTCCGACAAG ATGCGTGCCA TGGCTTCCTT CCAAGATTTG TACGTCGGGC TTGAGCCCTA TCGCAACAAT CAACTACTTG GGGGAGGCGT TCTGACTACT ACTGCACCTG CCGTATTTGG TCTTTTGGCT GCGATTGAGC TGCATCCCAC GAATGCAAAG TGTGGTGTGT ATGCTCCAAT CGGAGGCTTT CAGGCCGTTT CTTTGGCATT GGAAAAGCTG GCAAGATCGT GCGGAGTAAA GACTCTTACT GGTAGAGCTG TATCTGAGGT AACCGACGAA GGGGTCTTCT ATGTAGAAAC ACAAGCAAAT GCTACAGAGA CAAAGTTTCT TTCTGCTGAC CTGGTGGTGG TCAATGCGGA TTTCCCATAC GCAACGCAAA GTCTTATTTC AAAATCGGAA ACCAGGTACA GCCCCCGGTT TGACTGGGAC GACCGATTTC TCTACAGCAG CGGCGTTATT GCATTTCATT GGTCGGTCGG TAGATCTCTG GATGATCTTT ATACGCACAA CGTCTTCATG GTGAGTGGTA GTCGCAAGAA GGTTGAAGAA AGCTGGAGTG TTCTGCGAGA TGCTAGTACA TCCTCCTTTG ATTTTGAGGA TGCCGAGTTT AACTTTTACG TTCATCGACC AATGGCGACA GATTCCACGG CTGCTCCCCA GGTAAGACGA TGAAAACTCG AAGAATGTCA AGAAATGTAT GCCTTTAATT TTTTGTTGAA TTTTTGCCAG GGGCTCGATG CGATTCTGAT TCTCGTTCCA TGTCCAACAC TTCTACGTGA CCCTGCCAAT ATGCATTTGA GTCGGGATGA GTCGATTGCA GCGTATAAAA ATCAGATTAG TGAAGGAAGT 
ATCAACATTT TACGGCGCAA AGTCATCCAA CGCCTGACGG CAATTCCTTC CTTGGCCACC TTTGGCTCGG ACGATATTCT GGACGAGCAC GTGGACACAC CCGGCACGTA CGCCGATCGG TTTCATTGTG GGGCAGGGTC CCCTTTCGCG CTAAGCCACG GATTCAAACA ACTCAGCCTC GCTAGACCTG GTCCTACGTC CCAGAAACAC CATAAGCTTA TCTTCACAGG AGCAAGTGTT CGACCGGGAA ACGGGGTTCC TTTGGTGCTG ATTGGTGCCA AGGGGGCTGA AGAGGCCATT TTGTCCAAGC TGAGGGATAC ATCAAAGGAT GTTCAGTGGG AAAGCAAAAA TGCCTGGTAG GAAAAACTAT TATATGAGCA GCCGTGTTTG GGCCACTATC TTCTAGATAA ACCCGGCAAA TAGCTCTGAT GTAAA
|
Protein sequence | MQSRVCHFRF RWDPSGDPSK IRNLVMRSRQ SSRRPGHAHE KGSARVTQTA GLFAMRDALG ILALAWCSRS RNSCSSSIAV GSFASPLSPP PGRNRATQRQ LVVAHQTNSQ QSVVGGSTRK SRRVVVVGGG VGGLATAARV AVASPSTEVI ILEKNDFVGG RCGSFNVDVP GKGTFRHERG PSLLLLPHVY REIFEYCGST PEENGLEMKQ CTPAYQVVFS DGDCIQLGFP RASDKVERSE EELKSRRKMD TFERNGAAKW DAYMKTCEAF LDCGLPNFIE QRLDMKSFPA FIREALRDFG KGWPLKPHSD VLDSFFESDK MRAMASFQDL YVGLEPYRNN QLLGGGVLTT TAPAVFGLLA AIELHPTNAK CGVYAPIGGF QAVSLALEKL ARSCGVKTLT GRAVSEVTDE GVFYVETQAN ATETKFLSAD LVVVNADFPY ATQSLISKSE TRYSPRFDWD DRFLYSSGVI AFHWSVGRSL DDLYTHNVFM VSGSRKKVEE SWSVLRDAST SSFDFEDAEF NFYVHRPMAT DSTAAPQGLD AILILVPCPT LLRDPANMHL SRDESIAAYK NQISEGSINI LRRKVIQRLT AIPSLATFGS DDILDEHVDT PGTYADRFHC GAGSPFALSH GFKQLSLARP GPTSQKHHKL IFTGASVRPG NGVPLVLIGA KGAEEAILSK LRDTSKDVQW ESKNAW
|
| |
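The coordinate and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates (an assumption that agrees with the listed Gene Length of 2235 bp) and that the spaces in the sequence fields only separate 10-character display blocks.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def join_blocks(seq: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(seq.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 if empty)."""
    clean = join_blocks(seq).upper()
    if not clean:
        return 0.0
    return (clean.count("G") + clean.count("C")) / len(clean)


# Coordinates taken from the Gene Information table.
print(gene_length(387855, 390089))  # 2235, matching the Gene Length field

# First two display blocks of the Gene sequence field, used only as a small demo.
print(join_blocks("ATGCAATCAC GTGTTTGTCA"))
print(gc_fraction("ATGCAATCAC GTGTTTGTCA"))
```

Passing the full Gene sequence field to `gc_fraction` gives a value that can be compared with the listed GC content. Because the gene is marked as spliced, the nucleotide length is not expected to equal three times the protein length.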