Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49924 |
Symbol | |
ID | 7198535 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 311504 |
End bp | 315142 |
Gene Length | 3639 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184777 |
Protein GI | 219129188 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTAATGGCCG GATCGGTGTC GCCGTTGTAC ACCAGCACGC GAATTTTCCC ATTGATGCTT CGGTAAAAGC CGCTCAAATC TGGTTCGGTG GGGGTGTAGT CGAAATCGCC GTCGGCATTG TCGACGGAAA AGAAAGTTGC GTTGGTCACG TGCAGGGCTT CCTTGACCAC TGGCAAGCTA AGATAATGTT GCATGACAGC GTCGCCTCCG CAGGGGTAAT CGTTCAGTGC TCCTCCGAGA AGACCACCGC TGTACGTGCA TTCATCGTAC AAGGAATATC CGTAGTAGCC ACCTACTTGA CGATCTACCT TGGCGAGGGC GGCTTGACAT ATATCGCTGT CCTTGAGAAA GCGATAATTC TCCTCATTGC TCGAAAAATC GTCCGCTGAG GACGCTGACT GAGGCGACAA AGGATGTTTG TGTGGACGGC AAGCGTGCAT GACTTCCTGA AAGGTTGCCA TGGGCATTTG ACCGTGACCC GCGAGAAATA AAATATTCCA AATATCAATC GCATCGGGAT TGGAAAGATC ACCACAGATG GAAGTTTCGG TTCCGAGACA TCCGTCACCG ACGGCCAATC CTTTCAACGG AATGTGATTT TCCGGATCTT CCAAAATTCG CCGTGCGAGC GTGGGAATGT AGATGCCAGC ATAGGATTCT CCGGTCAGAT AGAGCTCGTT GGTACCGTAG CATGGGAATT TTTCGTGAAA GGCCAAGAGC GCCAGATGAG CGTTTTCCGC CGCGAGTTCG TCCGTCCAGG CCAATCCAGC GCAGGAATGC GAATCAACGT CTTCGTTGCA GTAGCTGAAT CCTACCGGAG CGGGTTGATC GATAATCAAA ACATGTCCCA GCCTTGTCCA CGCGCACGGG TTGTAAATGG GTGTAGGAAT TCCCGTGGCT TCGTACTCGG CAGTGTGGAG TGATTCGTCG CTGAAGAGCA ACGGCCCTAG TTCCGTCAAG AGTCCGAAGA GAGAGCTCGC ACCGGGACCA CCATTGCTCC AGTAAATTAA AGGCTTGTTC TTATGGAAGT CTTCTTCAGC CTCGACGAGG ACGTAATGTG TGTGGACCGT GCGGCCCTCG AGTTCATACT CGACAAAGCC GGAATACCAC GGTGACGGCA AGGGCTGATT GAACCCCGGC AGACGCGAGA CAAAATCGGG ATTGGAGGAG CCCAAGCACG CCAACGCGAA CGCCAATTTG GACACGAAGA AGGCTACAAA TGTTGATGAA CGAAATTGCA TCATGCTCGG TTTGAATGAC TGTGAGTTGG ATCTTACGGA ACTGCAGAAA GAGAATTCGA TGCAAGCGGT CGATGGAGTT TGCGTTTTGC ACACCACGAC GAGGCAGTCA CAGTCACAGT CAAATCGGAT TTGACGGTTT CTTTAGAACC CGGATATAAG AGGGATCCGT CCGTGACGGA TAGACAGACG GAAAAGATCG ACTCGCTGTG TTTGACAGCG CAAGTGACTT GTCTTCTCGA CCCAAAACGG GAAGCACGGT TTGAAATTCG AACAAATTCC GCGGATCTTT CCGAATTTCG GGTTGGAGCG AACAGCTTCT CGGAACGTCG CGCATCGCTC GAGGAGGTAC CATTGGATGG TCCCCGGTGC CTTCGGGTTG ACACTGACAA CCGACGTCCG TGCTCTTGCG TTCCTTCCTT ACAGTCAGCA CAAGCTTCCC CATTTCCGTG AGCGTGAGGA TTGCCCCACA GGCAAAAGAC TAGCCATTCA CTTTTCAGGG TTGGGAGCCG TATCATGCCG GATTCTAGCC AACGTGACGA CGAGGCTTGT ATGCGGGAGG CGATTGCCGA AGCGGCGGCG GCGACATCCG AAGGGAAAAT GCCCTTTGGG GCCGTCTTGG CTATCGACAG TGTCATCGTC GCACGAGCTC ACAATCAGTG TCCGGCGGCT GCCAAACGAG GGGGTGGAAC GGGCGACGTC ACCCGACACG CCGAAATGGA ACTCGTTCGA CTCTTCACCA GCAAACTCAC CGCGGAAGAA CGATCCAACG CCGTCCTCTA CACCAGTACC GAGCCGTGTG AGTGCAATAC AGACGTGTCA AATGTGTCAG ATTAACGGCG TGACTAATGT TATATATTTA CTTTCTTGTC TTGCTCACGT CGGGGTCTTT GGATATCATG TTAGGTGTCA TGTGTGCGGG AGCGATTTAC TGGAGCGGTG TTTCCAAGGT TGTATACGGA TGTTCGGCGC GACAGTTAGA GGCCTTGAGC GGTCCGGGCG GCTTTGACAT ACCCGTCGAC ACGCTCTACG GAATGGCGTC GAAGGGAGCG CGACGAATGG AATGTCTTGG TCCCTTGCTG GCGGAAGAGT CCCTACAGGT TCATGTCGAT TCCGGTGTCT GGAAGAATGC ACCGGTCCCT ACTACTGCGG AATTTCCCCC CGTCACCCAG GCGGATCTAG ATATTGCGGT CGAGGCCGCC CTACTGAAAA GCGGCTTGGG GTCCGCCAAG GTCGTCGACG ACGGTGTCGT GCCGGTCATT GATCTCTCCG TTGGCACGGA CGAGCAAGTC GCCGAGAAAC TGTGGCAAGC GGCCATGGAG GTTGGTTTCT TCTGCGTGGT TGGTCACGGA ATCGACCAAT CCATTATTGA CGGAGCCTTT GGCGCCTCGG AAACTTTCTT TGCGCAGCCG CTCGAAGACA AGAAAGCACA ATCACCTCTG GATATGAGTA TCAACTCGGG TTTCGAGTAC TTTGCCCAGG TCCGCCCGAG TACCGGCGTA GCGGACCAAA AGGAATCACT ACAAATCACG GCCCGCCAAG GCTGTATGGA CGATCGCTGG CCGTCGGACG AATTCCACAA GAGTGCCGAT GCATTGCTCG AAGCATCGCA CCAGTTGGCC AAGCGAATTT TGAACCTGTT GCAACCCCAA GCGATTCCCC ATGTGGAACC GGAAACGCTG GCGAATTCGC ACACGCTTTG GGAAGAAGAC GGGCAGTGTA CCTTACGATT CTTGCATTAT CCCCCACTGG ATAGTGACAC CACGGCCAAG CTCATTGACG ATGGCTATTG GCGGGCGGGA CCGCACACCG ATTGGGACAA CGTAACGTTA CTCTATCAAC AAATGGGACA GAATGGTTTG GAATGCTGTG CCAATCCACG GACAGGCGAC CCCGCTTCCA TGTACTGGAC AGCCGTGAAT CCAGTGGAAG GAGGGATCGC CATCAACGTG GGTGATATGC TGGCACGTTG GAGCGACGGC AAGCTCTTCA GCAACCTGCA CCGCGTACGT CTACCACCCG ATGCGTCCAA ATCGAGATAT TCCATCGCAT TTTTCGCCCA GTCCGACAAG AAGGCTCTGA TTGAAAGCAA GGAGTCGGAA CCGATTACGG CTGGCGACTA CATACTTTCG CGAATTCGTA GCAACTTCGA TAAGAAGTAG ACTTCCCACC GTTAAGATCG CTTTCGTTTG TTGAATATAT ATTTTTATGA TTTTCTCATA CCAAGCTCCT TCCTTACAGT CGAACAACAC CTATGTTGAT CGCCATTGCT GCAAAGTATA GGAGTCTGAT TGGTTTGGAA GCCCAGCGCC ACAACTTCGA ACTCAAAAAG GAATTGCCGT CCGAGGGGCT GTATTATAAA TTGATGAGTC CGGAAAACAT GTTTCGAAGC TTTGCATAG
|
Protein sequence | MPDSSQRDDE ACMREAIAEA AAATSEGKMP FGAVLAIDSV IVARAHNQCP AAAKRGGGTG DVTRHAEMEL VRLFTSKLTA EERSNAVLYT STEPCVMCAG AIYWSGVSKV VYGCSARQLE ALSGPGGFDI PVDTLYGMAS KGARRMECLG PLLAEESLQV HVDSGVWKNA PVPTTAEFPP VTQADLDIAV EAALLKSGLG SAKVVDDGVV PVIDLSVGTD EQVAEKLWQA AMEVGFFCVV GHGIDQSIID GAFGASETFF AQPLEDKKAQ SPLDMSINSG FEYFAQVRPS TGVADQKESL QITARQGCMD DRWPSDEFHK SADALLEASH QLAKRILNLL QPQAIPHVEP ETLANSHTLW EEDGQCTLRF LHYPPLDSDT TAKLIDDGYW RAGPHTDWDN VTLLYQQMGQ NGLECCANPR TGDPASMYWT AVNPVEGGIA INVGDMLARW SDGKLFSNLH RVRLPPDASK SRYSIAFFAQ SDKKALIESK ESEPITAGDY ILSRIRSNFD KNRTTPMLIA IAAKYRSLIG LEAQRHNFEL KKELPSEGLY YKLMSPENMF RSFA
|
| |