Gene Information |
Locus tag | PHATRDRAFT_38757 |
Symbol | |
ID | 7203745 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 133754 |
End bp | 135490 |
Gene Length | 1737 bp |
Protein Length | 559 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182778 |
Protein GI | 219125000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
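As a consistency check on the coordinates above, the following minimal sketch recomputes the gene length from the start and end positions. It assumes the coordinates are 1-based and inclusive. It also assumes the gene span runs from the start codon through the stop codon, so that the bases not accounted for by the protein can be read as spliced-out sequence. The variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 133754
end_bp = 135490
protein_length_aa = 559


def gene_length(start: int, end: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


length_bp = gene_length(start_bp, end_bp)

# Assumption: the coding sequence is the protein plus one stop codon, 3 bp per codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span not explained by the coding sequence.
# For a spliced gene this would correspond to intronic sequence, under the assumptions above.
non_coding_bp = length_bp - coding_bp

print(length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the computed length matches the listed Gene Length of 1737 bp, and the remainder is consistent with the record marking the gene as spliced.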
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0269601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGTCC CTAGTAGCGA CAACGGACTT GCTACTTTCT CACAATTCCC CGTCACGAAG AAGAACGGCA GGCATTCTAG AAACCCAGCT ACAGGTACCC CGTTAAGAGT TGATCCGGAT GACGTATCCA ATGACGCGAC CTCTGCTCAC TTCACAGCTC ACTGTCATAT ACAATCAAAG ACATTGAAAA GAGCCCTCGT CTTCAGTCGT AGGCGTACTT TAGATATCTT TTTCATTGGT TGCCTCGCTA ACGCCTTCTT TTGCATCATA ATTGTATTCG GCATACTGAG GCAGTCTGGT TTAACTGCTA CGACGACGAC CCTCGAAGAC GAAAGAATAG GGGCCTTACG CATGCCGTTT TTAATCAAGA ACCAGACTTT GCAGAAAGAG TTGTACAATG CTTCGATGGC TGTAGATTCT GGGCCCCTTG CTGTTGATAC CGTCGTAGAA TCGTTTGTAA TCAGTAGCGG CCGTACATCT GACCGAGCCT TCAACCAGAT TATGCAGGTC ACCGAAAGGT TCTCGGCCTG CATCCTGTTT ATGGACGACA ATCCCCGTCT TGTGGAGTGG CTAGCCTACC ATTTCTTTGC CTTAAACTTG CGCGAAGTTG TCGTGGCCGT CGATCCCCGA AGCAAGTCGA GTCCGTGGCA GTCTCTAGAA CGCTGGACGC CGTACATGAA TATCACTGTC TGGAACGACA CCGACTTCGG CTTTGTGGTT GACCAATATA TTACGGTCAA CGGAACTCGC AAGCAAAAAA TTGATGTACA TCGTGGGAGA CAAAAATTCT TTTACGGAAA ATGTATAAAG TATCTACAAG GACGAAATCG GACATGGACC GCATTCCACG ACATCGATGA GTATATCACT GTTGACGAGC GGGTCGTTTT CGATGCCAAA GAGCGTAGTT CCAAGCCTGG AAGCGTCTTG CAGATGCTTC AAGAGGTGAA GAGCATGAAA CCTGTTCCTG ACGGTTGGAC CGAGAGCTGT GTCCCCGTCC CACGGTGTCG TTTCTCAGCT GTGGAAAGCC AACCAGAAGA GGTAAGCCTG GAGGTCCCTC CTCTTATTGA TGCAAAACAG CTCGAAACTC TACGCTGGAG GTATCGCTCT TTAAAAGGTC GAGATGGTCA GCCAAAATCT ATTGTTGATA TTTCCGAAGT CACGCTGCAC AGAAATACTA AGTTCGGCCC TCATGCAGTT ATCCTTGGAA TTTGTCCCCC GCATCTTTTT GATCGCAGTT TTTTAGTGAT AAATCACTAC TTGGGCGACT GGGATATGTA GTAAGTGTCT GTGTTGGTAC AATAATGATT CAAAGTTTGT TTGTGGAGCG TGTACTCTTT CTACTCCCGC AACCTTGTAA AATCTTGCGT TCTAACAGTA CGTCTTGTTT ACCATGAAAG TTCGTTTCGC GATGATTGTC GCATAGGCAG CATGAAAAAC AGAGAGGCTT GGGAATTCCG ATCCAGCGAG AGTGAAGGCG GTACAACCGA CCAGATCCGA CCTTGGATTG GTGGGTTTGT AGCAGCCATG GGCGAAGAAC GCGCGTTGCA GTTGTTGAAA GATGTTGGAC TGCCAAAGAA TTACACGAAC CCTTACAACA AAACGGAATG GAGGATTGAA CAAAGCACGT TGGATGCCTT GTTGAAAAAG CGCCCGAGAA GGGCTACGAA CTACGTCAAG TTTCTTGAGC AGCGGATTAG GCAATCCAAT ATAAATCATT CGACTGATTA TAATTAG
|
Protein sequence | MEVPSSDNGL ATFSQFPVTK KNGRHSRNPA TGTPLRVDPD DVSNDATSAH FTAHCHIQSK TLKRALVFSR RRTLDIFFIG CLANAFFCII IVFGILRQSG LTATTTTLED ERIGALRMPF LIKNQTLQKE LYNASMAVDS GPLAVDTVVE SFVISSGRTS DRAFNQIMQV TERFSACILF MDDNPRLVEW LAYHFFALNL REVVVAVDPR SKSSPWQSLE RWTPYMNITV WNDTDFGFVV DQYITVNGTR KQKIDVHRGR QKFFYGKCIK YLQGRNRTWT AFHDIDEYIT VDERVVFDAK ERSSKPGSVL QMLQEVKSMK PVPDGWTESC VPVPRCRFSA VESQPEEVSL EVPPLIDAKQ LETLRWRYRS LKGRDGQPKS IVDISEVTLH RNTKFGPHAV ILGICPPHLF DRSFLVINHY LGDWDMYLFV ERVLFLLPQP LRLVYHESSF RDDCRIGSMK NREAWEFRSS ESEGGTTDQI RPWIGGFVAA MGEERALQLL KDVGLPKNYT NPYNKTEWRI EQSTLDALLK KRPRRATNYV KFLEQRIRQS NINHSTDYN
|
| |