Gene Information |
Locus tag | PHATRDRAFT_40722 |
Symbol | |
ID | 7198603 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 237984 |
End bp | 239043 |
Gene Length | 1060 bp |
Protein Length | 313 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184678 |
Protein GI | 219128981 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000570918 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACCT CCCCTCAGTT CAACCTGAGT GCTTTCCCTC ACAAAGTTCT CGATCCGATC GCCACTCTCA CCTCTCCGCC CACATGCGCC TCCATCAAGC TTGCCCAACG TCAGCTCAGT GCCAACGCTG CCGCCATTCC CTCTCTCAAT GGCGGCGGCG CCCACGGCCA CATGGCCTTA ACTCTGACCC CCGCGGCCTA CTCCGACCTC ACCGACGTTC CGTTCGTCAT CCCCGTCGCC CCGCCTGCCG ATCCGGTTCC TGGCACCATC CAACCGCAAA TCGCTGAGAA CAATCGCATC CACCAACGCG ACATTGCCGT TCACAGTTTG TACGTTGCCG TCAACAACGC CCTCCGTCAA CAGCTTCTCG ACGCCATCCC CCGCGTCTAC GTCCGCGAAA TCGAACACGA AATCTACGCC TACAGCCATA TCACCTGCCT CGACCTTCTG ACCCACCTCT GGACCACCTA CGGTACGATC TCGGCCACCG ACTTGAAAGA AAACATTCAA TCTATGTACA AGCCGTGGAA TCCTGCGGAT CCCATTGAAA CTGTGTTCCA TCAGCTCGCC GATGCCATAC GCGTTCTCCA TTGCCGGCGA CAATCCCATC ACCGAAACGG CTGCCGTGCG AGCCGGCTAC AACGTCTTCG AACACTCCGG TCTTTTTCCC TGTGCCTGCG AAAATTGGCG CTTTGCTCCC CCCGCCGAGC ACACCATGGC CAACCTCAAA ACCCACTTTA AGCTTGCCAA CACCAATTGA AAACGACAGG CCACCAGTGG TTCCCTTGGC TATGCCAACC TTCTCTCTGC CACACCCTCC GTGTCCCCGC CTCCACCATC CGACGCCCTC AGTTTGCCTT TCTCTGCTCT CTCCGTGTCA CACGCCTCCG TTGCCACCCC GGCTAAAACC TATTGCTGGA CCCACGGCAC CAGCAACAAT CGTCGGCACA CCAGTGCCAC TTGCCAGAAC AAGGCACCCG GCCATCGCGA CGACGCGACG GCCACCAACA CTCTTGGCGG CTCGACCAAG GTTTGGATGG CCCCCAAGCC CCCCAAATAG
|
Protein sequence | MSTSPQFNLS AFPHKVLDPI ATLTSPPTCA SIKLAQRQLS ANAAAIPSLN GGGAHGHMAL TLTPAAYSDL TDVPFVIPVA PPADPVPGTI QPQIAENNRI HQRDIAVHSL YVAVNNALRQ QLLDAIPRVY VREIEHEIYA YSHITCLDLL THLWTTYGTI SATDLKENIQ SMYKPWNPAD PIETVFHQLA DAIRVLHCRR QSHHRNGCRA SRLQRLRTLR GSLGYANLLS ATPSVSPPPP SDALSLPFSA LSVSHASVAT PAKTYCWTHG TSNNRRHTSA TCQNKAPGHR DDATATNTLG GSTKVWMAPK PPK
|
| |
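The fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates (which matches the listed Gene Length of 1060 bp) and that the coding sequence includes one stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def coding_bases(protein_length_aa: int) -> int:
    """Coding bases for a protein, assuming one stop codon is included."""
    return (protein_length_aa + 1) * 3


if __name__ == "__main__":
    # Coordinates and protein length taken from the record above.
    print(gene_length(237984, 239043))
    print(coding_bases(313))
```

Under these assumptions the coordinates give 1060 bp, while a 313 aa protein needs only 942 coding bases. The difference of 118 bp is consistent with the record marking the gene as spliced. Passing the Gene sequence field to gc_fraction gives a value to compare with the listed GC content.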