Gene Information |
Locus tag | PHATRDRAFT_47193 |
Symbol | |
ID | 7202185 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 765866 |
End bp | 768850 |
Gene Length | 2985 bp |
Protein Length | 709 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181441 |
Protein GI | 219122204 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
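The coordinate fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions on the replicon. The sketch below checks that arithmetic. The inclusive convention is an assumption inferred from the listed numbers, and the helper name `span_length` is hypothetical, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields above.
gene_length = span_length(765866, 768850)
print(gene_length)  # 2985, matching the "Gene Length" field

# A 709 aa protein needs 709 codons plus one stop codon.
coding_bases = (709 + 1) * 3
print(coding_bases)  # 2130

# Bases in the gene span that are not needed to encode the protein.
print(gene_length - coding_bases)  # 855
```

The 855-base difference would be non-coding sequence such as introns or untranslated regions, which is consistent with the record marking the gene as spliced.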
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0349401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTGTACG CGGGGTTGTG GAGGGGTAGG AGGGCGGGCA TCGTTTAGAG TGCTTTTTCG GTGAACGGTA ACTGTAAGGT AACCTCGGCG AATTCTCGTC CCCCGTATAG TATCGAACAG TAAACGCCGT AGCTTGACCT CTCGACGTGT TTCTTTCACC ATCCCGGTCA TCGTAGTTGC AGTTCCTTCC GTGATCGATT CCCACACACG ACCTCACTAC CAGTCGTTCT TACTCGAAAC CGTTCAACAT CCACCGACTA TCAACGTAGA AATAGCGGTC CGATTCTTCG TCGTGGGAGA AAATCATCAA ACGCAAGATC TGGGTTGTGG ATTCCTGGAT TTGCGCTCAA AGTTTTTGCG TTTCCTCGTA ACGTAGAAAC TTACCGCAAC ACATCACAAT CATGTCCGCT TTCCTCAGGC ACTGCAGTTC CTGGTTGCAA CTAGTCCTAT GGGTATTGTT ACCTAGCTTG CTTTGGAAAA CCGACGCCTT TTCCACCCAC AAACCATTCG TGACGATGCC GTTACTGGTG GCAACCGACG GGTCTCCCGC GTCTCGACAA TTGAGTCCGG CATTACGGCA ACAGTCTCCA CGCCACCAAA AACTGCCGCT CGGAATTGAC AGTGAGCTAG AAGCACACTC GCGTCGCTTC CAGATGCTCT CGAGTAGTAC ACGCACGACA CGACGTTCCG CAGTTGCGGG TGAAGGCAGC GAAGAACGAT TCCGCAACGG CAGCAGGACT CCACCACCGC CGCCGAATGG GGAAAACTTG ATCTCGGCGA GTGCGGCAGC GGCGGAACGC TTCGCCAAGG CGGCACCTCT CGACGAAATT GCGCCTTCCG CACCGGGCAG TAAGCTACGC AAACTCAAAG ATCTCATGTG GGTCCGGGAA GCTCTGGAAG ACTTGACGGC GGCCGAATTC GCTTGCACGG TGGAGGCCTC GCACCACAAA CAGGACGAAA CTGTGTCTCG GCGACGCAAA CGCGCCGTTG ATTACGAAAA ATTACTCGGA CAGCTCAATC GCCGGATTCG CGATTTGGGC TGTGAGCCTG TAGAGGCTCC CAAGACGAAC GACGACGAGG TTACCGACGA AGCTGTCGTT CCCAGTGGAC AAGTGGAACC GGATGTGGGT GCCGGTACGC TCGTCTACTC ACTCAAACAA CGTCAGGCTT TATTGGATCG TTTGTTTCGT ACGCGTCAAC TTCTCCTGGA AGTCATTCAA GGCTACGAAC TGGAAATCGA TCCGATGGAT TCCTTTACCA TCAGTTTGCC TTCGATTCGC GTAGAAATCC CACGAGAAGA AGATCCTTCC TCACCCGGTC CCAAATTGTA CGTGCGCGAT GATGGTACCG TTGACTGGGA CGGAGCATTA CAAGACCAAG CTGCCATGAA AAAATTCGGA ACCGCGGTTT GGGCACGCAT CAATGGACGA GATCCTGAAT CTCTCGACGG TGAACAGCGC AATCCCCAAA ATGCCAATAT TGGCTCATCT GCTGTCATTG ACGCAGAAAT CGAGTCGGCC ACGGGCGCTG CCGTTGTGGG AGAAGTTCGT GAGGTCGAAA AGCCAACCGT GACGGCCAAA ATTGAAGAAA CTCCAGAAAT AATTAGGGCT CGTCAACGCT TGGAGATCCT CACCGCGGAT TTGGCCAAAA TGGAAGCGGA TTACATTGCG CTGATCAGCT CCGCCATTGC TCCGGGACAA GCTGTGGCCA ACGTCAACCT AGCCAATCTC GAGCCTGCGC AACGTAGCAG AATTCGTGAA TCCACCGAAG GCATTGATGT TATGAAGGAA AAGGTCTCGT 
TTCAAACTCT AGTGTACGAA CTCGAAAGGG TATACACTTA TTTAGTGGGA GAGATGGGTA ATCCTGCCCA AAATGGGTAC ATTCCATTGC AAGATCGATT GAATGTAGCA GAATTTGGGT TGCTAGAATC TCAAATTGAT AGCTTCCATC GACAACTGGA CGAGGGTAGC TCATCGCTCG ATACAGACGT CATGGCGGTC GTCTTGGAGC AAATGATTGA TTTTAAACGA CGATTGGGAA TCGACTACTT TGTGGCTGGT TTGTCGTTTG ACAGGGACGC GATAAAACGG TATATGAGTG AATTACTGGA AAAGACCAAG AAAGGTTTAG CCTTTTACGT CAAGGGCGTT CGCCTCTTCT GGAATGATAT CATATTTTGC TTGAGTCTGA TCAACCGCGC CGCACAAGGG TATACTCTCA AGCCTCGAGA AGTACGCACA ATTCGGTACG TATGGGTTGT CATTTATGTT CCGTCGCGGA TATGCTGAGC TTTTTCTCAA ACGTTTTTTT TCGTTTTGGA CAGACGAACC TTCAAGGATT TTTTTACATT TATTCCGTTT GTGATTATCT TGTTGATTCC GTTGTCGCCC ATCGGCCACG TTCTTGTCTT TGGTGCTATC CAGCGATTCT ACCCCGACTT TTTCCCCAGT TGCTTTACCG AGCAACGTCA GAACTTGCTG CAGCTGTACG AGAACGCTGA ATACAAGGAG TTTACAATTG ATGAAAACTG GAAGGTAAGT CTCCGTTTGT TGCCCTTGGC ACAAGAAAAT GGACAAAAAC GTTGCCGTCC TTTCGACTTA CACTGGCATT CGTCGGTAGG AAAAAATGTC GCGAATGTCG GAAGCTGCCG TTTACTTTGG AGCCAACACA TCACGAGCAT TGTTTGAAAA GATGGCCAGC ATGGTCCGGG GACAGAGTGG TGCAGCTACC GACGCCACTA CCGAGAAGAA TGGAAAAGAG CAATAACGAC GGAATATGCT TATCTTTTGC GTGTAAATGA ATTTAGTGAA ACAGACGGAC GAATCCAAAC ATGCTCACAT TCAAGACTTT CAGAGCCACC CAATGACTCA ACTTGCAAAC TCCGATCCCG ACTACACTAC CTCAGACTCT ACAAGAAAAT GTGGCAGTTT GACTCCAGAC CAAATCTACA TCGTTGTTTT CACATCGCGA GTTGATTAAA AGCTT
|
Protein sequence | MSAFLRHCSS WLQLVLWVLL PSLLWKTDAF STHKPFVTMP LLVATDGSPA SRQLSPALRQ QSPRHQKLPL GIDIAGEGSE ERFRNGSRTP PPPPNGENLI SASAAAAERF AKAAPLDEIA PSAPGSKLRK LKDLMWVREA LEDLTAAEFA CTVEASHHKQ DETVSRRRKR AVDYEKLLGQ LNRRIRDLGC EPVEAPKTND DEVTDEAVVP SGQVEPDVGA GTLVYSLKQR QALLDRLFRT RQLLLEVIQG YELEIDPMDS FTISLPSIRV EIPREEDPSS PGPKLYVRDD GTVDWDGALQ DQAAMKKFGT AVWARINGRD PESLDGEQRN PQNANIGSSA VIDAEIESAT GAAVVGEVRE VEKPTVTAKI EETPEIIRAR QRLEILTADL AKMEADYIAL ISSAIAPGQA VANVNLANLE PAQRSRIRES TEGIDVMKEK VSFQTLVYEL ERVYTYLVGE MGNPAQNGYI PLQDRLNVAE FGLLESQIDS FHRQLDEGSS SLDTDVMAVV LEQMIDFKRR LGIDYFVAGL SFDRDAIKRY MSELLEKTKK GLAFYVKGVR LFWNDIIFCL SLINRAAQGY TLKPREVRTI RRTFKDFFTF IPFVIILLIP LSPIGHVLVF GAIQRFYPDF FPSCFTEQRQ NLLQLYENAE YKEFTIDENW KEKMSRMSEA AVYFGANTSR ALFEKMASMV RGQSGAATDA TTEKNGKEQ
|
| |
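The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below strips the whitespace, translates a short fragment, and computes a GC fraction. It rests on two assumptions: the standard genetic code is used (the Translation table field is blank in this record), and the function names `clean`, `translate`, and `gc_fraction` are illustrative only. The fragment is 30 bases copied from the Gene sequence row, starting at the ATG inside the block `CATGTCCGCT`.

```python
from itertools import product

# Standard genetic code (assumed), in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# 30 bases from the Gene sequence row, beginning at the start codon.
first_codons = "ATGTCCGCT TTCCTCAGGC ACTGCAGTTC C"
print(translate(first_codons))  # MSAFLRHCSS
```

The output matches the first ten residues of the Protein sequence row. Because the gene is marked as spliced, translating the whole gene sequence in one pass should not be expected to reproduce the full 709 aa protein; the introns would have to be removed first. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared against the listed GC content.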