Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_43830 |
Symbol | |
ID | 7203962 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 161360 |
End bp | 167196 |
Gene Length | 5837 bp |
Protein Length | 1712 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | vacuolar protein sorting-associated protein 35 |
Protein accession | XP_002186009 |
Protein GI | 219112851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.999564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTCGGTACT GTTGACGAAA ACCCTCGTGG GTGCTTCTCG ACGAGTGTAT TCGACGTGTT TGGGTACGAG AGAGCAATTT TCTTTGCTCC CAAAGATCCA GACTTTTGTT TGGACCCTTG ACACTGCTTA CTGGTTTGTT TATTGTGACG GTTACAGTTG TTGCTGCGCA AGCCACGAGC GTGTGTCTTT CGCACTCCTT CCGTTGAGAT TTGGCAGATT GTAGAGAATG GAACGTGTAA CTTCCAATGC ATCCCAAGCA CCTTCGGTGC AGAGCGATGG GAGTCAAACA GGCGGAGCTT GGGGGACGAT GGGTGGCGGT CCCGGAGACG GGTTCCAGTC CGGACCTGGC GGTTCCGTCC AACCCAATCC CAACAACAGC ACTACTACTA GCACTGCGTT CGCATCGTCC CAACCCTCCC CCGAAACCCC GGCTCCGTAT TCCCCCTATC CTCCCGGTTC CGGGAGCGTC ACCCGCTTTG CGCCATCCTC CACGCTTTCA CCCTACCCCA CACAAGCAAC GCCGTACTAC GGGGGTGCAC CCACGACACC CGCACTCTAT CCCGACACGC CCATGGCGGC GTCGTCGTAC GCGGCTCGTT CGGCACACGC CTACGGGACC GCATCCTCGC AGTCTCTTCC GCAATCCCAA GCACCGCCTC CACCGCAAGG AGCGGCATCT TCCTACGCGT CGCGTTCCGC ACAAGCGCAC GCATCCCGAG ACGGGTCGGT GTCCATCCAC ACCGGACAGT CCCACGCCTA TTCCGGAGCC CCCGTGCGGA ATATCCCACC TCTGGTGGGT GGATCGTACG CGTCACGATC CGCCCACGCG TACCAGTCGC AGCCACAGCC ATCCGCAGCG TACGGTCCCT CCACCTCATC CCCGTCACCC CCGTCCGGTG GACTACCAGG AACGTACGCC AGAGGTAATC CGCCGATGAA TGTAACGGGA GGGACGTACG GGTCATACGC ACCGCAACAG CCACAAATAT CAAATCCGCA ACCGCAGCCC AACGACGGTG CACAATCCAA CTACAATACC GCTGCACAGC CCAACAGTAC CCTACCACCA CCAACAACTC AAAATCCACC GCCCGGTAGT ACCACGAGTG TCCACAGCAG AAGTCACGTT AACAACAACA ACAACAGCGC CCCCATTACT CCGCAGGCAC AGGCCGCTCA GCAGCGCATG CTCACCGACG CATCGCGCAA GGTCCAGGAA CACGCCTACT ATATGCGCCA AGCCATGGAC GAACGTAATC TACCCGTTGC GTTGGATCGG GCCGCTCACA TGGTGGGAGA ATTGGGGGGA CCTCCGCACG GACATCACCA CACGACCCAT ACGGCGACCG GTCCCACCAA TACGGGTTTG TCCGCATCGC TCACGCCCAA GAATTACTAC GAACTCTACA TGAGGGCCCT GGAAGAAATG CCGGCCTTGG AAGACTATTT GCTGAATTTG ACCAATCCAA CAATGTACAA CACCGAGCCA ACGATTGAAA TCGTTGCGTC GCCGCAGCAT CTGCGTCGCG CACCCTATAC CATGACGGAA CTCTATGATT GCGTTCAATA CTGTCCCCGG GTCGTCTCGC GCTTGTATTT GCAAATTACC GCCGGATCGG CTTGGATTCG GTCGGGAGGC GACGCGGACG TGTGCTGGGT GCTGAACGAC CTCGTTCAAG CCGTCAAGTG CGAACAGAAT CCCACGCGAG GCTTGTTCTT ACGACACTAT CTCTTGACCG CTCTCAAAGA CAAACTACCC GATACACCCG CGCCCCACCA CCCGTCGACC CCCCATCTGG AAACAATTGT TTCCGAAGAA GAATTGGCGG ACGACGAAAC CAAGAGCCAT GACGACAACG ACAATCTTGA CGTCGGTCAA ACCGCCGCGC CGGTTCCCGT CGGCACCGTC AAGGATTCGT ACGAATTCAT TCTCAATAAT TTCATGGAAA TGAACAAGCT TTGGGTGCGT ATGCAGCATT TGCCGGGGGA TGGACGGAGT AAGGAAGTCC GTCGTCGTCG AGCTCGTGAA CGCAACGAAC TGAGAGTGCT GGTGGGGACC AATTTGGTCC GTCTTTCGCA ACTCGAACAC GTCACGTCCA AAATTTACGG AGAAGTCATT CTGTCGCAGA TTCTGGAACA TATTGTCACG ACCGGGGAAC CCTTATCGCA AGCCTACTTG ATGGATTGTT TGGTCCAGGC CTTTCCGGAT GAATACCACA TCGAGACCTT GCCGATTTTG TTGAATGTCT GTCCGCGATT ACGGGACAAG GTAAACATTC GCACAATTTT GCAAGGGCTC ATGGATCGGC TGGCGAATTA CTTGGCGGAA GAAGAGTTGC TCGACGAGAG TGATACGAAT CAAGTCAAAA AGGCACTGGC TCGTGATTCT TTCCGACTTT TTGAAGACTG TGTCCAGAAA GTCTACAATG CGCGCGGACC CAAACTGACC TCCCGCGAAG TAATCCGTTT ACAATCCGCG CTCTTGCAGT TTTCTCTCAA GTGCTACCCC GGTAACTTAG ACCAAGTCAG CACCTGTTTG GGACTCTGCT CATCAGCTCT GCGCCAAGCC AACGCGTCGT ACGATCCTAG CGACGCTACC AGGGCAAGCA TCATCCGACC TCTGGACGAT GTGTCGGTGG CCGAGCTGGA AAAACTTTTG TCCATTCCAC TCGATTCGTT GGCGTTGAAA GTATTGCAGC TCGAGCATTT TAATGGGTTA ATTCGGTTCT TGCCCTGGAC GAGCCGCCGG CAAGTGGCCA TCAAAATGCT AGAAGCTGTC GACAAGGCGG GTGCGCCTCC TACAAATCTG GACGAGATTG AAGAGCTGTT TAGCGTGATT GAGCCAGTAA TTCGCAATCC CAACAATACA GCATCGGGGA TAAGTAGACC ACAGCCGCAG CCGACTCACA TGGCAAATAC AGCCAGTCTC ATGGCCGGAT TGGGGGTCAC TCAGACTGAC GCTCCATCGT TCAGCCAGTC TTCCTTTAAC GACAATGATC ATTCGTCAGC GGCGGCACCG TCCCCGGAAT TGGCACGCGA GGATGCTCTG GTTGCTAAAC TTATCCATCT TTTGGATCAC GAAGATACTG ATGTGATATT TGCCATGCTC AAAATTGCTC GTGAGCATAT CAATCTGGGT GGGACTAAAC GCGCAAGTCG AACGTTGGTA GCCGTTGTAT TTGCTTGTTG TCGACTTGGC CGCCGGATTT TTGACACGGA AAATAGCAAC GATGAGAGCC TGCCGATAGA ATATAAGGAA GACGGCAGCA CTGCTATGGC GAAAGACGGC AGCGGCGACG ATGATATTCC AAAGGAGCAG CACGAATGCA ACGATAGTGA TGGTCCGAAA GAGAATAATG ATGATGACAG AGACGACAGT CCCAGAGAAA AAGACATCGA AAACAAAGAC GACAGCATAT CAGAAAGTAA GAAAGTGGAT ATTCCGCAAA AGACAGCAGA AACTGAGTCC GAAACGAAAT CTATGTCGGA AACGAAATCT ATGTCGAAAA CGAAAGCGAC CAGGTACGAA TTCACATTTC TACCATACGA GTATGTGAAT GACGCTAAGT CTGCGCGTAC TGACAGACGT TTTTCCTTCT GCCGTACTAC AGCTCCCGAA ATGTATTCGT GTTCATCCAA GACACGCTGT CTATGATAGG AAGGGCCAAC GCGGAGGTTG GCATCAAGCT CTATTTAGAA GTATCGCTTA TTGCTGATTT GCTGGCAAAG CGATCATCGG AATTCTCCGC AATCTCTTAC GAGCTCATGA CACAAGCATT TGCCTTATAC GAAGAATCAG TATCAGACTC AAAGGTACAG TACCGTTGTG TATCACGAAT GATTGGTACC TTGCTCTCCG TGGTGTCGCT TAGCAAAGAG GATTACGAAG GGCTGATCAC AAAAACGGCA CAATTTTCAG CAAAGCTATT GAAGAAAGCG GATCAATGCG AGCTAGTGGC ACAATGCGCT TACTTGTTTT ACCCAGTGGA TGCGAGTAAC AATGCTTCCA AGTACTCCAA CCCGCAGCGT GCTTTAGAAT GTTTGCAGCG ATCTTTGAAA CTAGCTGATG CGTGTACTTC CGCCAATGCT GGGAACGTCG GTTTGTTCGT CGATCTTTTG GAGCACTACG TATTCTTCTT TGAGAAAAAA AATCCTGTGA TTTCGCATTC ATACATAACG GGACTCGTGT CACTTATCAA AGAGCACTTC AATACTTTAT CCGACGATTC TGGCGTCGCA CAAGCCAAAA CACATTTCGC TGAGCTTGTT CGCTATATCA AAGCGAAGAA ATCCAACGAT TCCTCTTCGG AGCTGTTCTC ACCTGTCCAG GTAAATATGT AGTAACAATT TAATTAGTTG GAGATTGCGC ATAGTAGCAT GGAAGAGTTA GGCGCGAGGT AGGAAGCGAA GGTGCAACGC TTGCGATAGA TATGTGCACC TATTCCTGTA CGGGAATACG CTTTGGCCGG CAGTTGAATG ACTGCAGGAG CCCGAGTATC GTTACACATG GCGGATTCCA ATCGTACGTC CGTTTCAACC ATCTCGGTTC GTCTCTTGAG GCTTGGTAAA GATTTCTGTA ATCTGCGGTT TCAGTCGCTG AATGGAGCCT TCGGGGATTA GTGGACTGCT TCATAAAAAA AGGTAAGACA AAGGCCGAAT AGGATGGGGT AAGAGTGAAG AGTGCTATCT TCTGGAGTCT CACCAAAACT GGTCATGCTA TCGCAGGAAG TATGATTCAT CCAGCTAACG CTATATTGGC TGCTTTGCTG GGTTATCTGG TCAAAGGGAG CAACGCCGAG TCGCACTACA CCTTGAAGTA CCGACAATCG AAACCAGGAG GGAATGCTGT GCGCAGGGCT ACCAAACTTC GAGGAATGAG GCACTTGAGC ATCAACAAAG GACGCACTCT TGCCATATTG AAAGATAGAA CACAGGCTTT GGCACCTAGA GGAGGAGAAA AGGTAAGTTC TCTGGTTCAC TGCTTCGGGA ACTGTCCAAT GTCTCACTCC TCATTTCGAA ATAGGGTCGC AACTTCCGTT CTGGTGCCAA TGGTATCAGT GGTAGAACCA AAAGCAATGG TCCAGCTTTC AGCGAATGGC CTGTAGACGA CGATCCGCTT ATTCCAGAAC CTGAAGTCAC AATGACGGGC GAGGATCGTC CCAGTTCAGA CGATGGGCTA GTGATCACAG GAGACGATGT CATCTCACCA ACCCCAGCAC CAAAACCTGA AAGCTCTCCC AGCAACGAAG GTGTGAACGT GAGTGGCCCC ACGCCCAGTG ATGGCGGCAT TTCGCCTACC AATGCCGGTG AAAACTCTCC AAGCAATGAA AGCGTAAACG TGAGCGGCCC CACACCCAGC GGTGGCGGCT CTACTCCTAC GAATGCCGAT GAAAACTCTC CTAGTAACGA TGGCATAAAC GTGAGCGGCC TTGCGCCCAG CGGCGGTTCT GCTCCAACGA ATGCCGATGA AAACTCTCCC AGTAACGACG GGATAAACGT GAGCGGCCTT GCACCCAGCG ATCGCGTCTC TACACCTACG AATACCGGCG AAAACTCTCC CAGCAACGAC AACATCAACG TGTCTGGTGT ATTGACGCCT CTAGACGACG ACGAAAACTC TATCAGCAAC GATGATATCA ATGCAAATGT CCCTCAAGAC GACGAGGTCG ACTATTCTCC CAGCAATGAG GGAATGGGCA TTGATCGCAG CGGCTTTCCC GCTGGTGGCG GCGGCATTGC GGGCATTCCA CCTTTGGAAG GTATGGAGTC CGAATTCGGC GACGATGTCT TTAACCCTCC AACGAATGAA GACTATGTAC GCGCCGGAAA CGTCTAA
|
Protein sequence | MERVTSNASQ APSVQSDGSQ TGGAWGTMGG GPGDGFQSGP GGSVQPNPNN STTTSTAFAS SQPSPETPAP YSPYPPGSGS VTRFAPSSTL SPYPTQATPY YGGAPTTPAL YPDTPMAASS YAARSAHAYG TASSQSLPQS QAPPPPQGAA SSYASRSAQA HASRDGSVSI HTGQSHAYSG APVRNIPPLV GGSYASRSAH AYQSQPQPSA AYGPSTSSPS PPSGGLPGTY ARGNPPMNVT GGTYGSYAPQ QPQISNPQPQ PNDGAQSNYN TAAQPNSTLP PPTTQNPPPG STTSVHSRSH VNNNNNSAPI TPQAQAAQQR MLTDASRKVQ EHAYYMRQAM DERNLPVALD RAAHMVGELG GPPHGHHHTT HTATGPTNTG LSASLTPKNY YELYMRALEE MPALEDYLLN LTNPTMYNTE PTIEIVASPQ HLRRAPYTMT ELYDCVQYCP RVVSRLYLQI TAGSAWIRSG GDADVCWVLN DLVQAVKCEQ NPTRGLFLRH YLLTALKDKL PDTPAPHHPS TPHLETIVSE EELADDETKS HDDNDNLDVG QTAAPVPVGT VKDSYEFILN NFMEMNKLWV RMQHLPGDGR SKEVRRRRAR ERNELRVLVG TNLVRLSQLE HVTSKIYGEV ILSQILEHIV TTGEPLSQAY LMDCLVQAFP DEYHIETLPI LLNVCPRLRD KVNIRTILQG LMDRLANYLA EEELLDESDT NQVKKALARD SFRLFEDCVQ KVYNARGPKL TSREVIRLQS ALLQFSLKCY PGNLDQVSTC LGLCSSALRQ ANASYDPSDA TRASIIRPLD DVSVAELEKL LSIPLDSLAL KVLQLEHFNG LIRFLPWTSR RQVAIKMLEA VDKAGAPPTN LDEIEELFSV IEPVIRNPNN TASGISRPQP QPTHMANTAS LMAGLGVTQT DAPSFSQSSF NDNDHSSAAA PSPELAREDA LVAKLIHLLD HEDTDVIFAM LKIAREHINL GGTKRASRTL VAVVFACCRL GRRIFDTENS NDESLPIEYK EDGSTAMAKD GSGDDDIPKE QHECNDSDGP KENNDDDRDD SPREKDIENK DDSISESKKV DIPQKTAETE SETKSMSETK SMSKTKATSS RNVFVFIQDT LSMIGRANAE VGIKLYLEVS LIADLLAKRS SEFSAISYEL MTQAFALYEE SVSDSKVQYR CVSRMIGTLL SVVSLSKEDY EGLITKTAQF SAKLLKKADQ CELVAQCAYL FYPVDASNNA SKYSNPQRAL ECLQRSLKLA DACTSANAGN VGLFVDLLEH YVFFFEKKNP VISHSYITGL VSLIKEHFNT LSDDSGVAQA KTHFAELVRY IKAKKSNDSS SELFSPVQLE IAHSSMEELG ASLRGLVDCF IKKGSMIHPA NAILAALLGY LVKGSNAESH YTLKYRQSKP GGNAVRRATK LRGMRHLSIN KGRTLAILKD RTQALAPRGG EKGRNFRSGA NGISGRTKSN GPAFSEWPVD DDPLIPEPEV TMTGEDRPSS DDGLVITGDD VISPTPAPKP ESSPSNEGVN VSGPTPSDGG ISPTNAGENS PSNESVNVSG PTPSGGGSTP TNADENSPSN DGINVSGLAP SGGSAPTNAD ENSPSNDGIN VSGLAPSDRV STPTNTGENS PSNDNINVSG VLTPLDDDEN SISNDDINAN VPQDDEVDYS PSNEGMGIDR SGFPAGGGGI AGIPPLEGME SEFGDDVFNP PTNEDYVRAG NV
|
| |