Gene PHATR_43830 details


Gene Information

Locus tag: PHATR_43830
Symbol:
ID: 7203962
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 161360
End bp: 167196
Gene Length: 5837 bp
Protein Length: 1712 aa
Translation table:
GC content: 52%
IMG OID:
Product: vacuolar protein sorting-associated protein 35
Protein accession: XP_002186009
Protein GI: 219112851
COG category:
COG ID:
TIGRFAM ID:
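
The gene length above follows from the start and end coordinates if both are read as 1-based and inclusive (an assumption, but one that reproduces the reported 5837 bp). The sketch below shows that arithmetic, plus the number of coding nucleotides implied by the protein length if one stop codon is assumed. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein, assuming one trailing stop codon."""
    return 3 * (protein_length_aa + 1)


length = gene_length(161360, 167196)   # coordinates from the record above
coding = coding_nucleotides(1712)      # protein length from the record above
print(length, coding, length - coding)
```

Under these assumptions the gene span is longer than the coding nucleotides alone, which is consistent with the record marking the gene as spliced and with the gene sequence starting upstream of the first ATG.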


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.999564
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCTCGGTACT GTTGACGAAA ACCCTCGTGG GTGCTTCTCG ACGAGTGTAT TCGACGTGTT 
TGGGTACGAG AGAGCAATTT TCTTTGCTCC CAAAGATCCA GACTTTTGTT TGGACCCTTG
ACACTGCTTA CTGGTTTGTT TATTGTGACG GTTACAGTTG TTGCTGCGCA AGCCACGAGC
GTGTGTCTTT CGCACTCCTT CCGTTGAGAT TTGGCAGATT GTAGAGAATG GAACGTGTAA
CTTCCAATGC ATCCCAAGCA CCTTCGGTGC AGAGCGATGG GAGTCAAACA GGCGGAGCTT
GGGGGACGAT GGGTGGCGGT CCCGGAGACG GGTTCCAGTC CGGACCTGGC GGTTCCGTCC
AACCCAATCC CAACAACAGC ACTACTACTA GCACTGCGTT CGCATCGTCC CAACCCTCCC
CCGAAACCCC GGCTCCGTAT TCCCCCTATC CTCCCGGTTC CGGGAGCGTC ACCCGCTTTG
CGCCATCCTC CACGCTTTCA CCCTACCCCA CACAAGCAAC GCCGTACTAC GGGGGTGCAC
CCACGACACC CGCACTCTAT CCCGACACGC CCATGGCGGC GTCGTCGTAC GCGGCTCGTT
CGGCACACGC CTACGGGACC GCATCCTCGC AGTCTCTTCC GCAATCCCAA GCACCGCCTC
CACCGCAAGG AGCGGCATCT TCCTACGCGT CGCGTTCCGC ACAAGCGCAC GCATCCCGAG
ACGGGTCGGT GTCCATCCAC ACCGGACAGT CCCACGCCTA TTCCGGAGCC CCCGTGCGGA
ATATCCCACC TCTGGTGGGT GGATCGTACG CGTCACGATC CGCCCACGCG TACCAGTCGC
AGCCACAGCC ATCCGCAGCG TACGGTCCCT CCACCTCATC CCCGTCACCC CCGTCCGGTG
GACTACCAGG AACGTACGCC AGAGGTAATC CGCCGATGAA TGTAACGGGA GGGACGTACG
GGTCATACGC ACCGCAACAG CCACAAATAT CAAATCCGCA ACCGCAGCCC AACGACGGTG
CACAATCCAA CTACAATACC GCTGCACAGC CCAACAGTAC CCTACCACCA CCAACAACTC
AAAATCCACC GCCCGGTAGT ACCACGAGTG TCCACAGCAG AAGTCACGTT AACAACAACA
ACAACAGCGC CCCCATTACT CCGCAGGCAC AGGCCGCTCA GCAGCGCATG CTCACCGACG
CATCGCGCAA GGTCCAGGAA CACGCCTACT ATATGCGCCA AGCCATGGAC GAACGTAATC
TACCCGTTGC GTTGGATCGG GCCGCTCACA TGGTGGGAGA ATTGGGGGGA CCTCCGCACG
GACATCACCA CACGACCCAT ACGGCGACCG GTCCCACCAA TACGGGTTTG TCCGCATCGC
TCACGCCCAA GAATTACTAC GAACTCTACA TGAGGGCCCT GGAAGAAATG CCGGCCTTGG
AAGACTATTT GCTGAATTTG ACCAATCCAA CAATGTACAA CACCGAGCCA ACGATTGAAA
TCGTTGCGTC GCCGCAGCAT CTGCGTCGCG CACCCTATAC CATGACGGAA CTCTATGATT
GCGTTCAATA CTGTCCCCGG GTCGTCTCGC GCTTGTATTT GCAAATTACC GCCGGATCGG
CTTGGATTCG GTCGGGAGGC GACGCGGACG TGTGCTGGGT GCTGAACGAC CTCGTTCAAG
CCGTCAAGTG CGAACAGAAT CCCACGCGAG GCTTGTTCTT ACGACACTAT CTCTTGACCG
CTCTCAAAGA CAAACTACCC GATACACCCG CGCCCCACCA CCCGTCGACC CCCCATCTGG
AAACAATTGT TTCCGAAGAA GAATTGGCGG ACGACGAAAC CAAGAGCCAT GACGACAACG
ACAATCTTGA CGTCGGTCAA ACCGCCGCGC CGGTTCCCGT CGGCACCGTC AAGGATTCGT
ACGAATTCAT TCTCAATAAT TTCATGGAAA TGAACAAGCT TTGGGTGCGT ATGCAGCATT
TGCCGGGGGA TGGACGGAGT AAGGAAGTCC GTCGTCGTCG AGCTCGTGAA CGCAACGAAC
TGAGAGTGCT GGTGGGGACC AATTTGGTCC GTCTTTCGCA ACTCGAACAC GTCACGTCCA
AAATTTACGG AGAAGTCATT CTGTCGCAGA TTCTGGAACA TATTGTCACG ACCGGGGAAC
CCTTATCGCA AGCCTACTTG ATGGATTGTT TGGTCCAGGC CTTTCCGGAT GAATACCACA
TCGAGACCTT GCCGATTTTG TTGAATGTCT GTCCGCGATT ACGGGACAAG GTAAACATTC
GCACAATTTT GCAAGGGCTC ATGGATCGGC TGGCGAATTA CTTGGCGGAA GAAGAGTTGC
TCGACGAGAG TGATACGAAT CAAGTCAAAA AGGCACTGGC TCGTGATTCT TTCCGACTTT
TTGAAGACTG TGTCCAGAAA GTCTACAATG CGCGCGGACC CAAACTGACC TCCCGCGAAG
TAATCCGTTT ACAATCCGCG CTCTTGCAGT TTTCTCTCAA GTGCTACCCC GGTAACTTAG
ACCAAGTCAG CACCTGTTTG GGACTCTGCT CATCAGCTCT GCGCCAAGCC AACGCGTCGT
ACGATCCTAG CGACGCTACC AGGGCAAGCA TCATCCGACC TCTGGACGAT GTGTCGGTGG
CCGAGCTGGA AAAACTTTTG TCCATTCCAC TCGATTCGTT GGCGTTGAAA GTATTGCAGC
TCGAGCATTT TAATGGGTTA ATTCGGTTCT TGCCCTGGAC GAGCCGCCGG CAAGTGGCCA
TCAAAATGCT AGAAGCTGTC GACAAGGCGG GTGCGCCTCC TACAAATCTG GACGAGATTG
AAGAGCTGTT TAGCGTGATT GAGCCAGTAA TTCGCAATCC CAACAATACA GCATCGGGGA
TAAGTAGACC ACAGCCGCAG CCGACTCACA TGGCAAATAC AGCCAGTCTC ATGGCCGGAT
TGGGGGTCAC TCAGACTGAC GCTCCATCGT TCAGCCAGTC TTCCTTTAAC GACAATGATC
ATTCGTCAGC GGCGGCACCG TCCCCGGAAT TGGCACGCGA GGATGCTCTG GTTGCTAAAC
TTATCCATCT TTTGGATCAC GAAGATACTG ATGTGATATT TGCCATGCTC AAAATTGCTC
GTGAGCATAT CAATCTGGGT GGGACTAAAC GCGCAAGTCG AACGTTGGTA GCCGTTGTAT
TTGCTTGTTG TCGACTTGGC CGCCGGATTT TTGACACGGA AAATAGCAAC GATGAGAGCC
TGCCGATAGA ATATAAGGAA GACGGCAGCA CTGCTATGGC GAAAGACGGC AGCGGCGACG
ATGATATTCC AAAGGAGCAG CACGAATGCA ACGATAGTGA TGGTCCGAAA GAGAATAATG
ATGATGACAG AGACGACAGT CCCAGAGAAA AAGACATCGA AAACAAAGAC GACAGCATAT
CAGAAAGTAA GAAAGTGGAT ATTCCGCAAA AGACAGCAGA AACTGAGTCC GAAACGAAAT
CTATGTCGGA AACGAAATCT ATGTCGAAAA CGAAAGCGAC CAGGTACGAA TTCACATTTC
TACCATACGA GTATGTGAAT GACGCTAAGT CTGCGCGTAC TGACAGACGT TTTTCCTTCT
GCCGTACTAC AGCTCCCGAA ATGTATTCGT GTTCATCCAA GACACGCTGT CTATGATAGG
AAGGGCCAAC GCGGAGGTTG GCATCAAGCT CTATTTAGAA GTATCGCTTA TTGCTGATTT
GCTGGCAAAG CGATCATCGG AATTCTCCGC AATCTCTTAC GAGCTCATGA CACAAGCATT
TGCCTTATAC GAAGAATCAG TATCAGACTC AAAGGTACAG TACCGTTGTG TATCACGAAT
GATTGGTACC TTGCTCTCCG TGGTGTCGCT TAGCAAAGAG GATTACGAAG GGCTGATCAC
AAAAACGGCA CAATTTTCAG CAAAGCTATT GAAGAAAGCG GATCAATGCG AGCTAGTGGC
ACAATGCGCT TACTTGTTTT ACCCAGTGGA TGCGAGTAAC AATGCTTCCA AGTACTCCAA
CCCGCAGCGT GCTTTAGAAT GTTTGCAGCG ATCTTTGAAA CTAGCTGATG CGTGTACTTC
CGCCAATGCT GGGAACGTCG GTTTGTTCGT CGATCTTTTG GAGCACTACG TATTCTTCTT
TGAGAAAAAA AATCCTGTGA TTTCGCATTC ATACATAACG GGACTCGTGT CACTTATCAA
AGAGCACTTC AATACTTTAT CCGACGATTC TGGCGTCGCA CAAGCCAAAA CACATTTCGC
TGAGCTTGTT CGCTATATCA AAGCGAAGAA ATCCAACGAT TCCTCTTCGG AGCTGTTCTC
ACCTGTCCAG GTAAATATGT AGTAACAATT TAATTAGTTG GAGATTGCGC ATAGTAGCAT
GGAAGAGTTA GGCGCGAGGT AGGAAGCGAA GGTGCAACGC TTGCGATAGA TATGTGCACC
TATTCCTGTA CGGGAATACG CTTTGGCCGG CAGTTGAATG ACTGCAGGAG CCCGAGTATC
GTTACACATG GCGGATTCCA ATCGTACGTC CGTTTCAACC ATCTCGGTTC GTCTCTTGAG
GCTTGGTAAA GATTTCTGTA ATCTGCGGTT TCAGTCGCTG AATGGAGCCT TCGGGGATTA
GTGGACTGCT TCATAAAAAA AGGTAAGACA AAGGCCGAAT AGGATGGGGT AAGAGTGAAG
AGTGCTATCT TCTGGAGTCT CACCAAAACT GGTCATGCTA TCGCAGGAAG TATGATTCAT
CCAGCTAACG CTATATTGGC TGCTTTGCTG GGTTATCTGG TCAAAGGGAG CAACGCCGAG
TCGCACTACA CCTTGAAGTA CCGACAATCG AAACCAGGAG GGAATGCTGT GCGCAGGGCT
ACCAAACTTC GAGGAATGAG GCACTTGAGC ATCAACAAAG GACGCACTCT TGCCATATTG
AAAGATAGAA CACAGGCTTT GGCACCTAGA GGAGGAGAAA AGGTAAGTTC TCTGGTTCAC
TGCTTCGGGA ACTGTCCAAT GTCTCACTCC TCATTTCGAA ATAGGGTCGC AACTTCCGTT
CTGGTGCCAA TGGTATCAGT GGTAGAACCA AAAGCAATGG TCCAGCTTTC AGCGAATGGC
CTGTAGACGA CGATCCGCTT ATTCCAGAAC CTGAAGTCAC AATGACGGGC GAGGATCGTC
CCAGTTCAGA CGATGGGCTA GTGATCACAG GAGACGATGT CATCTCACCA ACCCCAGCAC
CAAAACCTGA AAGCTCTCCC AGCAACGAAG GTGTGAACGT GAGTGGCCCC ACGCCCAGTG
ATGGCGGCAT TTCGCCTACC AATGCCGGTG AAAACTCTCC AAGCAATGAA AGCGTAAACG
TGAGCGGCCC CACACCCAGC GGTGGCGGCT CTACTCCTAC GAATGCCGAT GAAAACTCTC
CTAGTAACGA TGGCATAAAC GTGAGCGGCC TTGCGCCCAG CGGCGGTTCT GCTCCAACGA
ATGCCGATGA AAACTCTCCC AGTAACGACG GGATAAACGT GAGCGGCCTT GCACCCAGCG
ATCGCGTCTC TACACCTACG AATACCGGCG AAAACTCTCC CAGCAACGAC AACATCAACG
TGTCTGGTGT ATTGACGCCT CTAGACGACG ACGAAAACTC TATCAGCAAC GATGATATCA
ATGCAAATGT CCCTCAAGAC GACGAGGTCG ACTATTCTCC CAGCAATGAG GGAATGGGCA
TTGATCGCAG CGGCTTTCCC GCTGGTGGCG GCGGCATTGC GGGCATTCCA CCTTTGGAAG
GTATGGAGTC CGAATTCGGC GACGATGTCT TTAACCCTCC AACGAATGAA GACTATGTAC
GCGCCGGAAA CGTCTAA
 
Protein sequence
MERVTSNASQ APSVQSDGSQ TGGAWGTMGG GPGDGFQSGP GGSVQPNPNN STTTSTAFAS 
SQPSPETPAP YSPYPPGSGS VTRFAPSSTL SPYPTQATPY YGGAPTTPAL YPDTPMAASS
YAARSAHAYG TASSQSLPQS QAPPPPQGAA SSYASRSAQA HASRDGSVSI HTGQSHAYSG
APVRNIPPLV GGSYASRSAH AYQSQPQPSA AYGPSTSSPS PPSGGLPGTY ARGNPPMNVT
GGTYGSYAPQ QPQISNPQPQ PNDGAQSNYN TAAQPNSTLP PPTTQNPPPG STTSVHSRSH
VNNNNNSAPI TPQAQAAQQR MLTDASRKVQ EHAYYMRQAM DERNLPVALD RAAHMVGELG
GPPHGHHHTT HTATGPTNTG LSASLTPKNY YELYMRALEE MPALEDYLLN LTNPTMYNTE
PTIEIVASPQ HLRRAPYTMT ELYDCVQYCP RVVSRLYLQI TAGSAWIRSG GDADVCWVLN
DLVQAVKCEQ NPTRGLFLRH YLLTALKDKL PDTPAPHHPS TPHLETIVSE EELADDETKS
HDDNDNLDVG QTAAPVPVGT VKDSYEFILN NFMEMNKLWV RMQHLPGDGR SKEVRRRRAR
ERNELRVLVG TNLVRLSQLE HVTSKIYGEV ILSQILEHIV TTGEPLSQAY LMDCLVQAFP
DEYHIETLPI LLNVCPRLRD KVNIRTILQG LMDRLANYLA EEELLDESDT NQVKKALARD
SFRLFEDCVQ KVYNARGPKL TSREVIRLQS ALLQFSLKCY PGNLDQVSTC LGLCSSALRQ
ANASYDPSDA TRASIIRPLD DVSVAELEKL LSIPLDSLAL KVLQLEHFNG LIRFLPWTSR
RQVAIKMLEA VDKAGAPPTN LDEIEELFSV IEPVIRNPNN TASGISRPQP QPTHMANTAS
LMAGLGVTQT DAPSFSQSSF NDNDHSSAAA PSPELAREDA LVAKLIHLLD HEDTDVIFAM
LKIAREHINL GGTKRASRTL VAVVFACCRL GRRIFDTENS NDESLPIEYK EDGSTAMAKD
GSGDDDIPKE QHECNDSDGP KENNDDDRDD SPREKDIENK DDSISESKKV DIPQKTAETE
SETKSMSETK SMSKTKATSS RNVFVFIQDT LSMIGRANAE VGIKLYLEVS LIADLLAKRS
SEFSAISYEL MTQAFALYEE SVSDSKVQYR CVSRMIGTLL SVVSLSKEDY EGLITKTAQF
SAKLLKKADQ CELVAQCAYL FYPVDASNNA SKYSNPQRAL ECLQRSLKLA DACTSANAGN
VGLFVDLLEH YVFFFEKKNP VISHSYITGL VSLIKEHFNT LSDDSGVAQA KTHFAELVRY
IKAKKSNDSS SELFSPVQLE IAHSSMEELG ASLRGLVDCF IKKGSMIHPA NAILAALLGY
LVKGSNAESH YTLKYRQSKP GGNAVRRATK LRGMRHLSIN KGRTLAILKD RTQALAPRGG
EKGRNFRSGA NGISGRTKSN GPAFSEWPVD DDPLIPEPEV TMTGEDRPSS DDGLVITGDD
VISPTPAPKP ESSPSNEGVN VSGPTPSDGG ISPTNAGENS PSNESVNVSG PTPSGGGSTP
TNADENSPSN DGINVSGLAP SGGSAPTNAD ENSPSNDGIN VSGLAPSDRV STPTNTGENS
PSNDNINVSG VLTPLDDDEN SISNDDINAN VPQDDEVDYS PSNEGMGIDR SGFPAGGGGI
AGIPPLEGME SEFGDDVFNP PTNEDYVRAG NV
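
The sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before lengths or base composition can be checked. The following is a minimal sketch of that clean-up and of a GC percentage calculation. The helper names are hypothetical, and the database may compute its own GC content differently (for example with different rounding).

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, pasted as they appear in the record.
example = "TCTCGGTACT GTTGACGAAA\nACCCTCGTGG"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Pasting the full gene or protein block into `clean_sequence` and taking `len()` of the result gives a value to compare against the Gene Length and Protein Length fields.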