Gene PHATRDRAFT_15555 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_15555 
Symbol 
ID7195330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp605183 
End bp610303 
Gene Length5121 bp 
Protein Length1706 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183777 
Protein GI219127092 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAACACAGG TCAACGGCCA ACCCGTATAC GGTGGAGCCA ACGACCCGCG TTTGGGCAAT 
CTACACGACA AGTCCGATCC GGGATACTTT GGACACTTGG ATTTGGCCAA ACCAGTGTAT
CACCAGGGAT TTTTCAACAC CACGTTGCGG GCACTCCGGT GCGTCTGTTT TCACTGTTCC
CGACTCCGCA TGCTCCCGGA CGAATTCAAG TTCCAAAAGG CCATACAGAT CAAATCGCGC
AAACGCCGAC TCGAAGCTCT GCACGAATCA CTCCGCGGGA AGAAAAAATG CGATCACTGT
CAAGGTGTAC AACCCAAATA CACCAAGGTG GATCTGCACG TCGAAGCGGA CTTTCCCGAA
GACGGAATGC ACGGAAGTAC CGGGGGCGGA GGGGATTCCA AACAATTCTT GTCCGGGGAC
ACCGTGGTCA AGATATTCAA GCAAATTCGG GAAGAAGATA TTGTGTTGTT GGGTTTGGAT
GTCCAGCACG CGCGACCGGA TTGGTTGCTG GTGCAGGTAT TGCCCGTCCC GCCCCTACAC
GTTCGACCCA GCGTTACTGT CGGGGGTGGT ACGCAATCGT CCGAAGACGA TTTGACGCAT
CAACTCGTCA ACGTTATTAA ATCGAATCTC TCGTTGCAGC AGGCCGTCTC CAACGGTGAA
CCCCAAATTG TGGTGGAACA GTTCGAACTG GCCCTGCAAC ACAACGTCGC CGCCTTTATG
GACAATGAAC TACGAGGCAT GCCGCAAGTC ACTCAACGCA GTGGACGACC CCTTAAAACC
ATTACGCAAC GTCTCAAAGG CAAGGAAGGG CGAATTCGGG GAAATCTCAT GGGAAAGCGG
GTCGACTTTT CCGCGCGTAC CGTGATTACG GCCGATCCCA ATCTCGGTAT TCATCAAGTC
GGTGTGCCCC GGAGTGTCGC CATGAACTTG ACTGTCCCGA TTCGCGTGAC GGCCTTCAAT
CAAGCCGAAC TTAGCGCCCT CGTGGCCAAC GGCCCCACCA TGCATCCCGG GGCCAAGCAC
ATTATCCGAT CGGACGGAAC GCGGATCGAT CTGCGTTACG TCAAAAACAA ATCGGAACTT
CTCCTGGCCC ACGGCTGGAT TGTGGAACGG CATTTGCGTG ACGACGATAT CGTGTTGTTC
AATCGGCAGC CCAGTCTACA CAAGATGAGT ATCATGGGAC ACAAGGCCAA GGTACTGGAT
TGGAGTACTT TTCGATTGAA TTTGTCGTGT ACGAGTCCGT ACAATGCCGA TTTCGACGGC
GACGAAATGA ACCTTCACGT GCCTCAGGGA TTGGCGGCTC GTGCTGAAGC GGAACTCATG
ATGTTGAGCT CCCGGGTTAT TGTCTCGGGT CAATCGAATC GACCAGTCAT GAGTATTGTT
CAGGACAGTT TGTTGGCGAC TCAAAAAATG ACGAAACGGT CGGTTTTCAT CGAAAAGGAT
TTATGCTACA ATATGCTCAT GTGGGTGCCG CAGTGGAACG GGCAGATTCC CATTCCTGCC
GTGATCAAAC CAAAGGAATT GTGGACCGGT AAGCAATTGC TCAGTACAAT CCTGCCCAAG
GTGAATCTCA AGTCCAAGGC AAACAATGGC CCCGGAAAAG ATGCTCGTGG CAAGAACATG
CCGAATACGT TCAACATGTA CGATCATTTG GTGACGATTC AGGATGGTGA ACTGTTGGAG
GGTACAGTTG ATAAGAAGAC AATCGGCAGC TCCATGGGTG GCTTGATCCA CACGGCTTGG
TTAGACGTTG GGTTTGAAGA AACGGCTCGT TTTATGAATC AAATTCAGCA GCTAGTCAAT
CATTGGATTT TGCAGTACTC GTTTTCCATT GGAGCGATCG ATGCCGTCGC CGATGCAGAT
ACTATGCGAC AGATTGAGTC GACCATTGAC AAGGCAAAGC GGCAGGTGCA AGATTTGGTT
CGCCAAGGGC AATTGGGAGA ACTTGAGATT CAACCCGGTC GTACCATGAT CGAGTCGTTT
GAACAGCTCG TCAACAAGGT GCTGAACACG GCTCGTGATC ACGCCGGAAA ATCTGCACAA
TCTTCTTTGG ACGAAACAAA CTCGGTCAAG GCCATGGTGA CGGCTGGTTC CAAAGGTTCA
TTTATTAATA TTTCGCAAAT TATTGCCTGC GTGGGGCAGC AGAACGTGGA AGGCAAACGC
ATACCGTACG GTTTCAAGAA ACGAACCCTA CCGCACTTCT CCAAGGATGA TATCGGCTCC
GAGTCCCGAG GCTTTGTCGA GAATTCGTAT TTGCGTGGTC TGTCTCCTCA GGAATTTTTC
TTCCACGCGA TGGGTGGACG GGAAGGTTTG ATCGATACGG CTTGCAAGAC CGCCGAAACC
GGATACATTC AACGTCGCCT GGTCAAGGCA ATGGAAACCG TCATGGCGCG TTATGATGGA
ACTTTGCGAA CGAGCAGTGG ACAGATTGTT CAATTTTTGT ACGGAGAGGA TGGCATGGAC
GCGGTCTGGA TTGAAAAGCA AAATTTTGAC TCTTTGACGC TGGCAAAGCC AGAGTTCAAC
AAGCGTTTCT TATTCGACAC ATCCAGCCCA GAGTTCGGAC ACGATGAGCA AGGTATTCCG
TTTCTGGAAC CAGACGTAAT TGAGGAGTGT CGTCGCGACC CTGATATCCA GGCTACTTTG
GATCAAGAAA TTGAGATTCT TCGGGAAGAT CAAGCAATTC TCCGAATTGT CATGCGCAGC
CGAGAAGCTG GGAGAGAGAG CGACGACAGC TCATACGCAC CAGGCAATGT GCGTCGTGTG
ATTCACAACG CAATGCGTCA ATTTCGTATC GACAAGAGCA AGCCAACGGA CCTGCATCCC
ACAGAAGTGA TACAGATTGT TAACAACCTG TTGGAGCGTC TGATTGTAGT GGTTGGGAAC
GACCCGCTAA GTGTTGAAGC GCAGTCAAAC GCAACCACTC TTTACCGCAT TCTGATTCGA
ACCATGCTTT CGAGCAAGCG TGTCTTAAAG GACTGGCGTT TGAGCAAGGC TGCTTTGAAC
TGGGTGGTAG GCGAAATTGA AACTAGATTC AATATTGCTA TGGTTAATCC TGGTGAAATG
GCTGGAGTAT TGGCCGCTCA GAGTATTGGT GAGCCTGCAA CCCAGATGAC GCTCAACACC
TTCCATTATG CTGGTGTTTC CGCCAAGAAC GTGACGCTGG GTGTCCCTCG ACTGAAAGAA
ATCATCAATG TTGCTAAAAC TCCAAAGACT CCTGGCCTAA CTATTTATCT TCAGGAAGAG
GTCAGTGGTG ACGAAAAAGT TGCCGAGCAG GTCGTTGCTA TGCTGGAATT TACTGTTTTG
GGCGACGTTG TAAAGAAGAC AGAAATTTAT TACGATCCTG ACGTGAAAAA TACGGTTGTC
ACTAAGGATC GGGAGTTCGT CAAGGAATTC TATGACTTTA CGGATAAGAC AGACGATGAT
TTGCGTCGCA TGAGTCCTTG GGTTCTTCGT GTTGAGCTTG ACAAACCGCT ACTTTATGTC
AAGAAAATTA AGATGGAGGA AATCGCTAAA GAGATTGGGG AAGAATACGG TGCGGATCTG
AACGTAGAAG TGACAGACGA CAACGCCGAC GAAATGGTCG TCCGGATTCG AATCGTGAAC
GATACGCCGT TCAACTCAGG CCAAACAGAT GAAGGCGGAA ATTTGATGGA CGATCAACCG
GAAGTTGGCC AAGAAGACGA TATTTTCTTG AAACGTCTAG AAAAAAGCAT GCTTTCGAGT
CTGAAGCTTC GCGGGGTAGA CCATGTGAAG AAAGTGTTTA TGCGCGGTGG TGCGAAACGT
ACAGTTTGGG ACGATGTAAA AGGTTTCGGC GTTAGAGATG AGTGGGTACT AGAAACGGAT
GGGACAAACT TGATGGCAGT TCTTGGCGTG GACTACGTGG ATGGTACGAG ATCTGTCAGT
AATGACATCG TCGAGGTGTT CGTAGCGCTT GGCATTGAAG GAGTCCGCGG AGCGTTGTTA
AGTGAGCTTC GCAACGTCAT TAGTTTCGAC GGTTCTTACG TAAACTATCG CCATTTGGCT
TGTCTGGTGG ATGTCATGAC AATGCAGGGG CACTTAATGG CTATTGATCG CCACGGCATC
AATCGAGTCG ACACTGGTCC ATTGCTCCGA GCTTCATTCG AGGAAACGGT TGATATGCTC
ATGGATGCAG CTGTGTACGC TGAGGAGGAG ATTCTTAAGG GCGTGACCGA AAACATCATG
ATGGGTCAGC TTGCTCGAGT TGGCACCGGT GATGTAGACT TACTACTGGA TGAAGACAAA
GTTGTTCGAG AAGCAGTTGA AGTTGTTGTG GACGAGTTTG CTGTCGACAA AGATCTCGGT
ATGGCCGGAG TGGGGGGTGT AGGAGGAGCG ACCCCTTATG CCACCACTCC ATTTGCCGCT
AGCCCAATGG TGGGGGATGG CGCAGCAGCG TCTCCTTTTG TGGATGGCGG AGCCGCTTTT
TCTCCAGCAG TTGGTGCGGC AAGTTTCTCA CCGGCTTATT CTCCAGACAG CGGTAGTTAT
GGTTCTGGAT TTGCGAGTGG AAGTTACGGA GCTGGCGACA GCCCAGCGTA CAGTCCGACG
TCTCCGCAGT ATTCGCCGAC TTCTCCGGCG TACAGCCCCA CGTCTCCAGC ATATTCGCCC
ACAAGCCCAG CATACAGCCC TACCAGTCCA GCGTACAGTC CAACGTCACC TGCATATTCG
CCAACAAGTC CGGCCTACAG CCCAACTTCG CCTGCGTATT CACCAACGAG CCCCGCATAT
TCGCCAACGT CCCCGGCTTA CAGCCCGACG TCGCCGGCAT ATAGTCCGAC GAGTCCCGCA
TATTCGCCAA CGTCGCCTGC GTATTCTCCA ACGAGCCCAG CTTACAGCCC AACTTCACCG
GCCTACAGCC CTACATCTCC AGCATACAGT CCGACTTCAC CGGCTTACTC ACCCACCTCG
CCAGCTTACA GTCCTACGTC TCCGGCTTAT TCTCCGACGT CTCCCGCGTA CAGTCCTACA
TCCCCCGCGT ACTCGCCGAC ATCTCCGGCC TATTCGCCAA CATCCCCCGC ATATTCGCCG
ACCTCGCCAG CCTATTCACC GACCTCGCCG GCCTACTCAC CGTCGGGTGG CGATGATAAG
AAAGACGAAA TGGAAGACTA A
 
Protein sequence
ETQVNGQPVY GGANDPRLGN LHDKSDPGYF GHLDLAKPVY HQGFFNTTLR ALRCVCFHCS 
RLRMLPDEFK FQKAIQIKSR KRRLEALHES LRGKKKCDHC QGVQPKYTKV DLHVEADFPE
DGMHGSTGGG GDSKQFLSGD TVVKIFKQIR EEDIVLLGLD VQHARPDWLL VQVLPVPPLH
VRPSVTVGGG TQSSEDDLTH QLVNVIKSNL SLQQAVSNGE PQIVVEQFEL ALQHNVAAFM
DNELRGMPQV TQRSGRPLKT ITQRLKGKEG RIRGNLMGKR VDFSARTVIT ADPNLGIHQV
GVPRSVAMNL TVPIRVTAFN QAELSALVAN GPTMHPGAKH IIRSDGTRID LRYVKNKSEL
LLAHGWIVER HLRDDDIVLF NRQPSLHKMS IMGHKAKVLD WSTFRLNLSC TSPYNADFDG
DEMNLHVPQG LAARAEAELM MLSSRVIVSG QSNRPVMSIV QDSLLATQKM TKRSVFIEKD
LCYNMLMWVP QWNGQIPIPA VIKPKELWTG KQLLSTILPK VNLKSKANNG PGKDARGKNM
PNTFNMYDHL VTIQDGELLE GTVDKKTIGS SMGGLIHTAW LDVGFEETAR FMNQIQQLVN
HWILQYSFSI GAIDAVADAD TMRQIESTID KAKRQVQDLV RQGQLGELEI QPGRTMIESF
EQLVNKVLNT ARDHAGKSAQ SSLDETNSVK AMVTAGSKGS FINISQIIAC VGQQNVEGKR
IPYGFKKRTL PHFSKDDIGS ESRGFVENSY LRGLSPQEFF FHAMGGREGL IDTACKTAET
GYIQRRLVKA METVMARYDG TLRTSSGQIV QFLYGEDGMD AVWIEKQNFD SLTLAKPEFN
KRFLFDTSSP EFGHDEQGIP FLEPDVIEEC RRDPDIQATL DQEIEILRED QAILRIVMRS
REAGRESDDS SYAPGNVRRV IHNAMRQFRI DKSKPTDLHP TEVIQIVNNL LERLIVVVGN
DPLSVEAQSN ATTLYRILIR TMLSSKRVLK DWRLSKAALN WVVGEIETRF NIAMVNPGEM
AGVLAAQSIG EPATQMTLNT FHYAGVSAKN VTLGVPRLKE IINVAKTPKT PGLTIYLQEE
VSGDEKVAEQ VVAMLEFTVL GDVVKKTEIY YDPDVKNTVV TKDREFVKEF YDFTDKTDDD
LRRMSPWVLR VELDKPLLYV KKIKMEEIAK EIGEEYGADL NVEVTDDNAD EMVVRIRIVN
DTPFNSGQTD EGGNLMDDQP EVGQEDDIFL KRLEKSMLSS LKLRGVDHVK KVFMRGGAKR
TVWDDVKGFG VRDEWVLETD GTNLMAVLGV DYVDGTRSVS NDIVEVFVAL GIEGVRGALL
SELRNVISFD GSYVNYRHLA CLVDVMTMQG HLMAIDRHGI NRVDTGPLLR ASFEETVDML
MDAAVYAEEE ILKGVTENIM MGQLARVGTG DVDLLLDEDK VVREAVEVVV DEFAVDKDLG
MAGVGGVGGA TPYATTPFAA SPMVGDGAAA SPFVDGGAAF SPAVGAASFS PAYSPDSGSY
GSGFASGSYG AGDSPAYSPT SPQYSPTSPA YSPTSPAYSP TSPAYSPTSP AYSPTSPAYS
PTSPAYSPTS PAYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP TSPAYSPTSP
AYSPTSPAYS PTSPAYSPTS PAYSPTSPAY SPTSPAYSPT SPAYSPTSPA YSPTSPAYSP
TSPAYSPTSP AYSPSGGDDK KDEMED