Gene PHATRDRAFT_47010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47010 
Symbol 
ID7202244 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp169475 
End bp172978 
Gene Length3504 bp 
Protein Length1133 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181151 
Protein GI219121600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.671968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAAGA TTGCGGAAAT TTCCGATCGC GCGGCCTGTG CAGCGTGGTG CCCACTCCAA 
GCACATCCCA ACGTTGTAGC TCTCGGAACC AAGGTACGTA CAAAGTACTG TTCTATTGAG
ATTCACGGTA AAGAGTAGAT GGCTTCTTGA TCGCATGCTT ACGTAACTTG CTCTGGTTCC
TCTGTTCTCG CTCAGGACTC TGGTGGAGGT GGTTTTGACG ACACCGGCGG AGAATTGGAA
GTTTACGACT TGTGGAGTAG CAACATCACT TCTGGTGCTG ACGACAACGA CAAGGTTGGC
TCTACGCAAC CCAAACTACT GGGATCCATC AAGACCGGAG CCCGATTTGC TAGCGTCGCA
TGGACACCGT ACACACGCAA TGGAAAGTAC CCCATGGGAT TGCTCGCGGG CGGGATGGTA
GATGGAACGA TTTACAATTG GGATCCTAGT CTAGTGATTG CCGCGTTTTC GCAAAACCAA
ACAGCGCAAG CTGAGTTAGC GGCGTGTTTG GTGCACACCG TCCAGGCTCC GCCGGCATCG
TCCAACAGCT CGTTCGGAGC CATGCAATTC AATCCACTCG AGCCGCATCA GCTCGCAGCG
GGAGCCGCTA ATGGACACGT TTCCATTATC GATATTTCCG CCGATAAAGC GTCGGTGCAG
GAACCCACCA CCGTAGCCTC ACAATTTCAA ACCTCCGAGA TTAAGGCCGT CGCCTGGAAT
CCTCAAGTAT CCCATATTGT TGCGTCTGCC GCGGCCGATG GTTCCGTTGT CGTTTGGGAC
TATAAATCCC GCAAAGCTTG GTGCGAGTTA CGGGCGGCAC ACGCTGCCGC GGCAGTCAAT
GACGTGTGCT GGAATCCGTC ACAGGGACTG CATTTGCTCA CCTGCAGCGG AGATGACCGG
GATGCCGTGC TCAAAGTATG GGATTTAGGA GCCAGTACGT CCATGCCACT CACGACTTTG
ACGGGACACG CCGCTGGAAT TTTGAGCGCG GGCTGGTGTC CTCACGACGA AACATTGTTG
CTCACGACGG CCAAGGATAA TCGTACGTTG TTGTGGGATT TGCAAACACT CCGACCCGTA
GCGGAACTCC CGAATGCAGC GGCGGAAGCA GAATTGCAAC AGAAGCAGCA ACAACAAACA
CAGCAAGCTC ATCCCACAAA CGTCTTTGCA GCCGCTGGTG CTGGTTCAGC CGGGGGTATG
TCCGATCAAC GACACATGCG TTACGCCGTC GCTTGGTCTC CGTGGAAACG TGGTGTTGTC
CTGACGTGTA GCTTGGATCG TAAAGTCGAG GCGCATTCCG TTTTGACCCT GGCCACCAAG
TCGGGACGAC CGCCGGCCTG GATGCGACCA GCCTCGAGTG TATCATGCTC CTTTGGAGGT
CTCGTCGTGT CCTGCGGAAG TCAGGATAAG GTTGTTTCGT TGCGAACCGT GTGTGAGCAA
CCCGAATTCG CGGAAACGTC ACAACGACTA GAAACGGAAC TCAAGTCCCA AACCATTGTG
GACTTTTGTC GATTCCGTCA TGTGACGGCA GCGAGAAACA AGACGGAAGC CGATATGTGG
GGATTTATGC AAGTCCTCTT TGACGCCAAC GCGCGTCAAG CTCTTTTGCA GCATTTAGGA
TACGATGCGG ATAGTATTGC GTCCACGGTG GCGGCCCGGT ACCCCCATGT AGCAACTACC
ACCGAGCACG GAGAGTCATC CGTTAATGGT AGTAGCAACA AGAGCACCAC TGGGCTCCCC
AAGCCGCCTT CCATGAAAAC CGCGTCAAGC ATGCTAGCAA AGTCTACCTT TGACCAGGCC
GCCGAAGATT CCGTGAAGCA AGCCTTGCTT GTGGGGAACT TTGAGGCTGC CGTAGACGTT
TGCTTGGCCA CCGGAAATTG GGCTGACGCA CTGGTCTTGG CCAGCTGCGG GGGTCCCGAT
CTCTGGCAGG TTGTACAAAC GCGCTTCTTT GCTTCGGAAA CTGCACAGCG TCCATATTTG
AACATTGTCA GCGCCGTAAT TCGTTCTCAA CTCTCCGATC TCGCCACTAG TCCGGACATA
GCCACAAAGT GGTCGGAAAC GCTGGCCATT TTTTCGACCT ACGGTGCGTC GGAGGAATTC
CCACAGCTCT GCGTGGCGCT TGGTGAATCT TTGGAAGAAG CGGGCGACCC GGCCAGTGCT
ACACTCTGCT ACATGTGTGC TCTGAATTTG GATCACACAG TGAGGTTCTG GAAGTCACAA
TTGGCGGAAG CTAGTCGCCA GAAAGGTGAG GGAACGAGTG ATGTCTTGGC ACTGCACGAA
TTTGTGACGA AAGTGTCCGT CTTTTTGGAA GCGGTAGGGC CGTCCGCGAT TCTGTCGGAG
GATATTGCAA CTTTGTTTGC CGATTATGCC GAGGTGTTGG CACAACAAGG TCTTTTGGTT
ACGGCGGCCA AATATGCCAG AAAGGGGACT TCGATCGAAA GCCAGAAGCT GCGCGATCGT
CTCTACAGGA GCCGCGCAAG TGCTTCTTGT TACGCGTCCT TAGGTACTGC ACCAGAATTT
CCCTACCGCA TGTCAGCTGT GGAACCCAGT CGAGGACCGA CGGTTGTCAA GAATGCGTCT
TATGCGCAGC ACCAACCGGA TGCAATTCAT CTGAATAAGC CGCAACCGAA AACCACCTAC
GAACAGCGTC AGCAGTCCTC GATATACGGA CGGCAACATG AACGACGACA GACGTCATCT
GGAGCACCAG TCCCAGCTGC AGTCATCGAG CTTCCTAGTG GATGGATGGA GCTTCATGAC
CCTAACAGTA GACTCCCATA CTACGCAAAC CAGGCCACAG GGGAAACTAC GTGGGATCGT
CCACAAGCAA TCCCGTCAAC AAATTCTCAG ACAGTGCCGC AGCAGACGTA CGCTAGTGTA
CCAGCTAGTC AAGAGCCGGT GATGGATTCG TCTCGACACT CAACGCGTTC TCAGACCTCG
GTCACTTCTG GCGTTTCAAC GCTGAGACCC AAGCCTAGTG TTGTCTCCAA ATACGGCGAC
GGTTTTGTGA CGTCCGCGTC GCATCCAGAA CTTGCCGATC AATACGGCAA CATTGGCACC
AGCAATCCCT ATAGCGGTGT AAATCGACCC GGTACAGCCG CAGCTGTGGC CGCGACAGCC
CAGTCCGCTG TAGCTCCAAT ATCAGACACC CTCAACATTG AAACCCTGGA GGTCTCACCC
GAGTTTGCTC ACATCAAGGA CACGCTGCTG GCGTGTGTCA ATGCCTTGAA AGACTATCCG
TTATCTCCCG CTGACAAGCG ACAGTTGGCG GAAGCCGAAA AGGGAGTTGC TATCCTCGTC
AAAAAAATTG TCCGCTATGA TATCGACGAA GAGACAGTCT CAAAGGTCTC ATTTATGATC
GGCGCTCTGG CCAACGGTGA CTACCCCGCA GCGACGGCGA CCAATACCGC ACTAGTTAAT
AGCGATTGGC GCGACCACAA GGACTGGCTT AAAGGGATGA AATCGTTGCT AGCGTTAGCG
TCGAAGAAGT TTGCTCGTCA GTAG
 
Protein sequence
MTKIAEISDR AACAAWCPLQ AHPNVVALGT KDSGGGGFDD TGGELEVYDL WSSNITSGAD 
DNDKVGSTQP KLLGSIKTGA RFASVAWTPY TRNGKYPMGL LAGGMVDGTI YNWDPSLVIA
AFSQNQTAQA ELAACLVHTV QAPPASSNSS FGAMQFNPLE PHQLAAGAAN GHVSIIDISA
DKASVQEPTT VASQFQTSEI KAVAWNPQVS HIVASAAADG SVVVWDYKSR KAWCELRAAH
AAAAVNDVCW NPSQGLHLLT CSGDDRDAVL KVWDLGASTS MPLTTLTGHA AGILSAGWCP
HDETLLLTTA KDNRTLLWDL QTLRPVAELP NAAAEAELQQ KQQQQTQQAH PTNVFAAAGA
GSAGGMSDQR HMRYAVAWSP WKRGVVLTCS LDRKVEAHSV LTLATKSGRP PAWMRPASSV
SCSFGGLVVS CGSQDKVVSL RTVCEQPEFA ETSQRLETEL KSQTIVDFCR FRHVTAARNK
TEADMWGFMQ VLFDANARQA LLQHLGYDAD SIASTVAARY PHVATTTEHG ESSVNGSSNK
STTGLPKPPS MKTASSMLAK STFDQAAEDS VKQALLVGNF EAAVDVCLAT GNWADALVLA
SCGGPDLWQV VQTRFFASET AQRPYLNIVS AVIRSQLSDL ATSPDIATKW SETLAIFSTY
GASEEFPQLC VALGESLEEA GDPASATLCY MCALNLDHTV RFWKSQLAEA SRQKGEGTSD
VLALHEFVTK VSVFLEAVGP SAILSEDIAT LFADYAEVLA QQGLLVTAAK YARKGTSIES
QKLRDRLYRS RASASCYASL GTAPEFPYRM SAVEPSRGPT VVKNASYAQH QPDAIHLNKP
QPKTTYEQRQ QSSIYGRQHE RRQTSSGAPV PAAVIELPSG WMELHDPNSR LPYYANQATG
ETTWDRPQAI PSTNSQTVPQ QTYASVPASQ EPVMDSSRHS TRSQTSVTSG VSTLRPKPSV
VSKYGDGFVT SASHPELADQ YGNIGTSNPY SGVNRPGTAA AVAATAQSAV APISDTLNIE
TLEVSPEFAH IKDTLLACVN ALKDYPLSPA DKRQLAEAEK GVAILVKKIV RYDIDEETVS
KVSFMIGALA NGDYPAATAT NTALVNSDWR DHKDWLKGMK SLLALASKKF ARQ