Gene PHATRDRAFT_47518 details

Gene Information

Locus tag: PHATRDRAFT_47518
Symbol:
ID: 7202292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 869255
End bp: 872970
Gene Length: 3716 bp
Protein Length: 953 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002181645
Protein GI: 219122631
COG category:
COG ID:
TIGRFAM ID:
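
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon; the record does not state this, but it is consistent with the reported gene length. The function names are illustrative only and do not come from the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally plus one stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_len = span_length(869255, 872970)  # Start bp / End bp from the record
cds_len = coding_length(953)            # Protein Length from the record

print(gene_len)            # 3716
print(cds_len)             # 2862
print(gene_len - cds_len)  # 854
```

Under these assumptions the reported gene span is 854 bp longer than the 953-residue protein plus a stop codon would require, so the listed gene sequence extends beyond the coding region implied by the protein length.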


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.668741
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACAT CCGTATGCAA TAGCTCCAGC AATCCACGAT ACGGCCCATT GGAGCCTCGC 
CGTCGAAAAA CTAATATACA AAGGATGCTA CGACAGGAGA GTGGTGGTCG GGCATCCACC
TACCGTCTAC GAATTACCCA GCCCTGTCCA ATCCTTTTGC TGCTTCTTTT ACTAATCGCA
CAGTCTTCGG TGTTTGTATG GTACCACACC TGTGATGAAG CATTTTTTGT AGACCGTCCC
ATCAATGAAC AAGGCATCCC AGCATTTTAC AAGCCTTCCG AACATGATTT CTGTAAGAGC
TTCGTCCGGC ACGGGCTCTT TTCGCTTCTG GGGAATGGAA TGGTTCCGAA TGACGGTAAA
ATCGAGAACG ACCTCTTCAA CACAGCAGCG CACACCCCGA AAGACCGCAC TACCTTGTCC
GCTTCACGTC GTGCTTCTGC AGATTTGAAG ATGATCAAAG CCACAATCGC TGGAGAACCG
ACAAAGGCTA CTGGGCCTGA GGACAGCTCT GTTTCGCGCG ATTTCATGAA TACCTCTGCA
TCAAAAGAAG AGCATCTCGA AGTGTATGGA GGAGTTCCAG GCAAGAATCA ACATACAGAT
GAAGCAATTC AACCGGTTTC CATGTTACTG CTCCGAGCTG TCGGAAACTC TCTTCCACCG
CGTCACAGCG CTGAACAAAC ACTAAGAAAT CTGGACTTTG TCCTGACGTA CGAGGAAGCT
TTCCCAAATC TGTCTCGCCA TTGGTTCCTG AACCGGCTTG TGGATCCACA GGTAGAGCAG
CAAGTGGTGC AAAGGCTTCA ACAAGCAAAC GAATCCTATA CGATTATTCC CTTCGATTTG
AAGTCCTACG ACTCCCACGT GTATCGTTTC GATTTGTTGG ACTTTCCCGA TCAAATTCAT
TCGACCTACT CATACAACAC GTACACAGGG TTTTTCCTCG GGTTGAATGC CGAGGATCGT
ATTATGCGTG ATAAGCTGGT GTATACTGCC AACGTCAATG GAGTTCGCAA TGAAATGCTC
AATTATGGCA GGCACCTTAG TTCTGCAGAA TACATTCTCC CGTTTGACGG CAACTGCTTT
CTCGCTCGAA ACGCTTGGGA AGCTATGCAG CGGAATATAC AACACAATCC AGCGGCAAAG
TACTTTGCCG TTCCAATGGA TCGTCTTGTC GAGGAAAATG CAGCACTGCT GTCAGGTTCA
TACAAGCCCA ATCCAGTCGA AGAACCCCAA ATTATATTTC ATCGAACGTC AATAGCCAAT
TTCAATCCAA ATTTGCCTTA CGGTCGTCGT AACAAAGTGG ACTTGCTGCG TCGGTTGGGT
ATTAAAGGCA TTTGGGAAAA CAATGAAGGG AATAAGTGGG ATGTCATCCT AGAGGCAACA
AACCCAGTTG CGGAGATAGA AGCAAACGTG ATCGAAATGA CTGGAACGGC TGGATGGATC
TCTCGGCTGT ACTCAGGGAA CAAAAATGCG GAGTTGACTG ATGCGGGTGC TTCTACAACT
CGGGTCAGGA TGCGTCGTAC AGCAATATCG ACGCTACTGG ACCGTTTGGA CTTGCGTGCA
GCTGTGACCT TGTACAATTT TACTTCCAAT ACGATGCTTT TCTATGACGA GCAGCAGCTG
TACCAATCGC ATTTACTTTG GAAAAATGGA CAATTGCAAA ATCCAATGAT CAACAGCTTA
ATGGAAAAGG CTGCCCAATC CCTAAACGAA GGTCCTTGGT CTGTTGTGGA CAAACGATCA
TTCGGCTGCG GGCCCTCACT GAGTTGTCGT GACTACTACA ACCCCTGGCC ATACATGTGG
CCGCAGCGCA ACGAATCCGG TAGTATCGAT TGGACAAGGG AGTTTGTAAT GAACAACGGT
GAAATCCTTC CGGGAAGTGT ATTGTATACT GAAGGAAGTG AGCACTTTGA CTACACGCGC
TTAGAGTCGA TGCAGCGCAA TACTACCGTT TTAGCGCTGG CCTATGCAAT ATCTGCCAAC
CAGTCCTACG CCGAAAAGGC TGCTGCCAAC CTTCGTGTAT GGTTTCTTGA ACCGGGGACC
AGAATGAATC CCAACCTCTC GTACACTCAG GTTGCGTGGG TGGGCAATCC GCCCCAGTGG
AGACGAAAAG CCTTTGGCAC GATCGAAATG AGTGGTGTGT ACTTTTTTCT GGACGCAGTT
AGGATAGTAG AAGGCTCCGG AGCCCTCTCG ATTCTGGAGA TCCATAATTT ACGAAGATGG
TTTCGCGAAT ACCTTCATTG GCTATATATG TCCAACGGAC GGAAAGAAGC TTCTGAATTC
GATCCAAATG GCGGCATACC GGAAGTGTTT GCACCAAATC ATATCGGTCT TAACTTTGAT
ATACAGCTTG CTTCTGTTGC CGCATATTGT GGCAATCTTT CCTTGGCAGT CCGCACTATC
CATAGGGCTG TTTCTCGTCT AGATGCACAT GTCAACTCTA CCGGTGCGAT GCCAAAGGAA
CAACGGGTGG GCAGCTGCGA GCACTTTACC GCCTTGGCCT TGAACGGGTG GTCGGTGGCT
GCTCGTGTCG CTTCATCGAT AGGGATTGAT TACTGGCGAC GATTCCGGGT GGCGAATCGA
AACATTTCCA AGCTGTGCCT TGCCATGGAA TACGGCAACC CACTCCTGGA CAACCGAGAG
ACTTGTGCAG GCGACGGAGC TCCGATTGAT CCCTCCCGCT GGTGGCCACT CTTTTTCGAA
GCCAAGTCAC GGTGTCCTCA CTTAGAAACG CGCTACCTTG TACAGAATGA GCGCGATCCG
CCGGAGCATA GTGACTTTCC TAGTCATCAC AACCTGATTT CGGGCTCCAA CATGCCATAC
TCAGGCGTAG CACCCTTTTG GAATTTGAAT CTGCCACCGT AGCCACAACA TATTACCATG
ATAGTAACGA CAACACTTTA CCATTCTAAA CCGCCCTCCC CCTCAATCGA AAAACGGCAC
AACGTGTTTG CATACGTTGC CTATGCGTAT ATCAAACATC AATAGTGCCG TTCTTGGCAG
TTCAATTCCG GCACAAATCC GCGTTTGTGC CGTCTTCCTG TATTGTCTTT AGCGGTGTGA
AATTGCTGGA CACGACGTTA CGAAACAAAC AATCAAATAC CTCTGTATTG TAGGCTCGAT
ATAAATTGTC GTCGCGCTCA TACGCATCGC ATACGGCATT GTTAGCTGGA CAGAAAGTGA
CTTCCGTCAA CGTAGGCTCA ATTTTTGGTA CCGACTGTGA GGTCAATCCG TCGTCAATTT
CCTGTGAATC GATTGCAAAC ATGCAATCGA CACCATACAC GGCACGCGAA TTAGGACTCT
CCGCCATGGT GGGAAAACCT CGGGTCATGC CGTCAAATAA CTCACGAACC ATGACTTGGA
TTTTAGGCCA GATGCAATCC TTCCAGTCAA ACGCAGTGTA ATCCTGCTCC AACTTGGCGA
TAGTTTTGGT GTCGAAAGGA AGAATATGGA TATCGTCGTC GGTCGACCGA GCTTCGCTAT
CGAGCAAGTG AGTGGCGGTC AAGACAGATT CAGGATCCAT ACGATCACGA GGTGTCGAAA
TCGAATGAGA CTTGTGGGCA ATACGAAAGT ACACTCGATT GTGGATGTAC AGGCGAGGTT
GGCCGGGAGT TGCTTCGGTT AAAAGTACGA TACAACGGCA GTCAATTTTG CGGCCGTCGT
AGGTGACGGA ATTTTCCAAA TAGCGTTGAA CGACACGACT TTCGCCACCA GCATCA
 
Protein sequence
METSVCNSSS NPRYGPLEPR RRKTNIQRML RQESGGRAST YRLRITQPCP ILLLLLLLIA 
QSSVFVWYHT CDEAFFVDRP INEQGIPAFY KPSEHDFCKS FVRHGLFSLL GNGMVPNDGK
IENDLFNTAA HTPKDRTTLS ASRRASADLK MIKATIAGEP TKATGPEDSS VSRDFMNTSA
SKEEHLEVYG GVPGKNQHTD EAIQPVSMLL LRAVGNSLPP RHSAEQTLRN LDFVLTYEEA
FPNLSRHWFL NRLVDPQVEQ QVVQRLQQAN ESYTIIPFDL KSYDSHVYRF DLLDFPDQIH
STYSYNTYTG FFLGLNAEDR IMRDKLVYTA NVNGVRNEML NYGRHLSSAE YILPFDGNCF
LARNAWEAMQ RNIQHNPAAK YFAVPMDRLV EENAALLSGS YKPNPVEEPQ IIFHRTSIAN
FNPNLPYGRR NKVDLLRRLG IKGIWENNEG NKWDVILEAT NPVAEIEANV IEMTGTAGWI
SRLYSGNKNA ELTDAGASTT RVRMRRTAIS TLLDRLDLRA AVTLYNFTSN TMLFYDEQQL
YQSHLLWKNG QLQNPMINSL MEKAAQSLNE GPWSVVDKRS FGCGPSLSCR DYYNPWPYMW
PQRNESGSID WTREFVMNNG EILPGSVLYT EGSEHFDYTR LESMQRNTTV LALAYAISAN
QSYAEKAAAN LRVWFLEPGT RMNPNLSYTQ VAWVGNPPQW RRKAFGTIEM SGVYFFLDAV
RIVEGSGALS ILEIHNLRRW FREYLHWLYM SNGRKEASEF DPNGGIPEVF APNHIGLNFD
IQLASVAAYC GNLSLAVRTI HRAVSRLDAH VNSTGAMPKE QRVGSCEHFT ALALNGWSVA
ARVASSIGID YWRRFRVANR NISKLCLAME YGNPLLDNRE TCAGDGAPID PSRWWPLFFE
AKSRCPHLET RYLVQNERDP PEHSDFPSHH NLISGSNMPY SGVAPFWNLN LPP
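
The gene and protein sequences above are printed in blocks of ten characters, so the spaces must be stripped before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the listed sequence. It assumes the standard genetic code (the record leaves the Translation table field empty) and reads the sequence in the frame starting at its first base; the helper names `clean`, `gc_content`, and `translate` are hypothetical and not part of the source database.

```python
# Standard genetic code (an assumption; the record does not name a table),
# laid out in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate from the first base; unknown codons become 'X'."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as listed in the record.
first_row = "ATGGAAACAT CCGTATGCAA TAGCTCCAGC AATCCACGAT ACGGCCCATT GGAGCCTCGC"
print(translate(first_row))  # METSVCNSSSNPRYGPLEPR
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the Protein Length and GC content fields.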