Gene PHATRDRAFT_47388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47388 
Symbol 
ID7202533 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp495490 
End bp498993 
Gene Length3504 bp 
Protein Length1057 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181570 
Protein GI219122476 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.911661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGAAGACTGC TCCTTGTCTC TCGACTGTTG TGTTGTGTGT AGTAGACGAG TGCGTGCGTG 
AGTCGCGAAG CGAACGGAAC GGAACTTTCC AACGGGAACT CGCGTGGGAG AAAGAGTTGG
AAACAGCGTC GTACACCAGA CCAAACGGTA GATTCGACCC AACCGACGGG ATTGTCAACA
GACATGTCGA AAAAGTTCTT TCGCCCGCCG CCACAGGCTA TGGATGACTC GGATTCGGAA
GAAGACGAGG AAGAACTCTT GGACGATGAC GCCGAAGTAG ACGACGAAGA CGTCACCGAG
GTGTCTCCAC CCAACAAATC CAAACCGACC AACAATGCCA ATCTTAGCGA GTCGGACGAT
GACGACGACG AGCCAGACGA ACGACCCCCG AAACGCAAAG GTCGCCACAT CGATAACAAC
GGCAATGCGG ACGCTCGCTC TTCCAACAAA CGTGCCAAAC CCACTTCGTT CTTCGAACTC
GAAGCCGAGG CCTCCGACAG CGAAGACGAC GACGCCGATG ACGCCGCCAA GCAACTGCAC
AAGGACAGTG AACAAATGGA CGAAGAAGCC AGGGAAATCA TTCGTCAACA AGACCGGCGC
CGTGCCCAGG CCGGTGGTCG CTTCGACCGA TCCGTGGCCG AAATATCCGC CGATATCGAA
AACCGGCACC GTGCGCAACG TCGCGTTGTC GACCGTCACC GGATTCGCGA GCCAGCATTT
GCTCCTCCTC CTCCTTCCAA ACAAAAGTCT CTGCCACGTC CCGGTAATAA GGTACGTTCC
CAGCGTCCGC CGATGGACCG GGACGACGAA GACGACTTTG ACGACGGTGG GGCACCGCCA
GACGCTACCC CCGAAGGCTA CACCGCCGTG GCGCAGCAGT CCCTCGTCCC CTCCGTTTCG
GATCCATCCA TGTGGATGGT CTCTTGTGCC AACGGCAAGG AACACGAACT CGTTTGTCAA
ATCATGAACA AGTGTGTCGC TATGGCCCGT CAAGGACGAC CGGTCGGTAT TAGCTCCGCG
ATTGCCGCAC AGTCCAAGGG TAAGATATAC TTGGAGAGTT TTAGCGAACC GGCCGTGGTG
GACGCCATCC AAGGAGTCCG CGGCTTGCTA CAGTACACGA TGCGGCTCGT ACCAATCGGT
GACATGACGA CCGTCATGAC CGTCACGTCA CGCAAGAAAC CCGTACAAAA GAACGAATGG
GTCCGCATGA CCCGCGGACA CTACAAGCAC GACCTCGCCC TCGTCAAGGA CGTTAAGGAA
AGCGGACTCA AGTGCGTTGT GCAGTGTGTC CCGCGACTGG ACCTTACTCT GGCAGATCTA
CCGCCCGCTG AAGCCCGCAT GCGACGTAAA ACCGTGCGTC CGCCGCAAAA ATTCTTCAAC
CCGCAAGAAA TCGCGGCGTT GGGACGACAC GGACTGACCC GCCAACGCTT TCCCGGCCTG
AACCTGTACT GTGACTACTT TGACGGGAAC TACTATCACG ACGGGTACCT CCTCAAGGAA
ATGACGGTGG GATCCATGAT CAAAGCCTGT ACGGACGAGG ATCCGCCGAC GTTGGATGAA
CTGCAACGGT TTCAGCGGCG TCAATCCAAG CAGAATGAGG AAGACGGTGG CGACGAGAAT
GAAGGATCCA AAATGGCCGC ATCGTTGTTG GACGAGCTGT CCGACTTGCA AGGACGGACG
GGGCTTGGTA AAACAAAGAC TGGCGGCGGT GGGGGTCTAT TGATAGGTGA TACTATTGAA
GTGATCGAAG GCGATTTAAT TGGCATGCGT GGAAAGTTGG TGAGCTTGGA CGGTACCACC
GTCAAGGTCA AACCCATCAA CGCCAACGTC GATTTGGGTG GAACGGACGA AATTGAGTTC
TTGGGTAATC AGTTGCGCAA GTACATTCCG GTCGGAGCTC ACGTCAAGGT GACGGATGGA
CGGTACGCCA ACGAGACGGG CGTTGTGGTG GCCGTCGAGC AACTCGATGG CGAAACGGAC
TTCACCGCGG TTGTATTGAC CGACGTGACC AACAAGGAAA TCTCGGTCCG CACGTCTCAG
TTACGAGAAT CCGCTGAAGT TGCGTCTGGT CAAGACAAGC TACAAGGGTA TGAACTATAC
GACTTGGTCG TGTTGAGTGG TGGAGGCTCC GCCAATGAAG TCGGTGTAAT TGTTCGTGTG
GGTCGGGAAG ACTTTACCGT CATCAACAAT CACGGTATTG TTCGAGAAAT TCGACCGGAG
GAGCTGCGTG GAAAGCGCAA CTCGACATCA CAACGTGCGG TGGCTCTCGA TGTACAAGGC
AACCAAATTC GTGCCAGTGA TTCCGTCAAC GTTGTGGAAG GTCCTCACAA AGGCAAGACG
GCGACGATCA AGCGCATGAG CCGAGCGCAA CTCTTTTTGC ATTCTCAGAC CCGCACGGAT
CATGCCGGTA TCTTTGTCGT TCGCAGCCGA AGCTGTGTAT TGGCCGGGAC ACGGACACAA
GCCCGTGGCG CAACACCCGA CAGCGGAGTC AGCCCTTTCG CGACGCCTCA GTCGCAATCG
CGAGGAGGGC CACAACCTGG TCGGGGCAAA CGCGACGATG GTTTGATCGG CAAGACTGTG
CGTATTCAGG CGGGACAATG GAAGGGATAC CTGGGGACTG TTGCGGATGC GACGGCGACA
CACGTTCAGG TCGAGTTACA CTCCCGATTA AAGAAAGTGA TGGTTGTTCG TGAACGAGTA
GCCGTTGCGG GTGACAAGTT TGGGGCCACG CAGGAACAGA ATCGCATGGT GGATGTGAAT
CCCAATCCCA TTGCCCCGAC TACGCCTTTT GTAGCGGCTG GTCAAACCCC AATGCACGGC
GGAGCAACAC CGATGCATGG GGGTGCGCTG GATGGCGATA CCAGTGATGA GGTATGGCAG
CCTGGTGGTG CTGTTGATCA GGATGCTGTT AAAGATGACG ATGGCTGGGG CTCGACCGCT
AATGATGGAT TCGGTTCGCC GAAAGACGGC GACAGCGACG GTTGGGGATC TACTAGCGAT
CAGCCGAAAG GCTGGGGTAC GTACCCAGCA GTCAAGGATA ATACTGTTAA GAAGGAAGTA
CCTTCTACCA ATGGCAACGG TGTAAAGCGC GAGCAGGCAA AGCGCGAAAT GGAAGCAGAG
CTTGATACGG AAGAGACGCC CGGCTGGTAC ATGGAACGAG TTTGTGTGCA GAACAAATCA
AAAGATAAGC AGGGAGTCAT CAAGGAGATT GATCCTGCCA CAAAGGCGGC TGTCGTGGAG
TACGAAGACC AAACATTTGA AACTCTTCGT GCGAGTGAAC TTGCTATGGT CCCACCCAGC
GAACACGACA CCGTTCTGGT CACTGGGGGC AACGAAATTG GATTGGAAGG GTCATTGGTT
TGCGTCGATG GTTCTGATGC CATTCTCAAG GATGCAAACG ATGAATTTAA GATTGTTGAG
ATCTCGTTCC TGGCCAAGGT GAAAACTATA TAGGTATTAC ATCGTAGCCC TAACAAATCT
AATTAACATA ACCCGAATGC AACG
 
Protein sequence
MSKKFFRPPP QAMDDSDSEE DEEELLDDDA EVDDEDVTEV SPPNKSKPTN NANLSESDDD 
DDEPDERPPK RKGRHIDNNG NADARSSNKR AKPTSFFELE AEASDSEDDD ADDAAKQLHK
DSEQMDEEAR EIIRQQDRRR AQAGGRFDRS VAEISADIEN RHRAQRRVRP PMDRDDEDDF
DDGGAPPDAT PEGYTAVAQQ SLVPSVSDPS MWMVSCANGK EHELVCQIMN KCVAMARQGR
PVGISSAIAA QSKGKIYLES FSEPAVVDAI QGVRGLLQYT MRLVPIGDMT TVMTVTSRKK
PVQKNEWVRM TRGHYKHDLA LVKDVKESGL KCVVQCVPRL DLTLADLPPA EARMRRKTVR
PPQKFFNPQE IAALGRHGLT RQRFPGLNLY CDYFDGNYYH DGYLLKEMTV GSMIKACTDE
DPPTLDELQR FQRRQSKQNE EDGGDENEGS KMAASLLDEL SDLQGRTGLG KTKTGGGGGL
LIGDTIEVIE GDLIGMRGKL VSLDGTTVKV KPINANVDLG GTDEIEFLGN QLRKYIPVGA
HVKVTDGRYA NETGVVVAVE QLDGETDFTA VVLTDVTNKE ISVRTSQLRE SAEVASGQDK
LQGYELYDLV VLSGGGSANE VGVIVRVGRE DFTVINNHGI VREIRPEELR GKRNSTSQRA
VALDVQGNQI RASDSVNVVE GPHKGKTATI KRMSRAQLFL HSQTRTDHAG IFVVRSRSCV
LAGTRTQARG ATPDSGVSPF ATPQSQSRGG PQPGRGKRDD GLIGKTVRIQ AGQWKGYLGT
VADATATHVQ VELHSRLKKV MVVRERVAVA GDKFGATQEQ NRMVDVNPNP IAPTTPFVAA
GQTPMHGGAT PMHGGALDGD TSDEVWQPGG AVDQDAVKDD DGWGSTANDG FGSPKDGDSD
GWGSTSDQPK GWGTYPAVKD NTVKKEVPST NGNGVKREQA KREMEAELDT EETPGWYMER
VCVQNKSKDK QGVIKEIDPA TKAAVVEYED QTFETLRASE LAMVPPSEHD TVLVTGGNEI
GLEGSLVCVD GSDAILKDAN DEFKIVEISF LAKVKTI