Gene PHATRDRAFT_47067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47067 
Symbol 
ID7202153 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp315757 
End bp319539 
Gene Length3783 bp 
Protein Length1237 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181182 
Protein GI219121664 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.375267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACGGT CCTGCGCCGA TTTCGTGCGT ACAGCGAGCT CGTGGAAAGA TCGAATGTAC 
GTATTCAACT TCCGGGTCAC GTTTTACGGT AGTAGGGGAT GGCCACACTG TCCGTACTTC
AACAGACAAA ACCGGAACCT TATCTGGAAA TTCTTCACCG GCCAAGATCC ATCAATTTTT
TCCCTTGGGC GTCGCCGTTT TGGAGTATCT CCCTCCTTCT GCTATGCTCC TTGCGCTGCT
GTGAGCTTCA CGGAAGCCGA CACCAGTCAT CCTCCACCTA CTTTTACAAG TGCCAACTGT
TTGTTGGAAT TTGTCCGTGT CTGCTTGACC AGAATGTCGA ATGGTATGGC AACGGCGACT
GCCCTCGGCT TTGGCGGTCG GGCGCCCGCC CTGGTGCTTG ACTTGCTATC AAGGGCCGTC
CATGAGGAAG GGATCGTGAC GGTATCGCCA AGGCGCTCGC AAGCTTTACC GCTGGCCTTG
ATGGAAGCGC CGTTTCGGAC GTTTCTCAGC AACGACAATA ACAACAACAA GACGGAGAGT
GGAGGCGGAA AATCACCTCT TGACTTGCTC GTGTACTGTC TACCAACGAG TGTGACGGAA
TATCTCAACG CTGCTCAAGA AGCCACTCTC GTCCGTTCCA AGCTCCTGTA TGGCAGTTCC
GATCGCCAGA TCGTGGTGGT CATATACGAT GTCCACAAGA ATGCTTTTGT AGGCGGTGAG
ACTGCGCAAA TCCTCGACGC GGTTGGTGGC AACAATTCGT ATCGACAAAA ACTCGCCATG
GCGAGTTTGG CGTCGCTTGC CGAACTCGAT CAAAAGGCGG CCGCTCAGCT CGGCCCCGAT
CGTCAACAAG ATTCCGTGAT TGTCGTAGGT GCCGGAGGCC GCGAGCATGC GCTCGCTGTG
GCTTTGGCGC AATCGCCCCT TGTCGGAAGA GTCCTGTGTT GCCCTGGTAA CGGGGGTACC
GCCGTCGAAG GTGGGAAAAT CGCTAACGTT CCCAACGGTC AACAAGACAA CGAAAGTGTC
GTAGCATTGG TCAAGGAAAC TAACGCTGCA ATGGTGGTGG TGGGTCCGGA AGCTCCTTTA
GTTGACGGTC TGGTGGATGC CTTGGCGAAG GAATGTCCGG GTACCATGGC GTTTGGTCCG
ACACAAGCGG CGGCAGAACT GGAAGCTTCC AAAGCGTTTT CCAAAGACTT TCTGCAGGAG
CACGGTATTC CCACCGCAAA GTATCGCAAC TTTACGGACG TATCTGAAGC AATCGCCTAC
GTGGAAAGTT TGGATGAGTC AGATCGACAG GTCGTGAAGG CGTCGGGTCT CGCAGCCGGA
AAAGGGGTCT TACTGCCAAC GAACAAAGCC GACACCATTG CGGCTGTCAA GGAAATAATG
TCCGACAAGG CTTTCGGCAA TGCCGGTGAT ATCTGTGTCA TTGAATCCTT CCTGGTAGGA
CCCGAAGCTT CCTGCTTGGC ATTTTGCGAC GGAAAAACTG CTCGTCTCAT GCCAGCGGCT
CAGGACCACA AACGAGCCCT CGACGACGAT CAAGGTTTGA ACACCGGAGG GATGGGAGCC
TACGCACCAG CACCGTGCGT CACTCCCGTA TTACAGCGTA CCATTGAGGA AATGTGTATC
AAGACGGTCC AGAAAATGGC AGAGCGTGGT ACACCGTACG TTGGAGTGTT GTACGCGGGT
ATGATGCTGA CGCCGAATGG CCCGTACGTT CTTGAATTTA ACTGCCGGTT TGGGGACCCC
GAGACTCAGG TTGTCTTACC GTTACTCGAA ACGGATCTGT ACGAGATCCT GACGGCGTGC
TGTTCGGGAA ACCTGGATGC GATAGATGTA CGTTTCAAGG AAGGTCAATC GGCGGCAACA
GTTGTTTGTG CTGCCTTGGG ATACCCTGAG GTATATCCAA AGGGTATGGA AATAACCGGT
TTGGATGCCG CAAATTCTTC AAATGGGGTC AAAGTTTATC ACGCGGGCAC GGATGTAGAC
AACGCCGGCG TCACGCGTTG TTCGGGTGGT CGTGTCTTGG CCATAACGGG TACCGGTAGC
AGTCTTAAGA ATGCACTTCA GTCAGCTTAT AACGGTGTCA AAAGTATTCA GTTCATCGAT
GTACATGGCA AACATCAGTT GCACCGACGA ACAGATATTG GCAAGAAGGC AACGCAAAAG
AACCTCCGAA TTGGGGTGCT GGGTTCAACT CGCGGAACAG CTCTGATTCC CGTCGTTGAA
GCGTGCCGGA GTGGAGAGCT CGACGCCGAA ATTGTGGCAT TGATCAGTAA CAAGTCCTCA
GCCCCTATTC TTGAAAAAGG GAGAGCTCTT GGCGTAACAG TTCTGTCAAA ATTCATTTCT
GCGAAGGACT TGAGCCGCGA GCAGTACGAT TCTGAGTGCA CCGCTGCTTT GGTGGCCGCT
GGTGTGGACT TTGTTTTGCT TGTAGGATAC ATGCGAATTT TGTCCAAGTC GTTCACAGAC
TTTTGGAAAA ACCGATGCAT CAACGTGCAC CCGTCTCTTC TCCCTAAACA CGCCGGCGGT
ATGGACCTTG CAGTGCATCA AGCCGTCATT AACGCAAAAG AAACAGAAAG TGGTTGTACG
ATTCATCAGG TAACGGAAGC CGTTGATGGC GGCCCTATTG TCATACAGAA GAGAGTATTG
GTAGATAGTG GAGACACTGC AGAATCGTTA AAAGTCAAAG TGCAATTGCA AGAGGGACCA
GCGTTTGTTG AGGCAATCAA GCAGTTCTCC CAAGGTGCTA CTATAAGTTA TGCCGATGCG
GGAGTTAGCA TTGACGCCGG CAATAAGTTT GTGGACTTGA TAAAACCACT TTGTAAGGCC
ACTCGTCGTG CAGGATGTGA CGCTGATCTC GGCGGCTTTG GCGGACTGTT TGATCTAGCA
GCGGCTGGCT ATGATTCGGC CAATACAGTC ATTATCGGTG CCACAGATGG TGTCGGTACG
AAACTGCGCA TTGCACAAGC AACCGGGAAA CACGAGACAA TTGGCGTTGA TCTCGTTGCG
ATGTGCGTCA ACGATTTGAT TGTAGCCGGT GGCGAGCCAT TGTTCTTTCT AGACTACTTT
GCCACTGGGC ATCTAGACGT CCACGAGGCA GCTGCGGTTG TAAAAGGGAT TGCCGAAGGC
TGCCAACAAG CTCAATGTGG ACTCATAGGC GGAGAAACTG CAGAGATGCC ATCCATGTAC
GCCCCTGGTG ACTACGATGT TGCAGGCTTT GCCGTTGGCG CTGTTCCTCG TGATAAAATT
CTTCCCTGTA GTATTTCCTC CGGAGATGTG TTGTTGGGCC TTGCAAGCAG TGGCATTCAC
AGCAATGGCT TCAGCTTGGT ACGGAAGCTC ATCGAAAAAG AGGGGCTAAA CTATTCAAGT
CTATGCCCTT GGGAAGAATC TGGTGTTACG ATTGGAGATT CGCTCTTGAC GCCCACTAAA
ATTTACGTTA GGTCATGCCT TCCCATGATC AAAAACGGAC TGCTGAAAGG CCTGGCTCAT
ATCACGGGAG GCGGTCTTTT GGAGAACCTT CCTCGAAGCC TTCCGTCTGG TGTTTCCGCC
GAAATTACTG CGCATCCAAA ACTACCTCCT GTGTTCAAAT GGATGAAAAA AGCTAGTGGT
TTGTCGGATA CGGAGATGCT CCGTACCTTT AATTGCGGAA TTGGAATGGT TCTCATCCTT
TCTCAGGAGA ATGTTGGCGA GGCAAGAGAT CTGCTTACCG CAAGCGGAGA AACAGATTTT
TTCGAGTTGG GTGTTTTGGT GGAAGGAGTG GGAGAAGTCG TTATGAAAAC TACTCTTACT
TAG
 
Protein sequence
MSRSCADFVR TASSWKDRIQ NRNLIWKFFT GQDPSIFSLG RRRFGVSPSF CYAPCAAVSF 
TEADTSHPPP TFTSANCLLE FVRVCLTRMS NGMATATALG FGGRAPALVL DLLSRAVHEE
GIVTVSPRRS QALPLALMEA PFRTFLSNDN NNNKTESGGG KSPLDLLVYC LPTSVTEYLN
AAQEATLVRS KLLYGSSDRQ IVVVIYDVHK NAFVGGETAQ ILDAVGGNNS YRQKLAMASL
ASLAELDQKA AAQLGPDRQQ DSVIVVGAGG REHALAVALA QSPLVGRVLC CPGNGGTAVE
GGKIANVPNG QQDNESVVAL VKETNAAMVV VGPEAPLVDG LVDALAKECP GTMAFGPTQA
AAELEASKAF SKDFLQEHGI PTAKYRNFTD VSEAIAYVES LDESDRQVVK ASGLAAGKGV
LLPTNKADTI AAVKEIMSDK AFGNAGDICV IESFLVGPEA SCLAFCDGKT ARLMPAAQDH
KRALDDDQGL NTGGMGAYAP APCVTPVLQR TIEEMCIKTV QKMAERGTPY VGVLYAGMML
TPNGPYVLEF NCRFGDPETQ VVLPLLETDL YEILTACCSG NLDAIDVRFK EGQSAATVVC
AALGYPEVYP KGMEITGLDA ANSSNGVKVY HAGTDVDNAG VTRCSGGRVL AITGTGSSLK
NALQSAYNGV KSIQFIDVHG KHQLHRRTDI GKKATQKNLR IGVLGSTRGT ALIPVVEACR
SGELDAEIVA LISNKSSAPI LEKGRALGVT VLSKFISAKD LSREQYDSEC TAALVAAGVD
FVLLVGYMRI LSKSFTDFWK NRCINVHPSL LPKHAGGMDL AVHQAVINAK ETESGCTIHQ
VTEAVDGGPI VIQKRVLVDS GDTAESLKVK VQLQEGPAFV EAIKQFSQGA TISYADAGVS
IDAGNKFVDL IKPLCKATRR AGCDADLGGF GGLFDLAAAG YDSANTVIIG ATDGVGTKLR
IAQATGKHET IGVDLVAMCV NDLIVAGGEP LFFLDYFATG HLDVHEAAAV VKGIAEGCQQ
AQCGLIGGET AEMPSMYAPG DYDVAGFAVG AVPRDKILPC SISSGDVLLG LASSGIHSNG
FSLVRKLIEK EGLNYSSLCP WEESGVTIGD SLLTPTKIYV RSCLPMIKNG LLKGLAHITG
GGLLENLPRS LPSGVSAEIT AHPKLPPVFK WMKKASGLSD TEMLRTFNCG IGMVLILSQE
NVGEARDLLT ASGETDFFEL GVLVEGVGEV VMKTTLT