Gene PHATRDRAFT_45536 details


Gene Information

Locus tag: PHATRDRAFT_45536
Symbol:
ID: 7200729
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011675
Strand:
Start bp: 490993
End bp: 494195
Gene Length: 3203 bp
Protein Length: 839 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002179657
Protein GI: 219117735
COG category:
COG ID:
TIGRFAM ID:
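
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (which matches the reported gene length) and that the protein length excludes the stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 490993
END_BP = 494195
PROTEIN_LENGTH_AA = 839


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 3203, matching "Gene Length"
print(cds_len)             # 2520 nt of coding sequence including the stop codon
print(gene_len - cds_len)  # 683 nt that do not encode protein
```

The 683 nt difference is consistent with the record flagging the gene as spliced, since the genomic span would then include non-coding sequence such as introns or flanking untranslated regions. The record itself does not give the breakdown.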


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.607338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACAAGCGT CTCACAGTCA GTCCTGTTAG AGGACACCTA TAAGGACTCC TGCGTCGATG 
GTCGGCACGT CAGCTTTATG CCATCACTAG ATTAAACCTA CTTCAAATCG TATAACGATA
AGGATGGATG ACGGTACGCT AGCCTTACGA CAAGAATCCA AACAGAATAT GCCCTTATTG
AGGTCAATAG CGTGGGTATC TTCGAGCACG CCATTTTCGC TTTGTTCCTT GATCGTTCTA
TTCGCAGTGT CATTGACTTT GGACAATACA GAGGCCTTTG TATTGCCAGA CCATATTCCC
CACCCACCGA GTTCTCAATC GTTATTGTTT GCCTCGCAAG AGATTGAGCA CTCAAGAAGA
ACATGGGCTC AGCGCAAATC ACCTCGTGAG AAAGTGACCC TGATTCCGGT CGATACCAAA
TGGTCTGGTG GTAACAAACT CGAAAAAACT GGCCACAGTA GCCGAGCTGG ATCCCCGTCT
AGATTCGACC AGCTTCCGTC ATCATCAACA ACAACAATAG TATCATCACA AAGGCGCCAA
AAAACGAAAC CCATGCCTGT GACCGGGTAT GATGCTCAAT CCATCGAGGT ACACTATGAC
CGACGACCTT TGGAGGTTGG TTGGCGCTTA AACTCACTCG GTATTCCACT GCTGGGTACG
TCGTTCATTA TTGTGATCGT GGAATGTTGC CTTCATGGTC GAAGAATTCT GCTCAAACTC
ACCTTCGTAT TCGCTCTCAT ATCTAGGTTG GTACATGCGA TTATTGCTGG ATAGAGCAAT
GGGTCTAGAC GGTGATGAAA ATGTGCAGCG GAAGCGCGGA CAGGAGCTGC GCGAGCATCT
GATTCGCTCC AAATCAGTGG CTCTCATCAA GTCGGGGCAG GCGGCCTCAC TCCGACCTGA
CTTGATTCAG AATCGTTTTT GGGCTGAAGA ACTCGGTAAA CTTGTAGATG CAGTTGGATC
ATTTTCGGAT TTACAGGCTA TGAAAATTAT GCGTAATGAA TTGCGTGATA TAAGGCCGCG
TTTAGATGTG ACCCGAACAT CCTGGCAAAC CGCATCAAGA GCTCGTCGGA GGAAAGGTCG
TATGAATAGG GTTGAAAAAA TGGTAGAGGC GGATGATGTT CTGAACTTGT TTGAATTTTA
TAACGAGAAT CTAGCGGTGG CGTCTGCATC CATTGGGCAA GTGTATAAGG CTCGAATCAG
AAGTGGGCCT CAATTGGAAG CTGCGATAGG GCCTGAGCAA GCTGCTAAGT GGGGAGGGAA
GGTTGTGGCT ATTAAAGTAC AGCGTCCAGA TGTGGAAGCT TCGGCTTCTT TAGATATGTA
CCTACTGCGA CGAACAGCAA TGTGGCTCAG TAAAATACGA GGAGGCGACC TACCGAAAGT
TGCGGACTGC TTCGGAATGC AGCTCTTTGG AGAACTGGAC TATGTTCGGG AAGCCAACAA
CTGTGACAGG TTTCGAGAAC TGTACGAAGG TTGGAGCGAC ATAAAAGTAC CCGCTGCTTG
CTCGGCGTTT ACCCGAAGAC GGGTCCTAGT AATGGAATGG ATAGACGGGG TAAAGGGGCC
CTGGGACGGG CAAAGGGGTA TCGATATGGT TCGCATTGGG TTGCGGTGCT CGGTAGATCA
GCTCATGACG ACTGGGCTTT TTCATGCCGA TCCCCATCGC GGCAATATGC TGAGCACTCC
GGATGGGCGG CTGGCCTTGA TAGACTTTGG GATGATGGCA GACATAGATG AGAAGGATCG
ATACGGGCTC TTTGGTCTAG TGATTGGTCT GCAGAACAAG GACTTGGCCC TTGTCACCGA
AAATCTGTTG GAGGTAAGAA ATACAAAGAG TTTTGGAAGC TGTTGTAGTC ACAAGGGCGT
TTCTCACCAA CCTATTGCTA TAGTTGGGGT TCTTGAAAGA TACGACCCAG ATTGATCAAC
TAATTCCTCG GTTGAGGGCT GCTCTAATGA ATGCCACAGG CGGTAGTGGA AAAGCCTCGG
ACGTAAACTT TGCGCGTCTT CAAGCAGAGC TGGATGATAT TAGTCGAGAG AATGTTCTGC
GCTTCTCAAC GCCCCCCTTT TTCACGGTGA TCATTCGAAG CTTGACCATT CTCGAGGGCG
TCGCGTTAAG TGTTGACCCT GCATTCAAGC TTGTTCGTGG AGCTTATCCG TATGTCCTGC
GACAGCTACT TTCTCCCGAG GATCAAGTCC GTATGCCAGC AGCGTTGCAG AAACTTTTGA
AACGCCTGCT CACCGTCAAC GGGGAAGAAC GCGAGATAGA TTGGGAGCGC TTACGTGATT
TTCTTAGACT CGCTCAAAAG GCGGCAAGGA AATACGACCC CTCAATGAGT GAAGTAGACG
ACAAAGCGTC GCTTTCTCGG CAGACGATTG AACTGTTTGT CCAATTTTTG ACCAGTAGAG
CAGGTTTGTG GCAGAGATCC ATCGTCTTAT CTGTGTTGAC ATACATTCTT TTTAACTCAT
CAGTTTGATT CATTTGCAGG CATCTTCCTG AAGAAGCCCT TGGTCCATGA ACTCGCCGAA
GCTATTGATG GCATGGCCAG TATTGGCGAA GGCAACTTGT ACCGCATGTC TCGGGGGCTG
TTACCCGCTC TACCTGGTAT GAACGGACCC GTGAATTCTC GCCGTATGGA TGAGATCTCC
ATGATGCTGA ATACCTTTGA AGATGCGCTT GTGATGGAGA ATAACGACGG GGGTAGCCGC
GCCCGAATGG AAGCTATTAT GGAGCTCTTC CGGGAAGTTT CCGCCGCGCT GGGGGATGAA
CGGCTGCGTC AAGATGCTGG CCCGTTGTTG GTAGAACTAC AATCGGTGAT CCAGATGGTT
GCTGTCGAAG TGCTGGAGAT TCGTGGGTCT CGAGCTATGC GATCCATCCT CCGCGTCTAA
CTTACATTTT AAAATTGCTA GAGCTATTGT CATGATGACA CATTTATCCA AGACCTCATT
ACCGACTCTC CCTCGGCTGC TCCTGAGCAT ATACGTGGGC ATACGACACG TATGGCTGGA
TTGCATCAGG AACATCCTTT GGTGCCGCCT CCGGAGCTAG CGACCTTAAT TCGTCCCTTG
ACTCCAGAAC GAAGAACCCG ATGTCTGTGT TAGGTAACAA AGACGGAACC CCTCCTCCGA
CTACCACGGT TTCCCACTAG TTTTCTTCAA AAGTTTCGCA ACTCGTGCAA TAGAAGCTTT
TTTCCTTTCC TTTTGGTACG TTC
 
Protein sequence
MDDGTLALRQ ESKQNMPLLR SIAWVSSSTP FSLCSLIVLF AVSLTLDNTE AFVLPDHIPH 
PPSSQSLLFA SQEIEHSRRT WAQRKSPREK VTLIPVDTKW SGGNKLEKTG HSSRAGSPSR
FDQLPSSSTT TIVSSQRRQK TKPMPVTGYD AQSIEVHYDR RPLEVGWRLN SLGIPLLGWY
MRLLLDRAMG LDGDENVQRK RGQELREHLI RSKSVALIKS GQAASLRPDL IQNRFWAEEL
GKLVDAVGSF SDLQAMKIMR NELRDIRPRL DVTRTSWQTA SRARRRKGRM NRVEKMVEAD
DVLNLFEFYN ENLAVASASI GQVYKARIRS GPQLEAAIGP EQAAKWGGKV VAIKVQRPDV
EASASLDMYL LRRTAMWLSK IRGGDLPKVA DCFGMQLFGE LDYVREANNC DRFRELYEGW
SDIKVPAACS AFTRRRVLVM EWIDGVKGPW DGQRGIDMVR IGLRCSVDQL MTTGLFHADP
HRGNMLSTPD GRLALIDFGM MADIDEKDRY GLFGLVIGLQ NKDLALVTEN LLELGFLKDT
TQIDQLIPRL RAALMNATGG SGKASDVNFA RLQAELDDIS RENVLRFSTP PFFTVIIRSL
TILEGVALSV DPAFKLVRGA YPYVLRQLLS PEDQVRMPAA LQKLLKRLLT VNGEEREIDW
ERLRDFLRLA QKAARKYDPS MSEVDDKASL SRQTIELFVQ FLTSRAGIFL KKPLVHELAE
AIDGMASIGE GNLYRMSRGL LPALPGMNGP VNSRRMDEIS MMLNTFEDAL VMENNDGGSR
ARMEAIMELF REVSAALGDE RLRQDAGPLL VELQSVIQMV AVEVLEIRGS RAMRSILRV
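
Both listings above are printed in blocks of 10 characters, 60 per line, so the spaces and line breaks have to be removed before the sequences can be used. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names are hypothetical, and only the first and last lines of the gene sequence are included as sample input. If the listings are complete, the cleaned gene sequence should be 3203 bases long and the protein 839 residues long, as stated in the Gene Information fields.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence listing, copied from above.
first_line = "CAACAAGCGT CTCACAGTCA GTCCTGTTAG AGGACACCTA TAAGGACTCC TGCGTCGATG"
last_line = "TTTCCTTTCC TTTTGGTACG TTC"

print(len(clean_sequence(first_line)))  # 60
print(len(clean_sequence(last_line)))   # 23
```

Passing the full gene listing to `gc_percent` gives a value that can be compared with the reported GC content of 49%. The page does not say how that figure was rounded.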